Channels - Leveraging broadcast media subtitle transcripts for automatic speech recognition and subtitling :: FRELIP Discovery

Similar Items: Leveraging broadcast media subtitle transcripts for automatic speech recognition and subtitling

Quick Look
Speech recognition in adverse conditions: synthetic transformations and real environmental noise
Quick Look
Comparative accuracy of AI speech recognition tools for Somali language use
Quick Look
Learning-based a posteriori speech presence probability estimation and applications
Quick Look
Editorial: a new chapter for the Journal of Audio, Speech, and Music Processing
Quick Look
Detection of actionable domain shifts in speech enhancement systems by tracking prediction uncertainty
Quick Look
Construction of music emotion recognition and classification model supported by neural networks
Quick Look
Multi-resolution spectrogram based multi-branch hybrid attention network for music emotion recognition
Quick Look
Learning graph representations from Mel-spectrogram segments for predominant instrument recognition in polyphonic music
Quick Look
A low parameter channel grouped iterative convolutional recurrent network for speech enhancement of noise-reducing headphones
Quick Look
A unified deep learning framework for estimating acoustic context parameters from first order ambisonic speech recordings
Quick Look
Can speed perturbation plus SpecAugment be outperformed by novel combinations of speech data augmentations for ASR? A low-resource evaluation
Quick Look
Characterizing continual learning scenarios and strategies for audio analysis
Quick Look
Hybrid real- and complex-valued neural network architecture
Quick Look
The Hi-Audio online platform for recording and distributing multi-track music datasets
Quick Look
On the influence of head-above-torso orientation on deep-learning-based binaural sound source localization
Quick Look
The trajectoRIR database: room acoustic recordings along a trajectory of moving microphones
Quick Look
PCL-AED: progressive acoustic event detection based on contrastive learning
Quick Look
Interactive optimization of parametric music processing for cochlear implants
Quick Look
Performance and robustness of signal-dependent vs. signal-independent binaural signal matching with wearable microphone arrays
Quick Look
A quarter of a century of polar mesospheric summer echo observations over Andøya: climatology and trends
Quick Look
Comprehensive signal processing approaches for non-contact heartbeat detection using 24 GHz FMCW radar
Quick Look
A review of orthogonal waveforms for spaceborne Multiple-Input Multiple-Output Synthetic Aperture Radar
Quick Look
Concepts for a cost-efficient, additively manufactured WR90 coaxial waveguide transition
Quick Look
Improving spatial control for two listeners via cue-constrained equalization