Similar Items: Leveraging broadcast media subtitle transcripts for automatic speech recognition and subtitling
- Speech recognition in adverse conditions: synthetic transformations and real environmental noise
- Editorial: a new chapter for the Journal of Audio, Speech, and Music Processing
- Learning-based a posteriori speech presence probability estimation and applications
- Construction of music emotion recognition and classification model supported by neural networks
- Detection of actionable domain shifts in speech enhancement systems by tracking prediction uncertainty
- Multi-resolution spectrogram based multi-branch hybrid attention network for music emotion recognition