Speech transcription, emotion recognition, and language identification a...
Recent innovations in self-supervised representation learning have led t...
Self-supervised pre-trained features have consistently delivered state-o...
Large speech emotion recognition datasets are hard to obtain, and small
...
With the rise of voice-activated applications, the need for speaker
reco...