Automatic Speech Recognition (ASR) models need to be optimized for speci...
Large language models have proven themselves highly flexible, able to so...
State space models (SSMs) have recently shown promising results on small...
Interactive voice assistants have been widely used as input interfaces i...
This paper improves the streaming transformer transducer for speech reco...
Detection of common events and scenes from audio is useful for extractin...
Hybrid automatic speech recognition (ASR) models are typically sequentia...
This paper proposes an efficient memory transformer Emformer for low lat...