With the development of deep learning, automatic speech recognition (ASR...
Text language models have shown remarkable zero-shot capability in gener...
Non-autoregressive automatic speech recognition (ASR) modeling has recei...
In this paper, inspired by the successes of vision-language pre-trained m...