Conformer models maintain a large number of internal states, the vast
ma...
We introduce the Universal Speech Model (USM), a single large model that...
Word-piece models (WPMs) are commonly used subword units in state-of-the...
While large language models (LLM) have made impressive progress in natur...
In this work, we propose a new parameter-efficient learning framework ba...
Text-only adaptation of a transducer model remains challenging for end-t...
We present JOIST, an algorithm to train a streaming, cascaded, encoder
e...
Language identification is critical for many downstream tasks in automat...
On-device end-to-end (E2E) models have shown improvements over a convent...
In voice-enabled applications, a predetermined hotword isusually used to...
While a streaming voice assistant system has been used in many applicati...
Text-only and semi-supervised training based on audio-only data has gain...
Language models (LMs) significantly improve the recognition accuracy of
...
In this paper, we propose a dynamic cascaded encoder Automatic Speech
Re...
State-of-the-art automatic speech recognition (ASR) systems are trained ...
Language model fusion helps smart assistants recognize words which are r...
End-to-end models have achieved state-of-the-art results on several auto...
Fast contextual adaptation has shown to be effective in improving Automa...
Self- and semi-supervised learning methods have been actively investigat...
Streaming end-to-end speech recognition models have been widely applied ...
We introduce Lookup-Table Language Models (LookupLM), a method for scali...
Interactive speech recognition systems must generate words quickly while...
End-to-end models that condition the output label sequence on all previo...
End-to-end (E2E) models have shown to outperform state-of-the-art
conven...
End-to-end (E2E) automatic speech recognition (ASR) models, by now, have...
For various speech-related tasks, confidence scores from a speech recogn...
Thus far, end-to-end (E2E) models have not been shown to outperform
stat...
All-neural end-to-end (E2E) automatic speech recognition (ASR) systems t...
The requirements for many applications of state-of-the-art speech recogn...
Current state-of-the-art automatic speech recognition systems are traine...