Unsupervised cross-lingual speech representation learning (XLSR) has rec...
Recurrent neural transducer (RNN-T) is a promising end-to-end (E2E) mode...
In this work we propose an inference technique, asynchronous revision, t...
End-to-end (E2E) systems have played a more and more important role in
a...