We introduce LAST, a LAttice-based Speech Transducer library in JAX. Wit...
We present a noisy channel generative model of two sequences, for exampl...
This work explores the task of synthesizing speech in nonexistent
human-...
Non-saturating generative adversarial network (GAN) training is widely u...
Despite the ability to produce human-level speech for in-domain text,
at...
We present a novel generative model that combines state-of-the-art neura...
Recent work has explored sequence-to-sequence latent variable models for...
Unitary Evolution Recurrent Neural Networks (uRNNs) have three attractiv...
End-to-end (E2E) models, which directly predict output character sequenc...