Tom Bagby

research ∙ 04/25/2023

LAST: Scalable Lattice-Based Speech Modelling in JAX

We introduce LAST, a LAttice-based Speech Transducer library in JAX. Wit...

Ke Wu, et al.

research ∙ 12/06/2022

Learning the joint distribution of two sequences using little or no paired data

We present a noisy channel generative model of two sequences, for exampl...

Soroosh Mariooryad, et al.

research ∙ 11/07/2021

Speaker Generation

This work explores the task of synthesizing speech in nonexistent human-...

Daisy Stanton, et al.

research ∙ 10/15/2020

Non-saturating GAN training as divergence minimization

Non-saturating generative adversarial network (GAN) training is widely u...

Matt Shannon, et al.

research ∙ 10/23/2019

Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Despite the ability to produce human-level speech for in-domain text, at...

Eric Battenberg, et al.

research ∙ 10/03/2019

Semi-Supervised Generative Modeling for Controllable Speech Synthesis

We present a novel generative model that combines state-of-the-art neura...

Raza Habib, et al.

research ∙ 06/08/2019

Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis

Recent work has explored sequence-to-sequence latent variable models for...

Eric Battenberg, et al.

research ∙ 06/05/2019

Complex Evolution Recurrent Neural Networks (ceRNNs)

Unitary Evolution Recurrent Neural Networks (uRNNs) have three attractiv...

Izhak Shafran, et al.

research ∙ 11/15/2018

Streaming End-to-end Speech Recognition For Mobile Devices

End-to-end (E2E) models, which directly predict output character sequenc...

Yanzhang He, et al.
