State-of-the-art text-to-speech (TTS) systems have utilized pretrained
l...
We present a scalable method to produce high quality emphasis for
text-t...
We present eCat, a novel end-to-end multispeaker model capable of: a)
ge...
Generating expressive and contextually appropriate prosody remains a
cha...
In this paper, we present CopyCat2 (CC2), a novel model capable of: a)
s...
We propose a novel Multi-Scale Spectrogram (MSS) modelling approach to
s...
Many factors influence speech yielding different renditions of a given
s...
In this paper, we introduce Kathaka, a model trained with a novel two-st...
This paper investigates the use of Machine Translation (MT) to bootstrap...