We present a scalable method to produce high-quality emphasis for
text-t...
Generating expressive and contextually appropriate prosody remains a
cha...
In this paper, we present CopyCat2 (CC2), a novel model capable of: a)
s...
This paper presents a novel data augmentation technique for text-to-spee...
We present an SVQ-VAE architecture using a split vector quantizer for NT...
This paper proposes a novel approach for the detection and reconstructio...
Neural text-to-speech synthesis (NTTS) models have shown significant pro...