We present a scalable method to produce high quality emphasis for
text-t...
Generating expressive and contextually appropriate prosody remains a
cha...
This paper presents a novel data augmentation technique for text-to-spee...
We propose a novel Multi-Scale Spectrogram (MSS) modelling approach to
s...
Many factors influence speech yielding different renditions of a given
s...
In this paper, we introduce Kathaka, a model trained with a novel two-st...
Prosody Transfer (PT) is a technique that aims to use the prosody from a...
In many applications of supervised learning, multiple classification or
...
Within machine learning, the supervised learning field aims at modeling ...
In this work, we propose a simple yet effective solution to the problem ...
We adapt the idea of random projections applied to the output space, so ...