Zero-shot text-to-speech (TTS) synthesis aims to clone any unseen speake...
The spontaneous behavior that often occurs in conversations makes speech...
Unsupervised anomaly detection (UAD) attracts a lot of research interest...
Class imbalance is a common challenge in real-world recognition tasks, w...
Expressive speech synthesis is crucial for many human-computer interacti...
Semi-supervised anomaly detection (SSAD) methods have demonstrated their...
Recent advances in text-to-speech have significantly improved the
expres...
Human densepose estimation, aiming at establishing dense correspondences...
Previous works on expressive speech synthesis focus on modelling the
mon...
The accuracy of prosodic structure prediction is crucial to the naturaln...
Previous works on expressive speech synthesis mainly focus on current
se...
While neural networks have been remarkably successful in a wide array of...
Semantic information of a sentence is crucial for improving the
expressi...
Syntactic structure of a sentence text is correlated with the prosodic
s...