While the performance of cross-lingual TTS based on monolingual corpora ...
The front-end is a critical component of English text-to-speech (TTS) sy...
Automatic dubbing, which generates a corresponding version of the input ...
Some recent studies have demonstrated the feasibility of single-stage ne...
Although deep learning and end-to-end models have been widely used and s...
Dubbing is a post-production process of re-recording actors' dialogues, ...
With the increasing popularity of speech synthesis products, the industr...
The news ecosystem has become increasingly complex, encompassing a wide ...
Despite the influence that image-based communication has on online disco...
This paper proposes the building of Xiaomingbot, an intelligent, multili...
The continual improvement of 3D sensors has driven the development of al...
With the popularity of deep neural networks, the speech synthesis task has ac...