Mapping two modalities, speech and text, into a shared representation sp...
Recently, excellent progress has been made in speech recognition. Howeve...
For text-to-speech (TTS) synthesis, prosodic structure prediction (PSP) ...
The accuracy of prosodic structure prediction is crucial to the naturaln...
Non-parallel data voice conversion (VC) has achieved considerable
break...
Semantic information of a sentence is crucial for improving the
expressi...
The syntactic structure of a sentence is correlated with the prosodic
s...