Recent advances in neural text-to-speech (TTS) models bring thousands of...
In this paper, we present ZeroPrompt (Figure 1-(a)) and the correspondin...
Recently, the unified streaming and non-streaming two-pass (U2/U2++)
end...
In this paper, we present TrimTail, a simple but effective emission
regu...
The recently proposed Conformer architecture which combines convolution ...
Recently, we made available WeNet, a production-oriented end-to-end spee...
In this paper, we present WenetSpeech, a multi-domain Mandarin corpus
co...
The unified streaming and non-streaming two-pass (U2) end-to-end model f...
In this paper, we present a new open source, production first and produc...