Multilingual intelligent assistants, such as ChatGPT, have recently gain...
With the extensive accumulation of conversational data on the Internet,
...
End-to-end generation-based approaches have been investigated and applie...
Very deep models for speaker recognition (SR) have demonstrated remarkab...
Probabilistic linear discriminant analysis (PLDA) is commonly used in sp...
Event detection (ED) identifies and classifies event triggers from
unstr...
The CTC model has been widely applied to many application scenarios beca...
Though widely used in industry, traditional task-oriented dialogue syste...
State-of-art speaker verification (SV) systems use a back-end model to s...
Utilizing text-only data with an external language model (LM) in end-to-...
History and future contextual information are known to be important for
...
Data-driven methods have achieved notable performance on intent detectio...
Pre-trained language models have achieved noticeable performance on the
...
Data efficient voice cloning aims at synthesizing target speaker's voice...
Speaker verification can be formulated as a representation learning task...