b'Guanglu Wan'

research

∙ 09/18/2023

Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter

Multilingual intelligent assistants, such as ChatGPT, have recently gain...

0 Song Li, et al. ∙

research

∙ 06/27/2023

Exploiting Pseudo Future Contexts for Emotion Recognition in Conversations

With the extensive accumulation of conversational data on the Internet, ...

0 Yinyi Wei, et al. ∙

research

∙ 04/03/2023

Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation

End-to-end generation-based approaches have been investigated and applie...

0 Yuncheng Hua, et al. ∙

research

∙ 12/06/2022

Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition

Very deep models for speaker recognition (SR) have demonstrated remarkab...

0 Zhiyuan Peng, et al. ∙

research

∙ 12/06/2022

Covariance Regularization for Probabilistic Linear Discriminant Analysis

Probabilistic linear discriminant analysis (PLDA) is commonly used in sp...

0 Zhiyuan Peng, et al. ∙

research

∙ 11/25/2022

MUSIED: A Benchmark for Event Detection from Multi-Source Heterogeneous Informal Texts

Event detection (ED) identifies and classifies event triggers from unstr...

0 Xiangyu Xi, et al. ∙

research

∙ 11/07/2022

Peak-First CTC: Reducing the Peak Latency of CTC Models by Applying Peak-First Regularization

The CTC model has been widely applied to many application scenarios beca...

0 Zhengkun Tian, et al. ∙

research

∙ 05/13/2022

A Low-Cost, Controllable and Interpretable Task-Oriented Chatbot: With Real-World After-Sale Services as Example

Though widely used in industry, traditional task-oriented dialogue syste...

0 Xiangyu Xi, et al. ∙

research

∙ 04/22/2022

Unifying Cosine and PLDA Back-ends for Speaker Verification

State-of-art speaker verification (SV) systems use a back-end model to s...

0 Zhiyuan Peng, et al. ∙

research

∙ 03/31/2022

An Empirical Study of Language Model Integration for Transducer based Speech Recognition

Utilizing text-only data with an external language model (LM) in end-to-...

0 Huahuan Zheng, et al. ∙

research

∙ 03/31/2022

CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR

History and future contextual information are known to be important for ...

0 Keyu An, et al. ∙

research

∙ 03/17/2022

Confidence Calibration for Intent Detection via Hyperspherical Space and Rebalanced Accuracy-Uncertainty Loss

Data-driven methods have achieved notable performance on intent detectio...

0 Yantao Gong, et al. ∙

research

∙ 08/24/2021

Density-Based Dynamic Curriculum Learning for Intent Detection

Pre-trained language models have achieved noticeable performance on the ...

0 Yantao Gong, et al. ∙

research

∙ 08/10/2020

Data Efficient Voice Cloning from Noisy Samples with Domain Adversarial Training

Data efficient voice cloning aims at synthesizing target speaker's voice...

0 Jian Cong, et al. ∙

research

∙ 01/07/2020

Learning Speaker Embedding with Momentum Contrast

Speaker verification can be formulated as a representation learning task...

0 Ke Ding, et al. ∙

Guanglu Wan

Featured Co-authors