Guanglai Gao

research

∙ 05/25/2023

Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion

Audio Deepfake Detection (ADD) aims to detect the fake audio generated b...

0 Rui Liu, et al. ∙

research

∙ 10/27/2022

Explicit Intensity Control for Accented Text-to-speech

Accented text-to-speech (TTS) synthesis seeks to generate speech with an...

0 Rui Liu, et al. ∙

research

∙ 10/27/2022

FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis

Conversational Text-to-Speech (TTS) aims to synthesize an utterance with ...

0 Yifan Hu, et al. ∙

research

∙ 10/27/2022

Exploiting modality-invariant feature for robust multimodal emotion recognition with missing modalities

Multimodal emotion recognition leverages complementary information acros...

0 Haolin Zuo, et al. ∙

research

∙ 09/24/2022

A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion

Cyrillic and Traditional Mongolian are the two main members of the Mongo...

0 Muhan Na, et al. ∙

research

∙ 09/22/2022

MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline

This paper introduces a high-quality open-source text-to-speech (TTS) sy...

0 Yifan Hu, et al. ∙

research

∙ 09/22/2022

Controllable Accented Text-to-Speech Synthesis

Accented text-to-speech (TTS) synthesis seeks to generate speech with an...

0 Rui Liu, et al. ∙

research

∙ 06/15/2022

Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning

Emotion classification of speech and assessment of the emotion strength ...

0 Rui Liu, et al. ∙

research

∙ 03/26/2021

Guided Training: A Simple Method for Single-channel Speaker Separation

Deep learning has shown a great potential for speech separation, especia...

0 Hao Li, et al. ∙

research

∙ 08/11/2020

Modeling Prosodic Phrasing with Multi-Task Learning in Tacotron-based TTS

Tacotron-based end-to-end speech synthesis has shown remarkable voice qu...

0 Rui Liu, et al. ∙

research

∙ 08/04/2020

Expressive TTS Training with Frame and Style Reconstruction Loss

We propose a novel training strategy for Tacotron-based text-to-speech (...

0 Rui Liu, et al. ∙

research

∙ 06/11/2020

An Edge Information and Mask Shrinking Based Image Inpainting Approach

In the image inpainting task, the ability to repair both high-frequency ...

0 Huali Xu, et al. ∙

research

∙ 05/29/2020

SNR-based teachers-student technique for speech enhancement

It is very challenging for speech enhancement methods to achieve robust...

0 Xiang Hao, et al. ∙

research

∙ 05/29/2020

Sub-band Knowledge Distillation Framework for Speech Enhancement

In single-channel speech enhancement, methods based on full-band spectra...

0 Xiang Hao, et al. ∙

research

∙ 02/02/2020

WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss

Tacotron-based text-to-speech (TTS) systems directly synthesize speech f...

2 Rui Liu, et al. ∙

research

∙ 11/07/2019

Teacher-Student Training for Robust Tacotron-based TTS

While neural end-to-end text-to-speech (TTS) is superior to conventional...

0 Rui Liu, et al. ∙

research

∙ 08/28/2017

Integrated Speech Enhancement Method Based on Weighted Prediction Error and DNN for Dereverberation and Denoising

Both reverberation and additive noises degrade the speech quality and in...

0 Hao Li, et al. ∙
