Speech enhancement guided by contextual articulatory information

11/15/2020
by   Yen-Ju Lu, et al.
0

Previous studies have confirmed the effectiveness of leveraging articulatory information to attain improved speech enhancement (SE) performance. By augmenting the original acoustic features with the place/manner of articulatory features, the SE process can be guided to consider the articulatory properties of the input speech when performing enhancement. Hence, we believe that the contextual information of articulatory attributes should include useful information and can further benefit SE. In this study, we propose an SE system that incorporates contextual articulatory information; such information is obtained using broad phone class (BPC) end-to-end automatic speech recognition (ASR). Meanwhile, two training strategies are developed to train the SE system based on the BPC-based ASR: multitask-learning and deep-feature training strategies. Experimental results on the TIMIT dataset confirm that the contextual articulatory information facilitates an SE system in achieving better results. Moreover, in contrast to another SE system that is trained with monophonic ASR, the BPC-based ASR (providing contextual articulatory information) can improve the SE performance more effectively under different signal-to-noise ratios(SNR).

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

research
08/13/2020

Incorporating Broad Phonetic Information for Speech Enhancement

In noisy conditions, knowing speech contents facilitates listeners to mo...
research
11/01/2021

SNRi Target Training for Joint Speech Enhancement and Recognition

This study aims to improve the performance of automatic speech recogniti...
research
10/28/2020

Improving Perceptual Quality by Phone-Fortified Perceptual Loss for Speech Enhancement

Speech enhancement (SE) aims to improve speech quality and intelligibili...
research
03/22/2005

Semi-automatic vectorization of linear networks on rasterized cartographic maps

A system for semi-automatic vectorization of linear networks (roads, riv...
research
11/10/2018

Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition

Conventional deep neural network (DNN)-based speech enhancement (SE) app...
research
07/04/2021

TENET: A Time-reversal Enhancement Network for Noise-robust ASR

Due to the unprecedented breakthroughs brought about by deep learning, s...
research
10/12/2021

Improving Character Error Rate Is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-box Acoustic Models

A deep neural network (DNN)-based speech enhancement (SE) aiming to maxi...

Please sign up or login with your details

Forgot password? Click here to reset