PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement

03/04/2022
by   Xiaofeng Ge, et al.
0

PercepNet, a recent extension of the RNNoise, an efficient, high-quality and real-time full-band speech enhancement technique, has shown promising performance in various public deep noise suppression tasks. This paper proposes a new approach, named PercepNet+, to further extend the PercepNet with four significant improvements. First, we introduce a phase-aware structure to leverage the phase information into PercepNet, by adding the complex features and complex subband gains as the deep network input and output respectively. Then, a signal-to-noise ratio (SNR) estimator and an SNR switched post-processing are specially designed to alleviate the over attenuation (OA) that appears in high SNR conditions of the original PercepNet. Moreover, the GRU layer is replaced by TF-GRU to model both temporal and frequency dependencies. Finally, we propose to integrate the loss of complex subband gain, SNR, pitch filtering strength, and an OA loss in a multi-objective learning manner to further improve the speech enhancement performance. Experimental results show that, the proposed PercepNet+ outperforms the original PercepNet significantly in terms of both PESQ and STOI, without increasing the model size too much.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/19/2022

GMM based multi-stage Wiener filtering for low SNR speech enhancement

This paper proposes a single-channel speech enhancement method to reduce...
research
05/29/2020

SNR-based teachers-student technique for speech enhancement

It is very challenging for speech enhancement methods to achieves robust...
research
10/29/2020

UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-noise Ratio Condition

Speech enhancement at extremely low signal-to-noise ratio (SNR) conditio...
research
06/16/2021

DCCRN+: Channel-wise Subband DCCRN with SNR Estimation for Speech Enhancement

Deep complex convolution recurrent network (DCCRN), which extends CRN wi...
research
06/01/2020

Phase-aware Single-stage Speech Denoising and Dereverberation with U-Net

In this work, we tackle a denoising and dereverberation problem with a s...
research
06/14/2021

F-T-LSTM based Complex Network for Joint Acoustic Echo Cancellation and Speech Enhancement

With the increasing demand for audio communication and online conference...
research
09/17/2019

A scalable noisy speech dataset and online subjective test framework

Background noise is a major source of quality impairments in Voice over ...

Please sign up or login with your details

Forgot password? Click here to reset