Monaural source separation: From anechoic to reverberant environments

11/15/2021
by   Tobias Cord-Landwehr, et al.
0

Impressive progress in neural network-based single-channel speech source separation has been made in recent years. But those improvements have been mostly reported on anechoic data, a situation that is hardly met in practice. Taking the SepFormer as a starting point, which achieves state-of-the-art performance on anechoic mixtures, we gradually modify it to optimize its performance on reverberant mixtures. Although this leads to a word error rate improvement by 8 percentage points compared to the standard SepFormer implementation, the system ends up with only marginally better performance than our improved PIT-BLSTM separation system, that is optimized with rather straightforward means. This is surprising and at the same time sobering, challenging the practical usefulness of many improvements reported in recent years for monaural source separation on nonreverberant data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/30/2020

Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation

Time-domain training criteria have proven to be very effective for the s...
research
10/20/2021

REAL-M: Towards Speech Separation on Real Mixtures

In recent years, deep learning based source separation has achieved impr...
research
11/11/2020

Surrogate Source Model Learning for Determined Source Separation

We propose to learn surrogate functions of universal speech priors for d...
research
01/30/2021

Directional Sparse Filtering using Weighted Lehmer Mean for Blind Separation of Unbalanced Speech Mixtures

In blind source separation of speech signals, the inherent imbalance in ...
research
07/27/2023

Complete and separate: Conditional separation with missing target source attribute completion

Recent approaches in source separation leverage semantic information abo...
research
07/24/2022

Source Separation of Unknown Numbers of Single-Channel Underwater Acoustic Signals Based on Autoencoders

The separation of single-channel underwater acoustic signals is a challe...
research
04/02/2019

Unsupervised training of a deep clustering model for multichannel blind source separation

We propose a training scheme to train neural network-based source separa...

Please sign up or login with your details

Forgot password? Click here to reset