CWS-PResUNet: Music Source Separation with Channel-wise Subband Phase-aware ResUNet

12/09/2021
by   Haohe Liu, et al.
0

Music source separation (MSS) shows active progress with deep learning models in recent years. Many MSS models perform separations on spectrograms by estimating bounded ratio masks and reusing the phases of the mixture. When using convolutional neural networks (CNN), weights are usually shared within a spectrogram during convolution regardless of the different patterns between frequency bands. In this study, we propose a new MSS model, channel-wise subband phase-aware ResUNet (CWS-PResUNet), to decompose signals into subbands and estimate an unbound complex ideal ratio mask (cIRM) for each source. CWS-PResUNet utilizes a channel-wise subband (CWS) feature to limit unnecessary global weights sharing on the spectrogram and reduce computational resource consumptions. The saved computational cost and memory can in turn allow for a larger architecture. On the MUSDB18HQ test set, we propose a 276-layer CWS-PResUNet and achieve state-of-the-art (SoTA) performance on vocals with an 8.92 signal-to-distortion ratio (SDR) score. By combining CWS-PResUNet and Demucs, our ByteMSS system ranks the 2nd on vocals score and 5th on average score in the 2021 ISMIR Music Demixing (MDX) Challenge limited training data track (leaderboard A). Our code and pre-trained models are publicly available at: https://github.com/haoheliu/2021-ISMIR-MSS-Challenge-CWS-PResUNet

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/12/2021

Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation

Deep neural network based methods have been successfully applied to musi...
research
08/12/2020

Channel-wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music

This paper presents a new input format, channel-wise subband input (CWS)...
research
08/02/2023

Music De-limiter Networks via Sample-wise Gain Inversion

The loudness war, an ongoing phenomenon in the music industry characteri...
research
11/28/2021

Transfer Learning with Jukebox for Music Source Separation

In this work, we demonstrate how to adapt a publicly available pre-train...
research
06/27/2023

RMVPE: A Robust Model for Vocal Pitch Estimation in Polyphonic Music

Vocal pitch is an important high-level feature in music audio processing...
research
08/31/2021

Music Demixing Challenge 2021

Music source separation has been intensively studied in the last decade ...
research
11/04/2021

Lipid domain coarsening and fluidity in multicomponent lipid vesicles: A continuum based model and its experimental validation

Liposomes that achieve a heterogeneous and spatially organized surface t...

Please sign up or login with your details

Forgot password? Click here to reset