Content Based Singing Voice Extraction From a Musical Mixture

02/12/2020
by   Pritish Chandna, et al.
0

We present a deep learning based methodology for extracting the singing voice signal from a musical mixture based on the underlying linguistic content. Our model follows an encoder decoder architecture and takes as input the magnitude component of the spectrogram of a musical mixture with vocals. The encoder part of the model is trained via knowledge distillation using a teacher network to learn a content embedding, which is decoded to generate the corresponding vocoder features. Using this methodology, we are able to extract the unprocessed raw vocal signal from the mixture even for a processed mixture dataset with singers not seen during training. While the nature of our system makes it incongruous with traditional objective evaluation metrics, we use subjective evaluation via listening tests to compare the methodology to state-of-the-art deep learning based source separation algorithms. We also provide sound examples and source code for reproducibility.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/18/2019

A Vocoder Based Method For Singing Voice Extraction

This paper presents a novel method for extracting the vocal track from a...
research
02/01/2018

MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation

Monaural singing voice separation task focuses on the prediction of the ...
research
12/02/2019

Investigating Deep Neural Transformations for Spectrogram-based Musical Source Separation

Musical Source Separation (MSS) is a signal processing task that tries t...
research
08/17/2020

Deep Learning Based Source Separation Applied To Choir Ensembles

Choral singing is a widely practiced form of ensemble singing wherein a ...
research
04/12/2019

Examining the Mapping Functions of Denoising Autoencoders in Music Source Separation

The goal of this work is to investigate what music source separation app...
research
09/21/2020

A Deep Learning Based Analysis-Synthesis Framework For Unison Singing

Unison singing is the name given to an ensemble of singers simultaneousl...
research
04/17/2015

Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network

Identification and extraction of singing voice from within musical mixtu...

Please sign up or login with your details

Forgot password? Click here to reset