Multi-Resolution Fully Convolutional Neural Networks for Monaural Audio Source Separation

10/28/2017
by Emad M. Grais, et al.

In deep neural networks with convolutional layers, each layer typically has a fixed-size, single-resolution receptive field (RF). Convolutional layers with a large RF capture global information from the input features, while layers with a small RF capture local details at high resolution. In this work, we introduce novel deep multi-resolution fully convolutional neural networks (MR-FCNN), in which each layer has different RF sizes so that it extracts multi-resolution features capturing both the global and the local details of its input features. The proposed MR-FCNN is applied to separate a target audio source from a mixture of many audio sources. Experimental results show that MR-FCNN improves performance on the audio source separation problem compared to feedforward deep neural networks (DNNs) and single-resolution deep fully convolutional neural networks (FCNNs).
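The idea of a layer with several RF sizes can be illustrated with a minimal sketch. This is not the authors' implementation: the kernel sizes (3, 5, 7), the averaging filters standing in for learned weights, and the helper names `conv2d_same`, `box_kernel`, and `multi_resolution_layer` are all assumptions made for illustration. The sketch applies one filter per RF size to the same input and stacks the resulting feature maps.

```python
# Illustrative sketch only: kernel sizes and filter values are assumptions,
# not the configuration used in the paper.

def conv2d_same(image, kernel):
    """2D cross-correlation with zero padding so the output matches the input size."""
    rows, cols = len(image), len(image[0])
    k_rows, k_cols = len(kernel), len(kernel[0])
    pad_r, pad_c = k_rows // 2, k_cols // 2
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            acc = 0.0
            for i in range(k_rows):
                for j in range(k_cols):
                    rr, cc = r + i - pad_r, c + j - pad_c
                    # Positions outside the input are treated as zeros.
                    if 0 <= rr < rows and 0 <= cc < cols:
                        acc += image[rr][cc] * kernel[i][j]
            out[r][c] = acc
    return out


def box_kernel(size):
    """Hypothetical stand-in for a learned filter: a size x size averaging kernel."""
    weight = 1.0 / (size * size)
    return [[weight] * size for _ in range(size)]


def multi_resolution_layer(image, kernel_sizes=(3, 5, 7)):
    """Apply one filter per RF size and stack the feature maps (channels first)."""
    return [conv2d_same(image, box_kernel(k)) for k in kernel_sizes]


# Toy "spectrogram": 8 frequency bins x 8 time frames with a single active bin.
spectrogram = [[0.0] * 8 for _ in range(8)]
spectrogram[4][4] = 1.0

features = multi_resolution_layer(spectrogram)
```

In this toy input, the 3x3 map responds only in the immediate neighbourhood of the active bin, while the 5x5 and 7x7 maps spread the same energy over progressively wider regions. A trained network would learn the filter weights and would typically use many filters per RF size.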

research
03/02/2018

Raw Multi-Channel Audio Source Separation using Multi-Resolution Convolutional Auto-Encoders

Supervised multi-channel audio source separation requires extracting use...
research
06/14/2023

WavPool: A New Block for Deep Neural Networks

Modern deep neural networks comprise many operational layers, such as de...
research
04/15/2019

Learning Spatiotemporal Features of Ride-sourcing Services with Fusion Convolutional Network

In order to collectively forecast the demand of ride-sourcing services i...
research
06/16/2020

Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks

In this paper, we present a new sound source DOA estimation and tracking...
research
02/02/2021

Size Matters

Fully convolutional neural networks can process input of arbitrary size ...
research
09/25/2020

DeepControl: 2D RF pulses facilitating B_1^+ inhomogeneity and B_0 off-resonance compensation in vivo at 7T

Purpose: Rapid 2D RF pulse design with subject specific B_1^+ inhomogene...
research
03/03/2021

Compute and memory efficient universal sound source separation

Recent progress in audio source separation led by deep learning has ena...
