1D CNN Architectures for Music Genre Classification

05/15/2021
by   Safaa Allamy, et al.
0

This paper proposes a 1D residual convolutional neural network (CNN) architecture for music genre classification and compares it with other recent 1D CNN architectures. The 1D CNNs learn a representation and a discriminant directly from the raw audio signal. Several convolutional layers capture the time-frequency characteristics of the audio signal and learn various filters relevant to the music genre recognition task. The proposed approach splits the audio signal into overlapped segments using a sliding window to comply with the fixed-length input constraint of the 1D CNNs. As a result, music genre classification can be carried out on a single audio segment or on the aggregation of the predictions on several audio segments, which improves the final accuracy. The performance of the proposed 1D residual CNN is assessed on a public dataset of 1,000 audio clips. The experimental results have shown that it achieves 80.93 other 1D CNN architectures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/18/2019

End-to-End Environmental Sound Classification using a 1D Convolutional Neural Network

In this paper, we present an end-to-end approach for environmental sound...
research
02/27/2018

Convolutional Neural Network Achieves Human-level Accuracy in Music Genre Classification

Music genre classification is one example of content-based analysis of m...
research
07/23/2018

Auto-adaptive Resonance Equalization using Dilated Residual Networks

In music and audio production, attenuation of spectral resonances is an ...
research
12/01/2018

SwishNet: A Fast Convolutional Neural Network for Speech, Music and Noise Classification and Segmentation

Speech, Music and Noise classification/segmentation is an important prep...
research
06/17/2014

Automatic Fado Music Classification

In late 2011, Fado was elevated to the oral and intangible heritage of h...
research
02/14/2020

Acoustic Scene Classification Using Bilinear Pooling on Time-liked and Frequency-liked Convolution Neural Network

The current methodology in tackling Acoustic Scene Classification (ASC) ...
research
06/19/2017

Kapre: On-GPU Audio Preprocessing Layers for a Quick Implementation of Deep Neural Network Models with Keras

We introduce Kapre, Keras layers for audio and music signal preprocessin...

Please sign up or login with your details

Forgot password? Click here to reset