Music Artist Classification with Convolutional Recurrent Neural Networks

01/14/2019
by Zain Nasrullah, et al.

Previous attempts at music artist classification use frame-level audio features, which summarize frequency content within short intervals of time. In contrast, more recent music information retrieval tasks take advantage of temporal structure in audio spectrograms using deep convolutional and recurrent models. This paper revisits artist classification within this newer framework and empirically explores the impact of incorporating temporal structure in the feature representation. To this end, an established classification architecture, the Convolutional Recurrent Neural Network (CRNN), is applied to the artist20 music artist identification dataset under a comprehensive set of conditions. These include audio clip length, which is a novel contribution of this work, and previously identified considerations such as dataset split and feature level. Our results improve upon baseline works, verify the influence of production details on classification performance, and demonstrate the trade-offs between sample length and training set size. The best-performing model achieves an average F1-score of 0.937 across three independent trials, a substantial improvement over the corresponding baseline under similar conditions. Finally, to showcase the effectiveness of the CRNN's feature extraction capabilities, we visualize audio samples at its bottleneck layer, demonstrating that the learned representations segment into clusters belonging to their respective artists.
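The trade-off between sample length and training set size mentioned in the abstract can be illustrated with a small sketch. It assumes, purely for illustration, that each song's spectrogram is cut into non-overlapping clips of a fixed number of frames and that leftover frames are dropped; the function name `slice_into_clips` and all frame counts are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple


def slice_into_clips(num_frames: int, clip_frames: int) -> List[Tuple[int, int]]:
    """Return (start, end) frame ranges of non-overlapping clips.

    Trailing frames that do not fill a whole clip are dropped
    (an assumption made for this sketch).
    """
    if clip_frames <= 0:
        raise ValueError("clip_frames must be positive")
    n_clips = num_frames // clip_frames
    return [(i * clip_frames, (i + 1) * clip_frames) for i in range(n_clips)]


# Hypothetical numbers, not from the paper: one song whose spectrogram
# has 6000 frames, cut at three illustrative clip lengths.
SONG_FRAMES = 6000
for clip_frames in (100, 500, 1000):
    clips = slice_into_clips(SONG_FRAMES, clip_frames)
    print(f"{clip_frames:>5} frames per clip -> {len(clips)} clips")
```

Under these assumptions, longer clips give the model more temporal context per example but leave fewer examples per song, which is the tension the abstract describes.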


research
09/14/2016

Convolutional Recurrent Neural Networks for Music Classification

We introduce a convolutional recurrent neural network (CRNN) for music t...
research
04/09/2020

Music Artist Classification with WaveNet Classifier for Raw Waveform Audio Data

Models for music artist classification usually were operated in the freq...
research
05/24/2022

Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features

Metaverse is an interactive world that combines reality and virtuality, ...
research
10/28/2020

Large-Scale MIDI-based Composer Classification

Music classification is a task to classify a music piece into labels suc...
research
02/02/2022

TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music

Singing melody extraction is an important problem in the field of music ...
research
10/26/2019

A holistic approach to polyphonic music transcription with neural networks

We present a framework based on neural networks to extract music scores ...
research
08/16/2019

Sub-Spectrogram Segmentation for Environmental Sound Classification via Convolutional Recurrent Neural Network and Score Level Fusion

Environmental Sound Classification (ESC) is an important and challenging...
