Audio Classification of Bit-Representation Waveform

04/08/2019
by   Masaki Okawa, et al.
0

This paper investigates waveform representation for audio signal classification. Recently, many studies on audio waveform classification such as acoustic event detection and music genre classification have been increasing. Most studies on audio waveform classification proposed to use a deep learning (neural network) framework. Generally, a frequency analysis method like the Fourier transform is applied to extract frequency or spectral information of the input audio waveform before inputting the raw audio waveform into a neural network. As against to these previous studies, in this paper, we propose a novel waveform representation method, in which audio waveforms are represented as bit-sequence, for audio classification. In our experiment, we compare the proposed bit-representation waveform, which is directly given to a neural network, to other representation of audio waveforms such as raw audio waveform and power spectrum on two classification tasks: one is an acoustic event classification task, the other is a sound/music classification task. The experimental results showed that the bit-representation waveform got the best classification performances on both the tasks.

READ FULL TEXT
research
04/09/2020

Music Artist Classification with WaveNet Classifier for Raw Waveform Audio Data

Models for music artist classification usually were operated in the freq...
research
12/08/2017

Representations of Sound in Deep Learning of Audio Features from Music

The work of a single musician, group or composer can vary widely in term...
research
04/27/2023

XAI-based Comparison of Input Representations for Audio Event Classification

Deep neural networks are a promising tool for Audio Event Classification...
research
12/14/2017

DLR : Toward a deep learned rhythmic representation for music content analysis

In the use of deep neural networks, it is crucial to provide appropriate...
research
04/28/2021

AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries

This paper proposes a neural network that performs audio transformations...
research
02/22/2020

Multi-Representation Knowledge Distillation For Audio Classification

As an important component of multimedia analysis tasks, audio classifica...
research
05/03/2022

Frequency Domain-Based Detection of Generated Audio

Attackers may manipulate audio with the intent of presenting falsified r...

Please sign up or login with your details

Forgot password? Click here to reset