An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances

07/31/2020
by Hu Hu, et al.

In this paper, we propose a sub-utterance unit selection framework that removes acoustic segments carrying little information for acoustic scene classification (ASC) from audio recordings. Our approach is built upon a universal set of acoustic segment units covering the overall acoustic scene space. First, those units are modeled with acoustic segment models (ASMs), which are used to tokenize acoustic scene utterances into sequences of acoustic segment units. Next, paralleling the idea of stop words in information retrieval, stop ASMs are automatically detected. Finally, acoustic segments associated with the stop ASMs are blocked because of their low indexing power in the retrieval of most acoustic scenes. In contrast to building scene models with whole utterances, the ASM-removed sub-utterances, i.e., acoustic utterances without stop acoustic segments, are then used as inputs to the AlexNet-L back-end for final classification. On the DCASE 2018 dataset, scene classification accuracy increases from 68 [...]. This represents a competitive accuracy without any data augmentation or ensemble strategy. Moreover, our approach compares favourably to AlexNet-L with attention.
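The stop-unit idea can be illustrated with a minimal sketch. The abstract does not state the criterion used to detect stop ASMs, so the example below assumes a plain inverse document frequency (IDF) computed over scene classes, borrowed from the stop-word analogy: a unit that occurs in every scene has zero IDF and is treated as a stop unit. The function names, the unit labels (`a1`, `a2`, ...), the scene labels, and the `(unit, start_frame, end_frame)` segment layout are all hypothetical and are not taken from the paper.

```python
import math


def scene_idf(token_seqs, labels):
    """IDF of each unit over scene classes (assumed criterion, not the paper's).

    token_seqs: list of unit-label sequences, one per utterance.
    labels:     scene label of each utterance.
    """
    n_scenes = len(set(labels))
    scenes_with_unit = {}
    for seq, label in zip(token_seqs, labels):
        for unit in set(seq):
            scenes_with_unit.setdefault(unit, set()).add(label)
    return {unit: math.log(n_scenes / len(scenes))
            for unit, scenes in scenes_with_unit.items()}


def detect_stop_units(token_seqs, labels, idf_threshold=0.0):
    """Units whose IDF is at or below the threshold are flagged as stop units."""
    idf = scene_idf(token_seqs, labels)
    return {unit for unit, value in idf.items() if value <= idf_threshold}


def remove_stop_segments(segments, stop_units):
    """Drop segments tokenized as stop units.

    segments: list of (unit, start_frame, end_frame) tuples for one utterance.
    """
    return [seg for seg in segments if seg[0] not in stop_units]


# Toy tokenized utterances, one per scene (hypothetical data).
token_seqs = [
    ["a1", "a2", "a1", "a3"],
    ["a1", "a4", "a4"],
    ["a1", "a2", "a5"],
]
labels = ["airport", "park", "metro"]

stop_units = detect_stop_units(token_seqs, labels)

# Segment boundaries for the first utterance (hypothetical frame indices).
segments = [("a1", 0, 10), ("a2", 10, 25), ("a1", 25, 40), ("a3", 40, 50)]
kept = remove_stop_segments(segments, stop_units)
```

In this toy data only `a1` occurs in all three scenes, so it is the only unit flagged, and the two `a1` segments are dropped from the first utterance. In a full pipeline, the frames of the kept segments would form the sub-utterance passed to the classifier back-end; a softer threshold or a different indexing-power measure could be substituted without changing the structure.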


