HODGEPODGE: Sound event detection based on ensemble of semi-supervised learning methods

07/17/2019
by   Ziqiang Shi, et al.
0

In this paper, we present a method called HODGEPODGE[1] for large-scale detection of sound events using weakly labeled, synthetic, and unlabeled data proposed in the Detection and Classification of Acoustic Scenes and Events (DCASE) 2019 challenge Task 4: Sound event detection in domestic environments. To perform this task, we adopted the convolutional recurrent neural networks (CRNN) as our backbone network. In order to deal with a small amount of tagged data and a large amounts of unlabeled in-domain data, we aim to focus primarily on how to apply semi-supervise learning methods efficiently to make full use of limited data. Three semi-supervised learning principles have been used in our system, including: 1) Consistency regularization applies data augmentation; 2) MixUp regularizer requiring that the predictions for a interpolation of two inputs is close to the interpolation of the prediction for each individual input; 3) MixUp regularization applies to interpolation between data augmentations. We also tried an ensemble of various models, which are trained by using different semi-supervised learning principles. Our proposed approach significantly improved the performance of the baseline, achieving the event-based f-measure of 42.0% compared to 25.8% event-based f-measure of the baseline in the provided official evaluation dataset. Our submissions ranked third among 18 teams in the task 4.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/13/2020

Hodge and Podge: Hybrid Supervised Sound Event Detection with Multi-Hot MixMatch and Composition Consistence Training

In this paper, we propose a method called Hodge and Podge for sound even...
research
01/30/2021

Semi-supervised Sound Event Detection using Random Augmentation and Consistency Regularization

Sound event detection is a core module for acoustic environmental analys...
research
04/04/2021

IITK@Detox at SemEval-2021 Task 5: Semi-Supervised Learning and Dice Loss for Toxic Spans Detection

In this work, we present our approach and findings for SemEval-2021 Task...
research
09/11/2019

Guided Learning Convolution System for DCASE 2019 Task 4

In this paper, we describe in detail the system we submitted to DCASE201...
research
05/27/2021

Cross-Referencing Self-Training Network for Sound Event Detection in Audio Mixtures

Sound event detection is an important facet of audio tagging that aims t...
research
10/21/2021

RCT: Random Consistency Training for Semi-supervised Sound Event Detection

Sound event detection (SED), as a core module of acoustic environmental ...
research
09/15/2023

Semi-supervised Sound Event Detection with Local and Global Consistency Regularization

Learning meaningful frame-wise features on a partially labeled dataset i...

Please sign up or login with your details

Forgot password? Click here to reset