Densely Connected CNNs for Bird Audio Detection

07/08/2018
by Thomas Pellegrini, et al.

Automatic detection of bird sounds in audio recordings, if accurate enough, is expected to be of great help to researchers in bio- and ecoacoustics who monitor biodiversity using audio field recordings. To estimate how accurate state-of-the-art machine learning approaches are, the Bird Audio Detection challenge, which involves large audio datasets, was recently organized. In this paper, experiments using several types of convolutional neural networks (standard CNNs, residual nets and densely connected nets) are reported in the framework of this challenge. DenseNets were the preferred solution since they were the best-performing and most compact models, leading to an 88.22% area under the receiver operating characteristic curve (AUC) score on the test set of the challenge. Performance gains were obtained thanks to data augmentation through time and frequency shifting, model parameter averaging during training, and ensemble methods using the geometric mean. In contrast, attempts to enlarge the training dataset with test-set samples, using automatic predictions as pseudo-ground-truth labels, consistently degraded performance.
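
To make two of the techniques named in the abstract concrete, the following is a minimal sketch of geometric-mean ensembling and of time/frequency shifting applied to a spectrogram. It is not the authors' implementation: the function names, the list-of-lists spectrogram layout, the `eps` floor, and the use of circular (wrap-around) shifts are all assumptions made for illustration.

```python
import math
import random


def geometric_mean_ensemble(model_probs):
    """Combine per-model probabilities for each clip with a geometric mean.

    model_probs: one inner list of probabilities per model, all of the
    same length (one entry per audio clip). Hypothetical layout.
    """
    eps = 1e-12  # floor to avoid log(0); an illustrative choice
    n_models = len(model_probs)
    combined = []
    for clip_probs in zip(*model_probs):
        # The geometric mean is computed as exp(mean(log p)).
        log_sum = sum(math.log(max(p, eps)) for p in clip_probs)
        combined.append(math.exp(log_sum / n_models))
    return combined


def shift_spectrogram(spec, time_shift=0, freq_shift=0):
    """Circularly shift a spectrogram where spec[f][t] is band f, frame t.

    Wrap-around shifting is an assumption; the abstract does not say how
    samples shifted past the edges are handled.
    """
    n_freq = len(spec)
    n_time = len(spec[0])
    shifted = []
    for f in range(n_freq):
        src_row = spec[(f - freq_shift) % n_freq]
        shifted.append([src_row[(t - time_shift) % n_time] for t in range(n_time)])
    return shifted


def random_shift(spec, max_time_shift, max_freq_shift, rng=random):
    """Draw random shift amounts and apply them (augmentation at training time)."""
    dt = rng.randint(-max_time_shift, max_time_shift)
    df = rng.randint(-max_freq_shift, max_freq_shift)
    return shift_spectrogram(spec, time_shift=dt, freq_shift=df)
```

For example, if two models output 0.9 and 0.4 for the same clip, `geometric_mean_ensemble([[0.9], [0.4]])` gives about 0.6, slightly below the arithmetic mean of 0.65. The geometric mean is pulled toward the lower score, so a clip only receives a high ensemble score when the models agree.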
