Diffusion-Based Audio Inpainting

05/24/2023
by   Eloi Moliner, et al.

Audio inpainting aims to reconstruct missing segments in corrupted recordings. Previous methods produce plausible reconstructions when the gap is shorter than about 100 ms, but their quality degrades for longer gaps. This paper explores recent advances in deep learning, particularly diffusion models, for the task of audio inpainting. The proposed method uses an unconditionally trained generative model that can be conditioned in a zero-shot fashion for audio inpainting, which gives it the flexibility to regenerate gaps of arbitrary length. An improved deep neural network architecture based on the constant-Q transform, which allows the model to exploit pitch-equivariant symmetries in audio, is also presented. The performance of the proposed algorithm is evaluated with objective and subjective metrics on the task of reconstructing short to mid-sized gaps. The results of a formal listening test show that the proposed method performs comparably to the state of the art for short gaps, while retaining good audio quality and outperforming the baselines for the longest gap lengths tested, 150 ms and 200 ms. This work helps improve the restoration of sound recordings with fairly long local disturbances or dropouts that must be reconstructed.
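To make the problem setup concrete, the following minimal sketch shows how a gap can be represented as a binary mask over a waveform, and how known samples can be re-imposed on a candidate reconstruction. This is only an illustration under stated assumptions: the 44.1 kHz sample rate, the sine test signal, and the helper names `make_gap_mask` and `data_consistency` are hypothetical, and random noise stands in for the output of a generative model. It is not the conditioning scheme described in the paper.

```python
import numpy as np


def make_gap_mask(num_samples, gap_start, gap_length):
    """Return a mask that is 1.0 where samples are known and 0.0 inside the gap."""
    mask = np.ones(num_samples, dtype=np.float64)
    mask[gap_start:gap_start + gap_length] = 0.0
    return mask


def data_consistency(estimate, observed, mask):
    """Keep the observed samples outside the gap; use the estimate inside it."""
    return mask * observed + (1.0 - mask) * estimate


# Assumed values for illustration only.
sample_rate = 44100                         # samples per second
gap_ms = 100                                # gap length in milliseconds
gap_length = sample_rate * gap_ms // 1000   # gap length in samples

# One second of a 440 Hz sine as a toy "recording".
t = np.arange(sample_rate) / sample_rate
clean = np.sin(2.0 * np.pi * 440.0 * t)

# Corrupt the recording by zeroing out the gap.
mask = make_gap_mask(clean.size, gap_start=20000, gap_length=gap_length)
observed = mask * clean

# Stand-in for a generative model's output (random noise, NOT a real model).
rng = np.random.default_rng(0)
estimate = rng.standard_normal(clean.size)

# Only the gap is filled from the estimate; the context is left untouched.
restored = data_consistency(estimate, observed, mask)
```

In a zero-shot setting, a step of this kind would typically be applied to intermediate estimates during sampling, so that the generated content in the gap stays coherent with the surrounding context.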

research
10/09/2020

Audio-Visual Speech Inpainting with Deep Learning

In this paper, we present a deep-learning-based framework for audio-visu...
research
05/11/2020

GACELA – A generative adversarial context encoder for long audio inpainting

We introduce GACELA, a generative adversarial network (GAN) designed to ...
research
10/27/2022

Solving Audio Inverse Problems with a Diffusion Model

This paper presents CQT-Diff, a data-driven generative audio model that ...
research
03/13/2020

Audio inpainting with generative adversarial network

We study the ability of Wasserstein Generative Adversarial Network (WGAN...
research
10/29/2018

A context encoder for audio inpainting

We studied the ability of deep neural networks (DNNs) to restore missing...
research
06/02/2023

Zero-Shot Blind Audio Bandwidth Extension

Audio bandwidth extension involves the realistic reconstruction of high-...
research
08/17/2017

Automatic Organisation and Quality Analysis of User-Generated Content with Audio Fingerprinting

The increase of the quantity of user-generated content experienced in so...
