HooliGAN: Robust, High Quality Neural Vocoding

08/06/2020
by   Ollie McCarthy, et al.
0

Recent developments in generative models have shown that deep learning combined with traditional digital signal processing (DSP) techniques could successfully generate convincing violin samples [1], that source-excitation combined with WaveNet yields high-quality vocoders [2, 3] and that generative adversarial network (GAN) training can improve naturalness [4, 5]. By combining the ideas in these models we introduce HooliGAN, a robust vocoder that has state of the art results, finetunes very well to smaller datasets (<30 minutes of speechdata) and generates audio at 2.2MHz on GPU and 35kHz on CPU. We also show a simple modification to Tacotron-basedmodels that allows seamless integration with HooliGAN. Results from our listening tests show the proposed model's ability to consistently output high-quality audio with a variety of datasets, big and small. We provide samples at the following demo page: https://resemble-ai.github.io/hooligan_demo/

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/25/2021

Basis-MelGAN: Efficient Neural Vocoder Based on Audio Decomposition

Recent studies have shown that neural vocoders based on generative adver...
research
03/12/2022

A Proposal to Study "Is High Quality Data All We Need?"

Even though deep neural models have achieved superhuman performance on m...
research
11/10/2019

EarthquakeGen: Earthquake Simulation Using Generative Adversarial Networks

Detecting earthquake events from seismic time series has proved itself a...
research
01/14/2020

DDSP: Differentiable Digital Signal Processing

Most generative models of audio directly generate samples in one of two ...
research
01/23/2023

ECGAN: Self-supervised generative adversarial network for electrocardiography

High-quality synthetic data can support the development of effective pre...
research
05/02/2019

High quality, lightweight and adaptable TTS using LPCNet

We present a lightweight adaptable neural TTS system with high quality o...
research
04/19/2023

SP-BatikGAN: An Efficient Generative Adversarial Network for Symmetric Pattern Generation

Following the contention of AI arts, our research focuses on bringing AI...

Please sign up or login with your details

Forgot password? Click here to reset