Conditioning Deep Generative Raw Audio Models for Structured Automatic Music

06/26/2018
by   Rachel Manzelli, et al.
0

Existing automatic music generation approaches that feature deep learning can be broadly classified into two types: raw audio models and symbolic models. Symbolic models, which train and generate at the note level, are currently the more prevalent approach; these models can capture long-range dependencies of melodic structure, but fail to grasp the nuances and richness of raw audio generations. Raw audio models, such as DeepMind's WaveNet, train directly on sampled audio waveforms, allowing them to produce realistic-sounding, albeit unstructured music. In this paper, we propose an automatic music generation methodology combining both of these approaches to create structured, realistic-sounding compositions. We consider a Long Short Term Memory network to learn the melodic structure of different styles of music, and then use the unique symbolic generations from this model as a conditioning input to a WaveNet-based raw audio generator, creating a model for automatic, novel music. We then evaluate this approach by showcasing results of this work.

READ FULL TEXT
research
06/26/2018

The challenge of realistic music generation: modelling raw audio at scale

Realistic music generation is a challenging task. When building generati...
research
11/16/2018

Generating Black Metal and Math Rock: Beyond Bach, Beethoven, and Beatles

We use a modified SampleRNN architecture to generate music in modern gen...
research
07/10/2019

Explicitly Conditioned Melody Generation: A Case Study with Interdependent RNNs

Deep generative models for symbolic music are typically designed to mode...
research
02/05/2020

Continuous Melody Generation via Disentangled Short-Term Representations and Structural Conditions

Automatic music generation is an interdisciplinary research topic that c...
research
01/12/2021

MP3net: coherent, minute-long music generation from raw audio with a simple convolutional GAN

We present a deep convolutional GAN which leverages techniques from MP3/...
research
07/20/2023

Progressive distillation diffusion for raw music generation

This paper aims to apply a new deep learning approach to the task of gen...
research
06/26/2021

An Audio Envelope Generator Derived from Industrial Process Control

Audio envelopes serve a crucial role in ensuring the versatility of synt...

Please sign up or login with your details

Forgot password? Click here to reset