Neural audio synthesis of musical notes with wavenet autoencoders
Generative models in vision have seen rapid progress due to algorithmic improvements and
the availability of high-quality image datasets. In this paper, we offer contributions in both …
the availability of high-quality image datasets. In this paper, we offer contributions in both …
Sing: Symbol-to-instrument neural generator
Recent progress in deep learning for audio synthesis opens the way to models that directly
produce the waveform, shifting away from the traditional paradigm of relying on vocoders or …
produce the waveform, shifting away from the traditional paradigm of relying on vocoders or …
[PDF][PDF] Wavenet: A generative model for raw audio
This paper introduces WaveNet, a deep neural network for generating raw audio waveforms.
The model is fully probabilistic and autoregressive, with the predictive distribution for each …
The model is fully probabilistic and autoregressive, with the predictive distribution for each …
RAVE: A variational autoencoder for fast and high-quality neural audio synthesis
Deep generative models applied to audio have improved by a large margin the state-of-the-
art in many speech and music related tasks. However, as raw waveform modelling remains …
art in many speech and music related tasks. However, as raw waveform modelling remains …
Neural waveshaping synthesis
We present the Neural Waveshaping Unit (NEWT): a novel, lightweight, fully causal
approach to neural audio synthesis which operates directly in the waveform domain, with an …
approach to neural audio synthesis which operates directly in the waveform domain, with an …
A generative model for raw audio using transformer architectures
P Verma, C Chafe - … Conference on Digital Audio Effects (DAFx …, 2021 - ieeexplore.ieee.org
This paper proposes a novel way of doing audio synthesis at the waveform level using
Transformer architectures. We propose a deep neural network for generating waveforms …
Transformer architectures. We propose a deep neural network for generating waveforms …
Gansynth: Adversarial neural audio synthesis
Efficient audio synthesis is an inherently difficult machine learning task, as human
perception is sensitive to both global structure and fine-scale waveform coherence …
perception is sensitive to both global structure and fine-scale waveform coherence …
Multi-instrument music synthesis with spectrogram diffusion
An ideal music synthesizer should be both interactive and expressive, generating high-
fidelity audio in realtime for arbitrary combinations of instruments and notes. Recent neural …
fidelity audio in realtime for arbitrary combinations of instruments and notes. Recent neural …
DDSP: Differentiable digital signal processing
Most generative models of audio directly generate samples in one of two domains: time or
frequency. While sufficient to express any signal, these representations are inefficient, as …
frequency. While sufficient to express any signal, these representations are inefficient, as …
MidiNet: A convolutional generative adversarial network for symbolic-domain music generation
Most existing neural network models for music generation use recurrent neural networks.
However, the recent WaveNet model proposed by DeepMind shows that convolutional …
However, the recent WaveNet model proposed by DeepMind shows that convolutional …