Implementation of the framework described in the paper Spectrogram Inpainting for Interactive Generation of Instrument Sounds published at the 2020 Joint Conference on AI Music Creativity.

Stars: ✭ 26 (+0%)

Mutual labels: vq-vae

VQ-VAE with PixelCNN prior

Workflow

Train the Vector Quantised Variational AutoEncoder (VQ-VAE) for discrete representation and reconstruction.
Use PixelCNN to learn the priors on the discrete latents for image sampling.

Acknowledgement

VQ-VAE is originally mentioned in the paper Neural Discrete Representation Learning.
PixelCNN is proposed in the papers Pixel Recurrent Neural Networks and Conditional Image Generation with PixelCNN Decoders.
Implementation of VQ-VAE (without priors) is based on the official codes from Google DeepMind. Note: Different from the official codes, the implementation here does not rely on the Sonnet library.
Implementation of PixelCNN is based on this repo with little modify.
We provide the slides which may be of help for readers to gain better understanding on PixelCNN and VQ-VAE. Some images used in the slides are borrowed from papers and websites, so the slides can only be used for learning purpose.

Usage

Run MNIST: vqvae1_withPixelCNNprior_mnist.py
Run cifar-10: vqvae1_withPixelCNNprior_cifar10.py

Results

	Testing data	Reconstruction	Random samples	Samples based on PixelCNN prior
MNIST
cifar-10

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

jiazhao97 / VQ-VAE_withPixelCNNprior

Programming Languages

Labels

Projects that are alternatives of or similar to VQ-VAE withPixelCNNprior

VQ-VAE with PixelCNN prior

Workflow

Acknowledgement

Usage

Results