Implementation of the framework described in the paper Spectrogram Inpainting for Interactive Generation of Instrument Sounds published at the 2020 Joint Conference on AI Music Creativity.
Stars: ✭ 26 (+0%)
Mutual labels: vq-vae
VQ-VAE with PixelCNN prior
Workflow
Train the Vector Quantised Variational AutoEncoder (VQ-VAE) for discrete representation and reconstruction.
Use PixelCNN to learn the priors on the discrete latents for image sampling.
Implementation of PixelCNN is based on this repo with little modify.
We provide the slides which may be of help for readers to gain better understanding on PixelCNN and VQ-VAE. Some images used in the slides are borrowed from papers and websites, so the slides can only be used for learning purpose.
Usage
Run MNIST: vqvae1_withPixelCNNprior_mnist.py
Run cifar-10: vqvae1_withPixelCNNprior_cifar10.py
Results
Testing data
Reconstruction
Random samples
Samples based on PixelCNN prior
MNIST
cifar-10
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].