Kyubyong / vq-vae

License: Apache-2.0
A TensorFlow implementation of VQ-VAE speaker conversion

VQ-VAE

This is a TensorFlow implementation of VQ-VAE speaker conversion, introduced in the paper Neural Discrete Representation Learning. Although the training curves look fine, the samples generated during training were bad. Unfortunately, I have no time to dig into this further, as I'm tied up with other projects. I am publishing this project for those who are interested in the paper or its implementation. If you succeed in training a model based on this repo, please share the good news.
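For readers new to the paper, the core idea of VQ-VAE is that each encoder output vector is replaced by its nearest entry in a learned codebook. The snippet below is a minimal NumPy sketch of that lookup only. It is not taken from this repo's code; the function name `quantize`, the array names, and the toy values are all assumptions made for illustration, and the training-time details (straight-through gradients, codebook and commitment losses) are left out.

```python
import numpy as np


def quantize(z_e, codebook):
    """Map each encoder output vector to its nearest codebook entry.

    z_e:      (N, D) array of encoder outputs (hypothetical name)
    codebook: (K, D) array of embedding vectors (hypothetical name)
    Returns (indices, z_q) with shapes (N,) and (N, D).
    """
    # Squared Euclidean distance between every row of z_e and every
    # codebook row, giving an (N, K) matrix.
    dists = ((z_e[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    # Index of the closest codebook entry for each input vector.
    indices = dists.argmin(axis=1)
    # Quantized output: the selected codebook rows.
    z_q = codebook[indices]
    return indices, z_q


# Toy example: K=3 codebook entries, D=2 dimensions, N=3 inputs.
codebook = np.array([[0.0, 0.0], [1.0, 1.0], [-1.0, 2.0]])
z_e = np.array([[0.9, 1.2], [-0.8, 1.7], [0.1, -0.2]])
indices, z_q = quantize(z_e, codebook)
print(indices)  # [1 2 0]
```

In the speaker-conversion setting described in the paper, the decoder is additionally conditioned on a speaker identity, so the same discrete codes can be decoded in a different voice.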

Data

Requirements

  • NumPy >= 1.11.1
  • TensorFlow >= 1.3
  • librosa
  • tqdm
  • matplotlib
  • scipy

Training

  • STEP 0. Download the VCTK corpus.
  • STEP 1. Adjust hyperparameters in hyperparams.py.
  • STEP 2. Run python prepro.py.
  • STEP 3. Run python train.py.
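
STEP 2 and STEP 3 can also be chained from a small driver script, so that training only starts if preprocessing succeeded. The following is a rough sketch under stated assumptions: the helper names `build_commands` and `run_pipeline` and the `dry_run` flag are hypothetical and not part of this repo, and it assumes the data from STEP 0 is already in place and hyperparams.py has been edited.

```python
import subprocess
import sys
from pathlib import Path

# Script names as given in STEP 2 and STEP 3; the order matters.
STEPS = ["prepro.py", "train.py"]


def build_commands(python=sys.executable):
    """Return one command (argument list) per step."""
    return [[python, script] for script in STEPS]


def run_pipeline(repo_dir, dry_run=False):
    """Run each step from repo_dir, stopping at the first failure.

    repo_dir is the local checkout of this repo (path chosen by the user).
    With dry_run=True the commands are printed instead of executed.
    """
    repo_dir = Path(repo_dir)
    commands = build_commands()
    for cmd in commands:
        script = repo_dir / cmd[1]
        if not script.is_file():
            raise FileNotFoundError("missing script: {}".format(script))
        if dry_run:
            print("would run:", " ".join(cmd))
            continue
        # check=True raises CalledProcessError on a non-zero exit code,
        # so train.py never runs after a failed prepro.py.
        subprocess.run(cmd, cwd=str(repo_dir), check=True)
    return commands
```

Calling `run_pipeline(".", dry_run=True)` from inside the checkout prints the two commands without running them, which is a cheap way to confirm the paths before starting a long training job.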