Programming Languages

python

139335 projects - #7 most used programming language

WaveVAE

work in progress

Note that my implementation isn't stable yet.

A Pytorch Implementation of WaveVAE (Mel Spectrogram --> Waveform)

part of "Parallel Neural Text-to-Speech"

Requirements

PyTorch 0.4.1 & python 3.6 & Librosa

Examples

Step 1. Download Dataset

LJSpeech : https://keithito.com/LJ-Speech-Dataset/

Step 2. Preprocessing (Preparing Mel Spectrogram)

python preprocessing.py --in_dir ljspeech --out_dir DATASETS/ljspeech

Step 3. Train Model

python train.py --model_name wavevae_1 --batch_size 4 --num_gpu 2

Step 4. Synthesize

--load_step CHECKPOINT : the # of the model's global training step (also depicted in the trained weight file)

python synthesize.py --model_name wavevae_1 --load_step 10000 --num_samples 5

References

WaveNet vocoder : https://github.com/r9y9/wavenet_vocoder
Parallel Neural Text-to-Speech : https://arxiv.org/abs/1905.08459

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

ksw0306 / WaveVAE