All Projects → yoyololicon → wavenet-like-vocoder

yoyololicon / wavenet-like-vocoder

Licence: other
Basic wavenet and fftnet vocoder model.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to wavenet-like-vocoder

FFTNet
FFTNet: a Real-Time Speaker-Dependent Neural Vocoder
Stars: ✭ 63 (+215%)
Mutual labels:  vocoder, fftnet
pytorch FFTNet
A pytorch implementation of FFTNet.
Stars: ✭ 35 (+75%)
Mutual labels:  vocoder, fftnet
Parallelwavegan
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Stars: ✭ 682 (+3310%)
Mutual labels:  wavenet
Tacotron 2
DeepMind's Tacotron-2 Tensorflow implementation
Stars: ✭ 1,968 (+9740%)
Mutual labels:  wavenet
Wavenet Enhancement
Speech Enhancement using Bayesian WaveNet
Stars: ✭ 86 (+330%)
Mutual labels:  wavenet
Pytorch Uniwavenet
Stars: ✭ 30 (+50%)
Mutual labels:  wavenet
Numpy Ml
Machine learning, in numpy
Stars: ✭ 11,100 (+55400%)
Mutual labels:  wavenet
Flowavenet
A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
Stars: ✭ 471 (+2255%)
Mutual labels:  wavenet
Seriesnet
Time series prediction using dilated causal convolutional neural nets (temporal CNN)
Stars: ✭ 185 (+825%)
Mutual labels:  wavenet
Chainer Vq Vae
A Chainer implementation of VQ-VAE.
Stars: ✭ 77 (+285%)
Mutual labels:  wavenet
Wavenet vocoder
WaveNet vocoder
Stars: ✭ 1,926 (+9530%)
Mutual labels:  wavenet
Tf Wavenet vocoder
Wavenet and its applications with Tensorflow
Stars: ✭ 58 (+190%)
Mutual labels:  wavenet
Vq Vae Wavenet
TensorFlow implementation of VQ-VAE with WaveNet decoder, based on https://arxiv.org/abs/1711.00937 and https://arxiv.org/abs/1901.08810
Stars: ✭ 40 (+100%)
Mutual labels:  wavenet
Pytorch Gan Timeseries
GANs for time series generation in pytorch
Stars: ✭ 109 (+445%)
Mutual labels:  wavenet
Wavenet Stt
An end-to-end speech recognition system with Wavenet. Built using C++ and python.
Stars: ✭ 18 (-10%)
Mutual labels:  wavenet
Deep Time Series Prediction
Seq2Seq, Bert, Transformer, WaveNet for time series prediction.
Stars: ✭ 183 (+815%)
Mutual labels:  wavenet
Speech Denoising Wavenet
A neural network for end-to-end speech denoising
Stars: ✭ 516 (+2480%)
Mutual labels:  wavenet
Wavenet
WaveNet implementation with chainer
Stars: ✭ 53 (+165%)
Mutual labels:  wavenet
Nsynth wavenet
parallel wavenet based on nsynth
Stars: ✭ 100 (+400%)
Mutual labels:  wavenet
Vq Vae Speech
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Stars: ✭ 187 (+835%)
Mutual labels:  wavenet

WaveNet-like vocoder models

Basic implementations of WaveNet and modified FFTNet in PyTorch. The project structre is brought from pytorch-template.

Requirements

  • NumPy
  • SciPy
  • PyTorch >= 0.4.1
  • tqdm
  • librosa

Quick Start

The code in this repo by default will train a WaveNet (or FFTNet) using 80-dimension mel-spectrogram with linear interpolation.

Preprocess

Use preprocess.py to convert your wave files into mel-spectrograms.

python preprocess.py wave/files/folder -c config.json --out data

The preprocessed data will be stored in ./data. You can change the configurations of "feature" in the .json file.

Train

python train.py -c config.json

Test

Use preprocess.py to convert a single wave file into mel-spectrogram feature.

python preprocess.py example.wav -c config.json --out test

The result is stored in test.npz.

Then use the latest checkpoint file in the ./saved folder to decoded test.npz back to waveform. The generating process will run on gpu if you add --cuda.

python test.py test.npz outfile.wav -r saved/your-model-name/XXXX_XXXXXX/checkpoint-stepXXXXX.pth --cuda

That's it. Other instructions and advanced usage can be found in pytorch-template, I didn't change too much of the whole structure.

Customization

I add a new folder feature which is different from pytorch-template. To use other feature like mfcc instead of mel-spectrogram, you can add your own function in ./feature/features.py with similar arguments style of get_logmel().

Other customization method can be found in pytorch-template.

Fast inference

In test.py I implement fast-wavenet generation process in a very naive way. Use fast_inference.py you can get a huge speed up (CPU only). The speed is around 1500 samples/s on FFTNet and 300 samples/s on WaveNet.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].