fatchord / Wavernn

Licence: mit

WaveRNN Vocoder + TTS

Programming Languages

139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Wavernn

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-98.66%)

Mutual labels: text-to-speech, tts, speech-synthesis, tacotron

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (-80.13%)

Mutual labels: speech-synthesis, text-to-speech, tts

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-98.11%)

Mutual labels: speech-synthesis, text-to-speech, tts

Wsay

Windows "say"

Stars: ✭ 36 (-97.8%)

Mutual labels: speech-synthesis, text-to-speech, tts

Jsut Lab

HTS-style full-context labels for JSUT v1.1

Stars: ✭ 28 (-98.29%)

Mutual labels: speech-synthesis, text-to-speech, tts

Glow Tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Stars: ✭ 284 (-82.64%)

Mutual labels: speech-synthesis, text-to-speech, tts

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (-58.31%)

Mutual labels: speech-synthesis, text-to-speech, tts

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-95.48%)

Mutual labels: text-to-speech, tts, speech-synthesis

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (-77.87%)

Mutual labels: speech-synthesis, text-to-speech, tts

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (-96.82%)

Mutual labels: speech-synthesis, text-to-speech, tts

Tacotron2

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

Stars: ✭ 43 (-97.37%)

Mutual labels: speech-synthesis, tacotron, text-to-speech

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (-82.95%)

Mutual labels: speech-synthesis, text-to-speech, tts

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (-93.7%)

Mutual labels: speech-synthesis, text-to-speech, tts

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (-80.93%)

Mutual labels: speech-synthesis, text-to-speech, tts

esp32-flite

Speech synthesis running on ESP32 based on Flite engine.

Stars: ✭ 28 (-98.29%)

Mutual labels: text-to-speech, tts, speech-synthesis

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (-80.2%)

Mutual labels: speech-synthesis, text-to-speech, tts

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-96.82%)

Mutual labels: text-to-speech, tts, speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-95.54%)

Mutual labels: text-to-speech, tts, speech-synthesis

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+231.72%)

Mutual labels: tacotron, text-to-speech, tts

Tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Stars: ✭ 305 (-81.36%)

Mutual labels: tacotron, text-to-speech, tts

View All Similar Projects ➔

WaveRNN

(Update: Vanilla Tacotron One TTS system just implemented - more coming soon!)

Pytorch implementation of Deepmind's WaveRNN model from Efficient Neural Audio Synthesis

Installation

Ensure you have:

Python >= 3.6
Pytorch 1 with CUDA

Then install the rest with pip:

pip install -r requirements.txt

How to Use

Quick Start

If you want to use TTS functionality immediately you can simply use:

python quick_start.py

This will generate everything in the default sentences.txt file and output to a new 'quick_start' folder where you can playback the wav files and take a look at the attention plots

You can also use that script to generate custom tts sentences and/or use '-u' to generate unbatched (better audio quality):

python quick_start.py -u --input_text "What will happen if I run this command?"

Training your own Models

Download the LJSpeech Dataset.

Edit hparams.py, point wav_path to your dataset and run:

python preprocess.py

or use preprocess.py --path to point directly to the dataset

Here's my recommendation on what order to run things:

1 - Train Tacotron with:

python train_tacotron.py

2 - You can leave that finish training or at any point you can use:

python train_tacotron.py --force_gta

this will force tactron to create a GTA dataset even if it hasn't finish training.

3 - Train WaveRNN with:

python train_wavernn.py --gta

NB: You can always just run train_wavernn.py without --gta if you're not interested in TTS.

4 - Generate Sentences with both models using:

python gen_tacotron.py wavernn

this will generate default sentences. If you want generate custom sentences you can use

python gen_tacotron.py --input_text "this is whatever you want it to be" wavernn

And finally, you can always use --help on any of those scripts to see what options are available :)

Samples

Can be found here.

Pretrained Models

Currently there are two pretrained models available in the /pretrained/ folder':

Both are trained on LJSpeech

WaveRNN (Mixture of Logistics output) trained to 800k steps
Tacotron trained to 180k steps

References

Acknowlegements

https://github.com/keithito/tacotron
https://github.com/r9y9/wavenet_vocoder
Special thanks to github users G-Wang, geneing & erogol

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

fatchord / Wavernn

Programming Languages

Labels

Projects that are alternatives of or similar to Wavernn

WaveRNN

(Update: Vanilla Tacotron One TTS system just implemented - more coming soon!)

Installation

How to Use

Quick Start

Training your own Models

Samples

Pretrained Models

References

Acknowlegements