h-meru / Tacotron Wavernn
Licence: MIT
TTS (Tacotron + WaveRNN)
Tacotron-WaveRNN
Tacotron + WaveRNN synthesis
Makes use of:
- Tacotron: https://github.com/Rayhane-mamah/Tacotron-2
- WaveRNN: https://github.com/fatchord/WaveRNN
At minimum you'll need Python 3, PyTorch 0.4.1, TensorFlow, and librosa.
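A quick way to confirm that the packages listed above are importable is sketched below. The import names (`torch`, `tensorflow`, `librosa`) are the usual ones for these packages, and the helper `missing_modules` is a hypothetical name, not part of this repository. The sketch only checks that each module can be found; it does not verify the PyTorch 0.4.1 version.

```python
import importlib.util
import sys

# Usual import names for the packages the README lists as requirements.
REQUIRED_MODULES = ("torch", "tensorflow", "librosa")


def missing_modules(names=REQUIRED_MODULES):
    """Return the top-level module names in `names` that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    print("Python", sys.version.split()[0])
    missing = missing_modules()
    print("Missing:", ", ".join(missing) if missing else "none")
```

An empty result means every listed module was found in the current environment.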
Preprocess
python3 preprocess.py --model='WaveRNN'
Default parameters:
name | default | choices |
---|---|---|
--base_dir | | |
--hparams | e.g. 'wavernn_gpu_num=4, wavernn_batch_size=16' | |
--model | 'Tacotron' | 'Tacotron', 'WaveRNN' |
--dataset | 'LJSpeech-1.1' | 'LJSpeech-1.0', 'LJSpeech-1.1', 'M-AILABS' |
For the other parameters, see this file...
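The `--hparams` value is a comma-separated list of `name=value` overrides. How the repository itself parses this string is not shown here, so the following is only a stand-alone sketch of the format, with a hypothetical helper name (`parse_hparams_overrides`) and a simple int/float/string conversion that is an assumption.

```python
def _convert(raw):
    """Try int, then float; fall back to the raw string."""
    for cast in (int, float):
        try:
            return cast(raw)
        except ValueError:
            pass
    return raw


def parse_hparams_overrides(spec):
    """Parse 'name=value, name=value' into a dict of overrides."""
    overrides = {}
    for item in spec.split(","):
        item = item.strip()
        if not item:
            continue  # tolerate empty input and trailing commas
        name, sep, raw = item.partition("=")
        if not sep:
            raise ValueError(f"expected name=value, got {item!r}")
        overrides[name.strip()] = _convert(raw.strip())
    return overrides


if __name__ == "__main__":
    print(parse_hparams_overrides("wavernn_gpu_num=4, wavernn_batch_size=16"))
```

The example string is the one from the table above; any other parameter names would have to match those defined in the project's hyperparameter file.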
Training
python3 train.py --model='Tacotron-2' --GTA --use_cuda
If you would like to train separately...
# Tacotron
python3 train.py --model='Tacotron'
# Tacotron GTA synthesis
python3 synthesize.py --model='Tacotron' --mode='synthesis' --GTA
# WaveRNN
python3 train.py --model='WaveRNN' --use_cuda
Default parameters:
name | default | choices |
---|---|---|
--base_dir | | |
--hparams | e.g. 'wavernn_gpu_num=4, wavernn_batch_size=16' | |
--model | 'Tacotron-2' | 'Tacotron-2', 'Tacotron', 'WaveRNN' |
--mode | 'synthesis' | 'eval', 'synthesis', 'live' |
--init | False | True, False |
--slack_url | {your slack webhook url...} | |
--use_cuda | False | True, False |
For the other parameters, see this file...
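When training the models separately, the three commands are listed in a fixed order: train Tacotron, run Tacotron synthesis with `--GTA`, then train WaveRNN (presumably on the output of the GTA step). A minimal sketch of chaining them from Python follows. The helper names `separate_training_steps` and `run_steps` are hypothetical, and only flags that appear in this README are used.

```python
import subprocess


def separate_training_steps(use_cuda=False):
    """Return the three README commands as argv lists, in the listed order."""
    wavernn = ["python3", "train.py", "--model=WaveRNN"]
    if use_cuda:
        wavernn.append("--use_cuda")
    return [
        ["python3", "train.py", "--model=Tacotron"],
        ["python3", "synthesize.py", "--model=Tacotron",
         "--mode=synthesis", "--GTA"],
        wavernn,
    ]


def run_steps(steps):
    """Run each command in turn, stopping at the first failure."""
    for argv in steps:
        subprocess.run(argv, check=True)


if __name__ == "__main__":
    # Print the commands rather than launching a long training run.
    for argv in separate_training_steps(use_cuda=True):
        print(" ".join(argv))
```

The quotes around model names in the shell commands are shell quoting only, so they are dropped in the argv lists. Calling `run_steps(separate_training_steps(use_cuda=True))` from the repository root would run the steps one after another.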
Synthesis
python3 synthesize.py --model='Tacotron-2' --text_list={your text file}
Default parameters:
name | default | choices |
---|---|---|
--base_dir | | |
--hparams | e.g. 'wavernn_gpu_num=4, wavernn_batch_size=16' | |
--model | 'Tacotron-2' | 'Tacotron-2', 'Tacotron', 'WaveRNN' |
--mode | 'eval' | 'eval', 'synthesis', 'live' |
--text_list | {your text file...} | |
--use_cuda | False | True, False |
For the other parameters, see this file...
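The README does not describe the layout of the `--text_list` file. The sketch below assumes one sentence per line in UTF-8, which is an assumption rather than something documented here, and uses hypothetical helper names (`format_text_list`, `write_text_list`, `synthesis_command`).

```python
from pathlib import Path


def format_text_list(sentences):
    """Join non-empty sentences one per line (assumed file layout)."""
    lines = [s.strip() for s in sentences if s.strip()]
    return "".join(line + "\n" for line in lines)


def write_text_list(sentences, path):
    """Write the formatted sentences to `path` and return it as a Path."""
    path = Path(path)
    path.write_text(format_text_list(sentences), encoding="utf-8")
    return path


def synthesis_command(text_list, use_cuda=False):
    """Build the README's synthesis command for the given text file."""
    argv = ["python3", "synthesize.py", "--model=Tacotron-2",
            f"--text_list={text_list}"]
    if use_cuda:
        argv.append("--use_cuda")
    return argv
```

For example, `write_text_list(["Hello world.", "This is a test."], "sentences.txt")` followed by `synthesis_command("sentences.txt")` yields the command shown at the top of this section with the file name filled in.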
Pretrained Model (old)
https://github.com/h-meru/Tacotron-WaveRNN/files/2444777/wavernn_model.zip
Samples (old)
https://github.com/h-meru/Tacotron-WaveRNN/files/2444792/Samples_730k.zip