

Tacotron-WaveRNN

Tacotron + WaveRNN synthesis

Makes use of:

Tacotron: https://github.com/Rayhane-mamah/Tacotron-2
WaveRNN: https://github.com/fatchord/WaveRNN

You'll need at least Python 3, PyTorch 0.4.1, TensorFlow, and librosa.

Preprocess

python3 preprocess.py --model='WaveRNN'

Default parameters:

name	default	options / example
--base_dir
--hparams		e.g. 'wavernn_gpu_num=4, wavernn_batch_size=16'
--model	'Tacotron'	'Tacotron', 'WaveRNN'
--dataset	'LJSpeech-1.1'	'LJSpeech-1.0', 'LJSpeech-1.1', 'M-AILABS'

For other parameters, look at this file...
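
The `--hparams` option takes a comma-separated string of `name=value` overrides, as in the example above. As a rough illustration of how such a string could be turned into typed values, here is a minimal standalone sketch. The function names `parse_hparams_overrides` and `_convert` are hypothetical and are not taken from this repository, which may parse the string differently (for example through TensorFlow's own hyperparameter utilities).

```python
def _convert(raw):
    """Convert a raw string to bool, int, or float when possible (hypothetical helper)."""
    if raw.lower() in ("true", "false"):
        return raw.lower() == "true"
    for cast in (int, float):
        try:
            return cast(raw)
        except ValueError:
            pass
    return raw  # fall back to the plain string


def parse_hparams_overrides(spec):
    """Parse 'name=value, name=value' into a dict of overrides.

    Assumption: values contain no commas; list-valued hyperparameters
    would need a more careful parser than this sketch.
    """
    overrides = {}
    for item in spec.split(","):
        item = item.strip()
        if not item:
            continue
        name, sep, raw = item.partition("=")
        if not sep:
            raise ValueError(f"expected name=value, got {item!r}")
        overrides[name.strip()] = _convert(raw.strip())
    return overrides


print(parse_hparams_overrides("wavernn_gpu_num=4, wavernn_batch_size=16"))
# {'wavernn_gpu_num': 4, 'wavernn_batch_size': 16}
```

The resulting dict could then be applied on top of the default hyperparameters before preprocessing or training starts.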

Training

python3 train.py --model='Tacotron-2' --GTA --use_cuda

If you would like to train each model separately, run the steps below in order:

# Tacotron
python3 train.py --model='Tacotron'

# Tacotron synth
python3 synthesize.py --model='Tacotron' --mode='synthesis' --GTA

# WaveRNN
python3 train.py --model='WaveRNN' --use_cuda
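
The three separate steps above (train Tacotron, synthesize ground-truth-aligned output, train WaveRNN) can also be chained from a small driver script. The sketch below only assembles the commands exactly as listed above and prints them by default; `build_pipeline` and `run_pipeline` are hypothetical helper names, not part of this repository.

```python
import subprocess


def build_pipeline(use_cuda=True):
    """Return the three separate-training commands, in the order listed above."""
    wavernn_cmd = ["python3", "train.py", "--model=WaveRNN"]
    if use_cuda:
        wavernn_cmd.append("--use_cuda")
    return [
        # 1. Train Tacotron
        ["python3", "train.py", "--model=Tacotron"],
        # 2. Tacotron synthesis with --GTA
        ["python3", "synthesize.py", "--model=Tacotron", "--mode=synthesis", "--GTA"],
        # 3. Train WaveRNN
        wavernn_cmd,
    ]


def run_pipeline(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the pipeline if a step fails
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run_pipeline(build_pipeline(), dry_run=True)
```

Set `dry_run=False` to actually launch the steps from the repository root. Each step can take a long time, so running them by hand as shown above may be more practical.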

Default parameters:

name	default	options / example
--base_dir
--hparams		e.g. 'wavernn_gpu_num=4, wavernn_batch_size=16'
--model	'Tacotron-2'	'Tacotron-2', 'Tacotron', 'WaveRNN'
--mode	'synthesis'	'eval', 'synthesis', 'live'
--init	False	True, False
--slack_url		{your Slack webhook URL...}
--use_cuda	False	True, False

For other parameters, look at this file...
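
To make the training options table concrete, here is a hedged sketch of an `argparse` parser that mirrors it. This is not the actual `train.py`: the function name `build_train_parser` is hypothetical, the empty defaults for `--base_dir` and `--hparams` are assumptions (the table leaves them blank), and modeling `--init` and `--use_cuda` as bare flags is an assumption based on how `--use_cuda` and `--GTA` appear in the commands above.

```python
import argparse


def build_train_parser():
    """Build a parser mirroring the training options table (illustrative only)."""
    parser = argparse.ArgumentParser(
        description="Illustrative mirror of the training options table"
    )
    # Defaults left blank in the table are assumed to be empty here.
    parser.add_argument("--base_dir", default="")
    parser.add_argument("--hparams", default="",
                        help="comma-separated name=value overrides")
    parser.add_argument("--model", default="Tacotron-2",
                        choices=["Tacotron-2", "Tacotron", "WaveRNN"])
    parser.add_argument("--mode", default="synthesis",
                        choices=["eval", "synthesis", "live"])
    # Assumption: boolean options are bare flags that default to False.
    parser.add_argument("--init", action="store_true")
    parser.add_argument("--slack_url", default=None)
    parser.add_argument("--use_cuda", action="store_true")
    # --GTA is used in the commands above but is not listed in the table.
    parser.add_argument("--GTA", action="store_true")
    return parser


args = build_train_parser().parse_args(["--model=WaveRNN", "--use_cuda"])
print(args.model, args.use_cuda, args.mode)
```

With no arguments, the sketch falls back to the defaults from the table: `--model` is `'Tacotron-2'` and `--use_cuda` is off.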

Synthesis

python3 synthesize.py --model='Tacotron-2' --text_list={your text file}

Default parameters:

name	default	options / example
--base_dir
--hparams		e.g. 'wavernn_gpu_num=4, wavernn_batch_size=16'
--model	'Tacotron-2'	'Tacotron-2', 'Tacotron', 'WaveRNN'
--mode	'eval'	'eval', 'synthesis', 'live'
--text_list		{your text file...}
--use_cuda	False	True, False

For other parameters, look at this file...

Pretrained Model (old)

https://github.com/h-meru/Tacotron-WaveRNN/files/2444777/wavernn_model.zip

Samples (old)

https://github.com/h-meru/Tacotron-WaveRNN/files/2444792/Samples_730k.zip
