All Projects → ishandutta2007 → Text-to-Speech-Landscape

ishandutta2007 / Text-to-Speech-Landscape

Licence: other
No description or website provided.

Projects that are alternatives of or similar to Text-to-Speech-Landscape

STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021
Stars: ✭ 105 (+238.71%)
Mutual labels:  tts, style-transfer, prosody
One-Shot-Voice-Cloning
☺️ One Shot Voice Cloning base on Unet-TTS
Stars: ✭ 118 (+280.65%)
Mutual labels:  tts, style-transfer, voice-cloning
Tacotron Pytorch
Pytorch implementation of Tacotron
Stars: ✭ 189 (+509.68%)
Mutual labels:  tts, tacotron
Multi Tacotron Voice Cloning
Phoneme multilingual(Russian-English) voice cloning based on
Stars: ✭ 192 (+519.35%)
Mutual labels:  tts, tacotron
FCH-TTS
A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型,适用于英语、普通话/中文、日语、韩语、俄语和藏语(当前已测试)。
Stars: ✭ 154 (+396.77%)
Mutual labels:  tts, tacotron
Wavernn
WaveRNN Vocoder + TTS
Stars: ✭ 1,636 (+5177.42%)
Mutual labels:  tts, tacotron
Tensorflowtts
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Stars: ✭ 2,382 (+7583.87%)
Mutual labels:  tts, voice-cloning
Tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Stars: ✭ 2,581 (+8225.81%)
Mutual labels:  tts, tacotron
Tts
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Stars: ✭ 305 (+883.87%)
Mutual labels:  tts, tacotron
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+2612.9%)
Mutual labels:  tts, voice-cloning
Daft-Exprt
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Stars: ✭ 41 (+32.26%)
Mutual labels:  tts, prosody
Tacotron Wavernn
TTS (Tacotron + WaveRNN)
Stars: ✭ 40 (+29.03%)
Mutual labels:  tts, tacotron
Real Time Voice Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Stars: ✭ 32,095 (+103432.26%)
Mutual labels:  tts, voice-cloning
Gst Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Stars: ✭ 175 (+464.52%)
Mutual labels:  tts, tacotron
Tacotron
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Stars: ✭ 493 (+1490.32%)
Mutual labels:  tts, tacotron
Mimic Recording Studio
Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2
Stars: ✭ 202 (+551.61%)
Mutual labels:  tts, tacotron
Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
Stars: ✭ 22 (-29.03%)
Mutual labels:  tts, tacotron
Tts
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Stars: ✭ 5,427 (+17406.45%)
Mutual labels:  tts, tacotron
tacotron2
Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow
Stars: ✭ 102 (+229.03%)
Mutual labels:  tts, tacotron
TTS tf
WIP Tensorflow implementation of https://github.com/mozilla/TTS
Stars: ✭ 14 (-54.84%)
Mutual labels:  tts, tacotron

Text to Speech(TTS)/Style Transfer/Voice Cloning Landscape

Reddit Posts:

Samples from github:

Samples Pretrained Models Code Paper Output Quality
Baidu's Deep Voice samples(official) -- -- -- D
Baidu's Deep Voice 3 samples(official) -- -- 1710.07654 B
Google Tacotron2 samples(official) -- -- 1712.05884 A
Google tacotron + style transfer sample(official) -- -- 1803.09047 A
NVIDIA's waveglow Download Model Code 1811.00002 A
NVIDIA's tacotron2 + waveglow Download Model Code -- A
Griffin-Lim -- -- -- A
Deepmind Neural Discrete Representation Learning samples(official) -- -- 1711.00937 B
r9y9's wavenet vocoder Tacotron2(189k iterations) (Download Tacotron2 model) - (Download wavenet model(1000k iterations)) - (Get models) -- 1712.05884 and 1611.09482 B
dhgrs's implementation of Neural Discrete Representation Learning samples Download Model Code 1711.00937 D
mazzzystar's Tacotron-WaveRNN samples(730k iterations) Get Model Code -- A
syang1993's tacotron + style transfer samples(200k iterations) Model ErnstTmp(232k iter) -- 1803.09047 and 1803.09017 C
keithito's tacotron samples(414k iterations) Get model -- -- D
rayhane's Tacotron2 samples(6k4 steps(whatever that means)) -- -- -- D
Kyubyong's tacotron on LJ dataset(200k iterations) Download model -- -- D
Kyubyong's tacotron on nick dataset(215k iterations) -- -- -- D
Kyubyong's tacotron on web dataset(183k iterations) Download model -- -- D
Kyubyong's expressive tacotron(420k iterations) -- Code 1803.09047 D
Kyubyong's dc-tts on LJ dataset(800k iterations) Get model -- -- D
Kyubyong's dc-tts on nick dataset(800k iterations) -- -- -- D
Kyubyong's dc-tts kate(800k iterations) -- -- -- D
andabi's deep voice conversion -- -- -- D
Facebook Loop samples(official) Get model -- -- D
mazzzystar's randomCNN voice transfer -- -- 1712.08363 D

Work in progress:

If I missed your output sample/demo in this consolidation, just add and send a pull request. I will be more than happy to add it. Thanks!

Codelabs:

Product Demos:

Related Works:

Arxiv-sanity

Support:

If you want the good work to continue please support us on

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].