ishandutta2007 / Text-to-Speech-Landscape

Licence: other

No description or website provided.

Projects that are alternatives of or similar to Text-to-Speech-Landscape

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021

Stars: ✭ 105 (+238.71%)

Mutual labels: tts, style-transfer, prosody

One-Shot-Voice-Cloning

☺️ One Shot Voice Cloning based on Unet-TTS

Stars: ✭ 118 (+280.65%)

Mutual labels: tts, style-transfer, voice-cloning

Tacotron Pytorch

PyTorch implementation of Tacotron

Stars: ✭ 189 (+509.68%)

Mutual labels: tts, tacotron

Multi Tacotron Voice Cloning

Phoneme multilingual(Russian-English) voice cloning based on

Stars: ✭ 192 (+519.35%)

Mutual labels: tts, tacotron

FCH-TTS

A fast Text-to-Speech (TTS) model. Works well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (tested so far).

Stars: ✭ 154 (+396.77%)

Mutual labels: tts, tacotron

Wavernn

WaveRNN Vocoder + TTS

Stars: ✭ 1,636 (+5177.42%)

Mutual labels: tts, tacotron

Tensorflowtts

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)

Stars: ✭ 2,382 (+7583.87%)

Mutual labels: tts, voice-cloning

Tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Stars: ✭ 2,581 (+8225.81%)

Mutual labels: tts, tacotron

Tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Stars: ✭ 305 (+883.87%)

Mutual labels: tts, tacotron

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+2612.9%)

Mutual labels: tts, voice-cloning

Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Stars: ✭ 41 (+32.26%)

Mutual labels: tts, prosody

Tacotron Wavernn

TTS (Tacotron + WaveRNN)

Stars: ✭ 40 (+29.03%)

Mutual labels: tts, tacotron

Real Time Voice Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Stars: ✭ 32,095 (+103432.26%)

Mutual labels: tts, voice-cloning

Gst Tacotron

A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

Stars: ✭ 175 (+464.52%)

Mutual labels: tts, tacotron

Tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

Stars: ✭ 493 (+1490.32%)

Mutual labels: tts, tacotron

Mimic Recording Studio

Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2

Stars: ✭ 202 (+551.61%)

Mutual labels: tts, tacotron

Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS, along with several techniques to improve the robustness and efficiency of the model.

Stars: ✭ 22 (-29.03%)

Mutual labels: tts, tacotron

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+17406.45%)

Mutual labels: tts, tacotron

tacotron2

Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow

Stars: ✭ 102 (+229.03%)

Mutual labels: tts, tacotron

TTS tf

WIP Tensorflow implementation of https://github.com/mozilla/TTS

Stars: ✭ 14 (-54.84%)

Mutual labels: tts, tacotron


Text to Speech (TTS) / Style Transfer / Voice Cloning Landscape

Reddit Posts:

Samples from GitHub:

Samples	Pretrained Models	Code	Paper (arXiv ID)	Output Quality
Baidu's Deep Voice samples(official)	--	--	--	D
Baidu's Deep Voice 3 samples(official)	--	--	1710.07654	B
Google Tacotron2 samples(official)	--	--	1712.05884	A
Google tacotron + style transfer sample(official)	--	--	1803.09047	A
NVIDIA's waveglow	Download Model	Code	1811.00002	A
NVIDIA's tacotron2 + waveglow	Download Model	Code	--	A
Griffin-Lim	--	--	--	A
Deepmind Neural Discrete Representation Learning samples(official)	--	--	1711.00937	B
r9y9's wavenet vocoder Tacotron2(189k iterations)	(Download Tacotron2 model) - (Download wavenet model(1000k iterations)) - (Get models)	--	1712.05884 and 1611.09482	B
dhgrs's implementation of Neural Discrete Representation Learning samples	Download Model	Code	1711.00937	D
mazzzystar's Tacotron-WaveRNN samples(730k iterations)	Get Model	Code	--	A
syang1993's tacotron + style transfer samples(200k iterations)	Model ErnstTmp(232k iter)	--	1803.09047 and 1803.09017	C
keithito's tacotron samples(414k iterations)	Get model	--	--	D
rayhane's Tacotron2 samples(6k4 steps(whatever that means))	--	--	--	D
Kyubyong's tacotron on LJ dataset(200k iterations)	Download model	--	--	D
Kyubyong's tacotron on nick dataset(215k iterations)	--	--	--	D
Kyubyong's tacotron on web dataset(183k iterations)	Download model	--	--	D
Kyubyong's expressive tacotron(420k iterations)	--	Code	1803.09047	D
Kyubyong's dc-tts on LJ dataset(800k iterations)	Get model	--	--	D
Kyubyong's dc-tts on nick dataset(800k iterations)	--	--	--	D
Kyubyong's dc-tts kate(800k iterations)	--	--	--	D
andabi's deep voice conversion	--	--	--	D
Facebook Loop samples(official)	Get model	--	--	D
mazzzystar's randomCNN voice transfer	--	--	1712.08363	D

Work in progress:

If I missed your output sample or demo in this consolidation, just add it and send a pull request. I will be more than happy to merge it. Thanks!

Codelabs:

https://github.com/tugstugi/dl-colab-notebooks

Product Demos:

Lyrebird samples (official)
Lyrebird Demo (official)
Google Duplex Demo (official)
Adobe Voco Demo (official)
Voice Cloning Toolbox (official)

Related Works:

https://github.com/tensorflow/magenta

Arxiv-sanity

Support:

If you want the good work to continue please support us on

