seungwonpark / Awesome Tts Samples
Licence: cc0-1.0
Awesome list of TTS papers with audio samples
Stars: ✭ 35
Projects that are alternatives of or similar to Awesome Tts Samples
Hifi Gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Stars: ✭ 325 (+828.57%)
Mutual labels: tts
Tts
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Stars: ✭ 5,427 (+15405.71%)
Mutual labels: tts
Transformer Tts
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
Stars: ✭ 418 (+1094.29%)
Mutual labels: tts
Athena
an open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+1448.57%)
Mutual labels: tts
Cognitive Speech Tts
Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Stars: ✭ 312 (+791.43%)
Mutual labels: tts
Cboard
AAC communication system with text-to-speech for the browser
Stars: ✭ 437 (+1148.57%)
Mutual labels: tts
Parallelwavegan
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Stars: ✭ 682 (+1848.57%)
Mutual labels: tts
Voice Builder
An opensource text-to-speech (TTS) voice building tool
Stars: ✭ 362 (+934.29%)
Mutual labels: tts
Real Time Voice Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Stars: ✭ 32,095 (+91600%)
Mutual labels: tts
Multilingual text to speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Stars: ✭ 324 (+825.71%)
Mutual labels: tts
Facemoji
😆 A voice chatbot that can imitate your expression. OpenCV+Dlib+Live2D+Moments Recorder+Turing Robot+Iflytek IAT+Iflytek TTS
Stars: ✭ 320 (+814.29%)
Mutual labels: tts
Tacotron
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Stars: ✭ 493 (+1308.57%)
Mutual labels: tts
Lightspeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-11.43%)
Mutual labels: tts
Zhrtvc
Chinese real time voice cloning (VC) and Chinese text to speech (TTS). 好用的中文语音克隆兼中文语音合成系统,包含语音编码器、语音合成器、声码器和可视化模块。
Stars: ✭ 771 (+2102.86%)
Mutual labels: tts
Transformertts
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
Stars: ✭ 617 (+1662.86%)
Mutual labels: tts
awesome-tts-samples
List of TTS papers with audio samples provided by the authors. The last rows of each paper show the spectrogram inversion (vocoder) being used.
For more comprehensive list of important TTS papers, I recommmend reading xcmyz/speech-synthesis-paper written by Zhengxi Liu.
2020
-
FastPitch - FastPitch: Parallel Text-to-speech with Pitch Prediction
- https://fastpitch.github.io/
- WaveGlow
- EATS - End-to-End Adversarial Text-to-Speech
- Glow-TTS - Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search
-
Flowtron - Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis
- https://nv-adlr.github.io/Flowtron
- WaveGlow
2019
- Tacotron2+DCA - Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
-
GAN-TTS - High Fidelity Speech Synthesis with Adversarial Networks
- https://storage.googleapis.com/deepmind-media/research/abstract.wav
- End-to-end model (Built on top of 200Hz linguistic & log pitch features)
- Multi-lingual Tacotron2 - Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
- MelNet - MelNet: A Generative Model for Audio in the Frequency Domain
- FastSpeech - FastSpeech: Fast, Robust and Controllable Text to Speech
-
ParaNet - Parallel Neural Text-to-Speech
- https://parallel-neural-tts-demo.github.io
- WaveVAE, ClariNet, WaveNet
2018
- Transformer-TTS - Neural Speech Synthesis with Transformer Network
- Multi-speaker Tacotron2 - Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
- Tacotron2+GST - Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
2017
- Tacotron2 - Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
- Tacotron - Tacotron: Towards End-to-End Speech Synthesis
Contributing
TODO
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].