Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → seungwonpark → Awesome Tts Samples

seungwonpark / Awesome Tts Samples

Licence: cc0-1.0

Awesome list of TTS papers with audio samples

Labels

awesome tts

Projects that are alternatives of or similar to Awesome Tts Samples

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (+828.57%)

Mutual labels: tts

Melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Stars: ✭ 444 (+1168.57%)

Mutual labels: tts

Ekho

Chinese text-to-speech engine

Stars: ✭ 690 (+1871.43%)

Mutual labels: tts

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+15405.71%)

Mutual labels: tts

Transformer Tts

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

Stars: ✭ 418 (+1094.29%)

Mutual labels: tts

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+1448.57%)

Mutual labels: tts

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (+791.43%)

Mutual labels: tts

Jsut Lab

HTS-style full-context labels for JSUT v1.1

Stars: ✭ 28 (-20%)

Mutual labels: tts

Cboard

AAC communication system with text-to-speech for the browser

Stars: ✭ 437 (+1148.57%)

Mutual labels: tts

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (+1848.57%)

Mutual labels: tts

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+934.29%)

Mutual labels: tts

Xzvoice

Free and open source text-to-speech software

Stars: ✭ 355 (+914.29%)

Mutual labels: tts

Real Time Voice Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Stars: ✭ 32,095 (+91600%)

Mutual labels: tts

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (+825.71%)

Mutual labels: tts

Mtrans

Multi-source Translation

Stars: ✭ 711 (+1931.43%)

Mutual labels: tts

Facemoji

😆 A voice chatbot that can imitate your expression. OpenCV+Dlib+Live2D+Moments Recorder+Turing Robot+Iflytek IAT+Iflytek TTS

Stars: ✭ 320 (+814.29%)

Mutual labels: tts

Tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

Stars: ✭ 493 (+1308.57%)

Mutual labels: tts

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-11.43%)

Mutual labels: tts

Zhrtvc

Chinese real time voice cloning (VC) and Chinese text to speech (TTS). 好用的中文语音克隆兼中文语音合成系统，包含语音编码器、语音合成器、声码器和可视化模块。

Stars: ✭ 771 (+2102.86%)

Mutual labels: tts

Transformertts

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Stars: ✭ 617 (+1662.86%)

Mutual labels: tts

View All Similar Projects ➔

awesome-tts-samples

List of TTS papers with audio samples provided by the authors. The last rows of each paper show the spectrogram inversion (vocoder) being used.

For more comprehensive list of important TTS papers, I recommmend reading xcmyz/speech-synthesis-paper written by Zhengxi Liu.

2020

FastPitch - FastPitch: Parallel Text-to-speech with Pitch Prediction
- https://fastpitch.github.io/
- WaveGlow
EATS - End-to-End Adversarial Text-to-Speech
- https://deepmind.com/research/publications/End-to-End-Adversarial-Text-to-Speech
- End-to-end model
Glow-TTS - Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search
- https://jaywalnut310.github.io/glow-tts-demo
- WaveGlow
Flowtron - Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis
- https://nv-adlr.github.io/Flowtron
- WaveGlow

2019

Tacotron2+DCA - Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
- https://google.github.io/tacotron/publications/location_relative_attention
- WaveRNN
GAN-TTS - High Fidelity Speech Synthesis with Adversarial Networks
- https://storage.googleapis.com/deepmind-media/research/abstract.wav
- End-to-end model (Built on top of 200Hz linguistic & log pitch features)
Multi-lingual Tacotron2 - Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
- https://google.github.io/tacotron/publications/multilingual
- WaveRNN
MelNet - MelNet: A Generative Model for Audio in the Frequency Domain
FastSpeech - FastSpeech: Fast, Robust and Controllable Text to Speech
- https://speechresearch.github.io/fastspeech
- WaveGlow
ParaNet - Parallel Neural Text-to-Speech
- https://parallel-neural-tts-demo.github.io
- WaveVAE, ClariNet, WaveNet

2018

Transformer-TTS - Neural Speech Synthesis with Transformer Network
- https://neuraltts.github.io/transformertts
- WaveNet
Multi-speaker Tacotron2 - Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
- https://google.github.io/tacotron/publications/speaker_adaptation
- WaveNet
Tacotron2+GST - Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
- https://google.github.io/tacotron/publications/global_style_tokens
- Griffin-Lim

2017

Tacotron2 - Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
- https://google.github.io/tacotron/publications/tacotron2
- WaveNet
Tacotron - Tacotron: Towards End-to-End Speech Synthesis
- https://google.github.io/tacotron/publications/tacotron
- Griffin-Lim

Contributing

TODO

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 35

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗