Tensorflowttsπ TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Stars: β 2,382 (+1601.43%)
Comprehensive-Tacotron2PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
Stars: β 22 (-84.29%)
Ttsπ€ π¬ Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Stars: β 5,427 (+3776.43%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: β 52 (-62.86%)
AdaSpeechAdaSpeech: Adaptive Text to Speech for Custom Voice
Stars: β 108 (-22.86%)
Expressive-FastSpeech2PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Stars: β 139 (-0.71%)
edittsOfficial implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: β 74 (-47.14%)
Voice BuilderAn opensource text-to-speech (TTS) voice building tool
Stars: β 362 (+158.57%)
Cs224n Gpu That TalksAttention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
Stars: β 52 (-62.86%)
VAENAR-TTSPyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
Stars: β 66 (-52.86%)
Daft-ExprtPyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Stars: β 41 (-70.71%)
vitsVITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Stars: β 1,604 (+1045.71%)
ttslearnttslearn: Library for Pythonγ§ε¦γΆι³ε£°εζ (Text-to-speech with Python)
Stars: β 158 (+12.86%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: β 27 (-80.71%)
ParallelwaveganUnofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Stars: β 682 (+387.14%)
Tacotron2-PyTorchYet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
Stars: β 118 (-15.71%)
LightspeechLightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: β 31 (-77.86%)
JSpeakA Text to Speech Reader Front-end that Reads from the Clipboard and with Exceptionable Features
Stars: β 16 (-88.57%)
StyleSpeechOfficial implementation of Meta-StyleSpeech and StyleSpeech
Stars: β 161 (+15%)
Zero-Shot-TTSUnofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: β 33 (-76.43%)
Parallel-Tacotron2PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Stars: β 149 (+6.43%)
WavernnWaveRNN Vocoder + TTS
Stars: β 1,636 (+1068.57%)
talkieText-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.
Stars: β 43 (-69.29%)
LVCNetLVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
Stars: β 67 (-52.14%)
DurianImplementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: β 111 (-20.71%)
MaryttsMARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
Stars: β 1,699 (+1113.57%)
Pytorch Dc TtsText to Speech with PyTorch (English and Mongolian)
Stars: β 122 (-12.86%)
FastSpeech2PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
Stars: β 163 (+16.43%)
CrystalCrystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
Stars: β 108 (-22.86%)
IMS-ToucanText-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: β 295 (+110.71%)
esp32-fliteSpeech synthesis running on ESP32 based on Flite engine.
Stars: β 28 (-80%)
ParakeetPAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)
Stars: β 279 (+99.29%)
Glow TtsA Generative Flow for Text-to-Speech via Monotonic Alignment Search
Stars: β 284 (+102.86%)
Cognitive Speech TtsMicrosoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Stars: β 312 (+122.86%)
WsayWindows "say"
Stars: β 36 (-74.29%)
FastSpeech2Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech β
Stars: β 64 (-54.29%)
Jsut LabHTS-style full-context labels for JSUT v1.1
Stars: β 28 (-80%)
Cross-Speaker-Emotion-TransferPyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Stars: β 107 (-23.57%)
open-speech-corporaπ A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: β 841 (+500.71%)
WavegradImplementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Stars: β 245 (+75%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: β 103 (-26.43%)
WaveGrad2PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Stars: β 55 (-60.71%)
Fre-GAN-pytorchFre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: β 73 (-47.86%)
Hifi GanHiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Stars: β 325 (+132.14%)
Multilingual text to speechAn implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Stars: β 324 (+131.43%)
MouseTooltipTranslatorchrome extension - When mouse hover on text, it shows translated tooltip using google translate
Stars: β 93 (-33.57%)
XzvoiceFree and open source text-to-speech software
Stars: β 355 (+153.57%)
sova-tts-engineTacotron2 based engine for the SOVA-TTS project
Stars: β 63 (-55%)
TtsπΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Stars: β 305 (+117.86%)
Transformer TtsA Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
Stars: β 418 (+198.57%)
CboardAAC communication system with text-to-speech for the browser
Stars: β 437 (+212.14%)
TFGANTFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: β 65 (-53.57%)
ZhrtvcChinese real time voice cloning (VC) and Chinese text to speech (TTS). ε₯½η¨ηδΈζθ―ι³ε
ιε
ΌδΈζθ―ι³εζη³»η»οΌε
ε«θ―ι³ηΌη ε¨γθ―ι³εζε¨γε£°η ε¨εε―θ§ε樑εγ
Stars: β 771 (+450.71%)
Transformerttsπ€π¬ Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
Stars: β 617 (+340.71%)
Rhvoicea free and open source speech synthesizer for Russian and other languages
Stars: β 750 (+435.71%)
Tacotron2A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Stars: β 43 (-69.29%)
MerlinThis is now the official location of the Merlin project.
Stars: β 1,168 (+734.29%)
ttsflowtensorflow speech synthesis c++ inference for voicenet
Stars: β 17 (-87.86%)