Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (-74.74%)

Mutual labels: text-to-speech, speech-synthesis

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with PyTorch

Stars: ✭ 682 (-41.61%)

Mutual labels: speech-synthesis, text-to-speech

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Stars: ✭ 107 (-90.84%)

Mutual labels: text-to-speech, speech-synthesis

Jsut Lab

HTS-style full-context labels for JSUT v1.1

Stars: ✭ 28 (-97.6%)

Mutual labels: speech-synthesis, text-to-speech

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-93.75%)

Mutual labels: text-to-speech, speech-synthesis

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (-88.18%)

Mutual labels: speech-synthesis, text-to-speech

Tacotron2

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions".

Stars: ✭ 43 (-96.32%)

Mutual labels: speech-synthesis, text-to-speech

Tensorflowtts

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)

Stars: ✭ 2,382 (+103.94%)

Mutual labels: speech-synthesis, text-to-speech

ttsflow

TensorFlow speech synthesis C++ inference for voicenet

Stars: ✭ 17 (-98.54%)

Mutual labels: text-to-speech, speech-synthesis

Vocgan

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Stars: ✭ 158 (-86.47%)

Mutual labels: speech-synthesis, text-to-speech

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (-86.22%)

Mutual labels: text-to-speech, speech-synthesis

TensorVox

Desktop application for neural speech synthesis written in C++

Stars: ✭ 140 (-88.01%)

Mutual labels: text-to-speech, speech-synthesis

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (-90.75%)

Mutual labels: text-to-speech, speech-synthesis

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Stars: ✭ 1,604 (+37.33%)

Mutual labels: text-to-speech, speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-97.35%)

Mutual labels: speech-synthesis, text-to-speech

Neural-HMM

Neural HMMs are all you need (for high-quality attention-free TTS)

Stars: ✭ 69 (-94.09%)

Mutual labels: text-to-speech, speech-synthesis

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-95.55%)

Mutual labels: text-to-speech, speech-synthesis

LVCNet

LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation

Stars: ✭ 67 (-94.26%)

Mutual labels: text-to-speech, speech-synthesis

Nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Stars: ✭ 308 (-73.63%)

Mutual labels: speech-synthesis, text-to-speech

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (-73.29%)

Mutual labels: speech-synthesis, text-to-speech

Diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Stars: ✭ 139 (-88.1%)

Mutual labels: speech-synthesis, text-to-speech

Espeak

eSpeak NG is an open source speech synthesizer that supports 101 languages and accents.

Stars: ✭ 339 (-70.98%)

Mutual labels: speech-synthesis, text-to-speech

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (-88.61%)

Mutual labels: speech-synthesis, text-to-speech

Tacotron 2

DeepMind's Tacotron-2 TensorFlow implementation

Stars: ✭ 1,968 (+68.49%)

Mutual labels: speech-synthesis, text-to-speech

Marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure Java

Stars: ✭ 1,699 (+45.46%)

Mutual labels: speech-synthesis, text-to-speech

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (-79.02%)

Mutual labels: speech-synthesis, text-to-speech

Naomi

The Naomi Project is an open-source, technology-agnostic platform for developing always-on, voice-controlled applications!

Stars: ✭ 171 (-85.36%)

Mutual labels: speech-synthesis, text-to-speech

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-95.46%)

Mutual labels: text-to-speech, speech-synthesis

Pytorch Dc Tts

Text to Speech with PyTorch (English and Mongolian)

Stars: ✭ 122 (-89.55%)

Mutual labels: speech-synthesis, text-to-speech

Rhvoice

A free and open-source speech synthesizer for Russian and other languages

Stars: ✭ 750 (-35.79%)

Mutual labels: speech-synthesis, text-to-speech

web-speech-cognitive-services

Polyfill for the Web Speech API using Cognitive Services Bing Speech, for both speech-to-text and text-to-speech services.

Stars: ✭ 35 (-97%)

Mutual labels: text-to-speech, speech-synthesis

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-97.17%)

Mutual labels: text-to-speech, speech-synthesis

Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Stars: ✭ 139 (-88.1%)

Mutual labels: text-to-speech, speech-synthesis

VAENAR-TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Stars: ✭ 66 (-94.35%)

Mutual labels: text-to-speech, speech-synthesis

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-95.72%)

Mutual labels: text-to-speech, speech-synthesis

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (-95.55%)

Mutual labels: speech-synthesis, text-to-speech

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (-90.5%)

Mutual labels: speech-synthesis, text-to-speech

Sinsy-NG

(discontinued) 🎵The Formant-Based All Language Singing Voice Synthesis System: Sinsy-NG

Stars: ✭ 15 (-98.72%)

Mutual labels: text-to-speech, speech-synthesis

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-97.69%)

Mutual labels: text-to-speech, speech-synthesis

Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Stars: ✭ 149 (-87.24%)

Mutual labels: text-to-speech, speech-synthesis

WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Stars: ✭ 55 (-95.29%)

Mutual labels: text-to-speech, speech-synthesis

talkie

Text-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.

Stars: ✭ 43 (-96.32%)

Mutual labels: text-to-speech, speech-synthesis

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (-86.47%)

Mutual labels: text-to-speech, speech-synthesis

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+632.88%)

Mutual labels: text-to-speech, speech-synthesis

Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Stars: ✭ 41 (-96.49%)

Mutual labels: text-to-speech, speech-synthesis

Glow Tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Stars: ✭ 284 (-75.68%)

Mutual labels: speech-synthesis, text-to-speech

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (-76.11%)

Mutual labels: speech-synthesis, text-to-speech

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (-72.17%)

Mutual labels: speech-synthesis, text-to-speech

Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-98.12%)

Mutual labels: text-to-speech, speech-synthesis

Wavernn

WaveRNN Vocoder + TTS

Stars: ✭ 1,636 (+40.07%)

Mutual labels: speech-synthesis, text-to-speech

Crystal

Crystal - C++ implementation of a unified framework for a multilingual TTS synthesis engine, with the SSML specification as its interface.

Stars: ✭ 108 (-90.75%)

Mutual labels: speech-synthesis, text-to-speech

melgan

MelGAN implementation with Multi-Band and Full-Band support...