Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+113.77%)

Mutual labels: text-to-speech, speech, speech-synthesis

Wsay

Windows "say"

Stars: ✭ 36 (-73.91%)

Mutual labels: speech, speech-synthesis, text-to-speech

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-77.54%)

Mutual labels: speech, speech-synthesis, text-to-speech

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (-19.57%)

Mutual labels: speech, speech-synthesis, text-to-speech

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+162.32%)

Mutual labels: speech, speech-synthesis, text-to-speech

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (+77.54%)

Mutual labels: speech, speech-synthesis, text-to-speech

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (+16.67%)

Mutual labels: text-to-speech, speech, speech-synthesis

Tacotron 2

DeepMind's Tacotron-2 Tensorflow implementation

Stars: ✭ 1,968 (+1326.09%)

Mutual labels: paper, speech-synthesis, text-to-speech

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-76.09%)

Mutual labels: text-to-speech, speech, speech-synthesis

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (-21.74%)

Mutual labels: text-to-speech, speech, speech-synthesis

Pytorch Dc Tts

Text to Speech with PyTorch (English and Mongolian)

Stars: ✭ 122 (-11.59%)

Mutual labels: speech-synthesis, text-to-speech

Tacotron2-PyTorch

Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.

Stars: ✭ 118 (-14.49%)

Mutual labels: text-to-speech, pretrained-models

Sinsy-NG

(discontinued) 🎵The Formant-Based All Language Singing Voice Syntheis System: Sinsy-NG

Stars: ✭ 15 (-89.13%)

Mutual labels: text-to-speech, speech-synthesis

MelNet-SpeechGeneration

Implementation of MelNet in PyTorch to generate high-fidelity audio samples

Stars: ✭ 19 (-86.23%)

Mutual labels: speech, speech-synthesis

Voice2Mesh

CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?

Stars: ✭ 67 (-51.45%)

Mutual labels: speech, speech-synthesis

LVCNet

LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation

Stars: ✭ 67 (-51.45%)

Mutual labels: text-to-speech, speech-synthesis

Tts

Text-to-Speech for Arduino

Stars: ✭ 118 (-14.49%)

Mutual labels: speech, text-to-speech

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+6102.9%)

Mutual labels: text-to-speech, speech-synthesis

esp32-flite

Speech synthesis running on ESP32 based on Flite engine.

Stars: ✭ 28 (-79.71%)

Mutual labels: text-to-speech, speech-synthesis

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (+102.17%)

Mutual labels: speech-synthesis, text-to-speech

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-80.43%)

Mutual labels: text-to-speech, speech-synthesis

WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Stars: ✭ 55 (-60.14%)

Mutual labels: text-to-speech, speech-synthesis

web-speech-demo

Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.

Stars: ✭ 19 (-86.23%)

Mutual labels: text-to-speech, speech

Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Stars: ✭ 41 (-70.29%)

Mutual labels: text-to-speech, speech-synthesis

Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Stars: ✭ 149 (+7.97%)

Mutual labels: text-to-speech, speech-synthesis

Neural-HMM

Neural HMMs are all you need (for high-quality attention-free TTS)

Stars: ✭ 69 (-50%)

Mutual labels: text-to-speech, speech-synthesis

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (+126.09%)

Mutual labels: speech-synthesis, text-to-speech

talkie

Text-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.

Stars: ✭ 43 (-68.84%)

Mutual labels: text-to-speech, speech-synthesis

Glow Tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Stars: ✭ 284 (+105.8%)

Mutual labels: speech-synthesis, text-to-speech

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (+135.51%)

Mutual labels: speech-synthesis, text-to-speech

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+3832.61%)

Mutual labels: speech, text-to-speech

Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-84.06%)

Mutual labels: text-to-speech, speech-synthesis

Nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Stars: ✭ 308 (+123.19%)

Mutual labels: speech-synthesis, text-to-speech

Pysptk

A python wrapper for Speech Signal Processing Toolkit (SPTK).

Stars: ✭ 297 (+115.22%)

Mutual labels: speech, speech-synthesis

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (+134.78%)

Mutual labels: speech-synthesis, text-to-speech

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+509.42%)

Mutual labels: text-to-speech, speech-synthesis

Cboard

AAC communication system with text-to-speech for the browser

Stars: ✭ 437 (+216.67%)

Mutual labels: speech, text-to-speech

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+255.07%)

Mutual labels: speech, speech-synthesis

Tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Stars: ✭ 305 (+121.01%)

Mutual labels: speech, text-to-speech

Rhvoice

a free and open source speech synthesizer for Russian and other languages

Stars: ✭ 750 (+443.48%)

Mutual labels: speech-synthesis, text-to-speech

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (+394.2%)

Mutual labels: speech-synthesis, text-to-speech

Espeak Ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

Stars: ✭ 799 (+478.99%)

Mutual labels: speech-synthesis, text-to-speech

Tacotron2

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

Stars: ✭ 43 (-68.84%)

Mutual labels: speech-synthesis, text-to-speech

Watbot

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

Stars: ✭ 64 (-53.62%)

Mutual labels: speech, text-to-speech

Jsut Lab

HTS-style full-context labels for JSUT v1.1

Stars: ✭ 28 (-79.71%)

Mutual labels: speech-synthesis, text-to-speech

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)