PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS and several techniques to improve the robustness and efficiency of the model.

Stars: ✭ 22 (-92.95%)

Mutual labels: text-to-speech, tts, speech-synthesis

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Stars: ✭ 1,604 (+414.1%)

Mutual labels: text-to-speech, tts, speech-synthesis

Jsut Lab

HTS-style full-context labels for JSUT v1.1

Stars: ✭ 28 (-91.03%)

Mutual labels: speech-synthesis, text-to-speech, tts

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Stars: ✭ 107 (-65.71%)

Mutual labels: text-to-speech, tts, speech-synthesis

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with PyTorch

Stars: ✭ 682 (+118.59%)

Mutual labels: speech-synthesis, text-to-speech, tts

VAENAR-TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Stars: ✭ 66 (-78.85%)

Mutual labels: text-to-speech, tts, speech-synthesis

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+16.03%)

Mutual labels: speech-synthesis, text-to-speech, tts

WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Stars: ✭ 55 (-82.37%)

Mutual labels: text-to-speech, tts, speech-synthesis

Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Stars: ✭ 41 (-86.86%)

Mutual labels: text-to-speech, tts, speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-90.06%)

Mutual labels: speech-synthesis, text-to-speech, tts

Wsay

Windows "say"

Stars: ✭ 36 (-88.46%)

Mutual labels: speech-synthesis, text-to-speech, tts

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (-49.36%)

Mutual labels: text-to-speech, tts, speech-synthesis

Wavernn

WaveRNN Vocoder + TTS

Stars: ✭ 1,636 (+424.36%)

Mutual labels: speech-synthesis, text-to-speech, tts

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-76.28%)

Mutual labels: text-to-speech, tts, speech-synthesis

Tensorflowtts

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)

Stars: ✭ 2,382 (+663.46%)

Mutual labels: speech-synthesis, text-to-speech, tts

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-83.33%)

Mutual labels: text-to-speech, tts, speech-synthesis

esp32-flite

Speech synthesis running on ESP32 based on Flite engine.

Stars: ✭ 28 (-91.03%)

Mutual labels: text-to-speech, tts, speech-synthesis

Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Stars: ✭ 149 (-52.24%)

Mutual labels: text-to-speech, tts, speech-synthesis

Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Stars: ✭ 139 (-55.45%)

Mutual labels: text-to-speech, tts, speech-synthesis

Transformer Tts

A PyTorch Implementation of "Neural Speech Synthesis with Transformer Network"

Stars: ✭ 418 (+33.97%)

Mutual labels: text-to-speech, tts, transformer

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (-83.33%)

Mutual labels: speech-synthesis, text-to-speech, tts

TensorVox

Desktop application for neural speech synthesis written in C++

Stars: ✭ 140 (-55.13%)

Mutual labels: text-to-speech, tts, speech-synthesis

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (-21.47%)

Mutual labels: speech-synthesis, text-to-speech, tts

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-83.01%)

Mutual labels: text-to-speech, tts, speech-synthesis

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (-10.58%)

Mutual labels: speech-synthesis, text-to-speech, tts

talkie

Text-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.

Stars: ✭ 43 (-86.22%)

Mutual labels: text-to-speech, tts, speech-synthesis

Crystal

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

Stars: ✭ 108 (-65.38%)

Mutual labels: speech-synthesis, text-to-speech, tts

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (+3.85%)

Mutual labels: speech-synthesis, text-to-speech, tts

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (-64.42%)

Mutual labels: speech-synthesis, text-to-speech, tts

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (-5.45%)

Mutual labels: text-to-speech, tts, speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-76.6%)

Mutual labels: text-to-speech, tts, speech-synthesis

soundpad-text-to-speech

Text-To-Speech for Soundpad

Stars: ✭ 29 (-90.71%)

Mutual labels: text-to-speech, tts

MouseTooltipTranslator

Chrome extension: when the mouse hovers over text, it shows a translated tooltip using Google Translate

Stars: ✭ 93 (-70.19%)

Mutual labels: text-to-speech, tts

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-88.78%)

Mutual labels: text-to-speech, speech-synthesis

FastSpeech2

Multi-Speaker PyTorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊

Stars: ✭ 64 (-79.49%)

Mutual labels: text-to-speech, tts

TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Stars: ✭ 65 (-79.17%)

Mutual labels: tts, speech-synthesis

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-83.97%)

Mutual labels: text-to-speech, speech-synthesis

ukrainian-tts

Ukrainian TTS (text-to-speech) using Coqui TTS

Stars: ✭ 74 (-76.28%)

Mutual labels: text-to-speech, tts

deep-learning-german-tts

Thorsten-Voice: A free-to-use, offline, high-quality German TTS voice that should be available for every project without any licensing hassle.

Stars: ✭ 268 (-14.1%)

Mutual labels: tts, speech-synthesis

EMPHASIS-pytorch

EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System

Stars: ✭ 15 (-95.19%)

Mutual labels: text-to-speech, tts

voices

macOS CLI for changing the default TTS (text-to-speech) voice and printing information about and speaking text with multiple voices.

Stars: ✭ 53 (-83.01%)

Mutual labels: text-to-speech, tts

melgan

MelGAN implementation with Multi-Band and Full-Band support...

Stars: ✭ 54 (-82.69%)

Mutual labels: text-to-speech, speech-synthesis

google-translate-tts

Node library for Google Translate TTS (Text-to-Speech) API

Stars: ✭ 23 (-92.63%)

Mutual labels: text-to-speech, tts

Nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Stars: ✭ 308 (-1.28%)

Mutual labels: speech-synthesis, text-to-speech

brasiltts

Brasil TTS is a set of Brazilian Portuguese speech synthesizers that read screens aloud for people with visual impairments. It converts text to audio, allowing blind or low-vision people to access the content displayed on screen. Although the main target audience of text-to-speech systems such as Brasil TTS consists of…

Stars: ✭ 34 (-89.1%)

Mutual labels: text-to-speech, tts

JSpeak

A Text-to-Speech Reader Front-end that Reads from the Clipboard, with Exceptional Features

Stars: ✭ 16 (-94.87%)

Mutual labels: text-to-speech, tts

golang-tts

Text-to-Speech Go package based on the Amazon Polly service

Stars: ✭ 19 (-93.91%)

Mutual labels: text-to-speech, tts

SpeakIt Vietnamese TTS

Vietnamese Text-to-Speech on Windows Project (zalo-speech)