PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-21.43%)

Mutual labels: text-to-speech, tts, speech-synthesis

Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Stars: ✭ 41 (+46.43%)

Mutual labels: text-to-speech, tts, speech-synthesis

Crystal

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

Stars: ✭ 108 (+285.71%)

Mutual labels: text-to-speech, tts, speech-synthesis

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (+89.29%)

Mutual labels: text-to-speech, tts, speech-synthesis

talkie

Text-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.

Stars: ✭ 43 (+53.57%)

Mutual labels: text-to-speech, tts, speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (+10.71%)

Mutual labels: text-to-speech, tts, speech-synthesis

Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Stars: ✭ 139 (+396.43%)

Mutual labels: text-to-speech, tts, speech-synthesis

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (+85.71%)

Mutual labels: text-to-speech, tts, speech-synthesis

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Stars: ✭ 107 (+282.14%)

Mutual labels: text-to-speech, tts, speech-synthesis

Pytorch Dc Tts

Text to Speech with PyTorch (English and Mongolian)

Stars: ✭ 122 (+335.71%)

Mutual labels: text-to-speech, tts, speech-synthesis

VAENAR-TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Stars: ✭ 66 (+135.71%)

Mutual labels: text-to-speech, tts, speech-synthesis

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (+896.43%)

Mutual labels: text-to-speech, tts, speech-synthesis

WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Stars: ✭ 55 (+96.43%)

Mutual labels: text-to-speech, tts, speech-synthesis

Jsut Lab

HTS-style full-context labels for JSUT v1.1

Stars: ✭ 28 (+0%)

Mutual labels: text-to-speech, tts, speech-synthesis

Wavernn

WaveRNN Vocoder + TTS

Stars: ✭ 1,636 (+5742.86%)

Mutual labels: text-to-speech, tts, speech-synthesis

Wsay

Windows "say"

Stars: ✭ 36 (+28.57%)

Mutual labels: text-to-speech, tts, speech-synthesis

LVCNet

LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation

Stars: ✭ 67 (+139.29%)

Mutual labels: text-to-speech, tts, speech-synthesis

Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Stars: ✭ 149 (+432.14%)

Mutual labels: text-to-speech, tts, speech-synthesis

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+1192.86%)

Mutual labels: text-to-speech, tts, speech-synthesis

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (+775%)

Mutual labels: text-to-speech, tts, speech-synthesis

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (+2335.71%)

Mutual labels: text-to-speech, tts, speech-synthesis

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (+1057.14%)

Mutual labels: text-to-speech, tts, speech-synthesis

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (+1060.71%)

Mutual labels: text-to-speech, tts, speech-synthesis

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+953.57%)

Mutual labels: text-to-speech, tts, speech-synthesis

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (+296.43%)

Mutual labels: text-to-speech, tts, speech-synthesis

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+30471.43%)

Mutual labels: text-to-speech, speech-synthesis, flite

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+2903.57%)

Mutual labels: text-to-speech, tts, speech-synthesis

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (+464.29%)

Mutual labels: text-to-speech, tts, speech-synthesis

Marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java

Stars: ✭ 1,699 (+5967.86%)

Mutual labels: text-to-speech, tts, speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (+160.71%)

Mutual labels: text-to-speech, tts, speech-synthesis

Tensorflowtts

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Stars: ✭ 2,382 (+8407.14%)

Mutual labels: text-to-speech, tts, speech-synthesis

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (+285.71%)

Mutual labels: text-to-speech, tts, speech-synthesis

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (+1014.29%)

Mutual labels: text-to-speech, tts, speech-synthesis

Tts

Text-to-Speech for Arduino

Stars: ✭ 118 (+321.43%)

Mutual labels: text-to-speech, esp32, tts

Aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Stars: ✭ 1,942 (+6835.71%)

Mutual labels: text-to-speech, festival, tts

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (+164.29%)

Mutual labels: text-to-speech, tts, speech-synthesis

talkbot

Text-to-speech and translation bot for Discord

Stars: ✭ 27 (-3.57%)

Mutual labels: text-to-speech, tts

speak.awf

An Alfred 3 workflow that uses macOS's TTS (text-to-speech) feature to speak text aloud.

Stars: ✭ 29 (+3.57%)

Mutual labels: text-to-speech, tts

TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Stars: ✭ 65 (+132.14%)

Mutual labels: tts, speech-synthesis

hawking

The retro text-to-speech bot for Discord

Stars: ✭ 24 (-14.29%)

Mutual labels: text-to-speech, tts

brasiltts

Brasil TTS é um conjunto de sintetizadores de voz, em português do Brasil, que lê telas para portadores de deficiência visual. Transforma texto em áudio, permitindo que pessoas cegas ou com baixa visão tenham acesso ao conteúdo exibido na tela. Embora o principal público-alvo de sistemas de conversão texto-fala – como o Brasil TTS – seja formado…

Stars: ✭ 34 (+21.43%)

Mutual labels: text-to-speech, tts

MouseTooltipTranslator

chrome extension - When mouse hover on text, it shows translated tooltip using google translate

Stars: ✭ 93 (+232.14%)

Mutual labels: text-to-speech, tts

soundpad-text-to-speech

Text-To-Speech for Soundpad

Stars: ✭ 29 (+3.57%)

Mutual labels: text-to-speech, tts

TwitterPiBot

A Python based bot for Raspberry Pi that grabs tweets with a specific hashtag and reads them out loud.

Stars: ✭ 85 (+203.57%)

Mutual labels: text-to-speech, flite

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (+25%)

Mutual labels: text-to-speech, speech-synthesis

FastSpeech2

Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊

Stars: ✭ 64 (+128.57%)

Mutual labels: text-to-speech, tts

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (+78.57%)

Mutual labels: text-to-speech, speech-synthesis

ukrainian-tts

Ukrainian TTS (text-to-speech) using Coqui TTS

Stars: ✭ 74 (+164.29%)

Mutual labels: text-to-speech, tts

JSpeak

A Text to Speech Reader Front-end that Reads from the Clipboard and with Exceptionable Features

Stars: ✭ 16 (-42.86%)

Mutual labels: text-to-speech, tts

sam

SAM: Software Automatic Mouth (Ported from https://github.com/vidarh/SAM)