Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+353.85%)

Mutual labels: speech, tts, speech-synthesis

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (+949.23%)

Mutual labels: tts, speech-synthesis

Jsut Lab

HTS-style full-context labels for JSUT v1.1

Stars: ✭ 28 (-56.92%)

Mutual labels: tts, speech-synthesis

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-18.46%)

Mutual labels: tts, speech-synthesis

Wavernn

WaveRNN Vocoder + TTS

Stars: ✭ 1,636 (+2416.92%)

Mutual labels: tts, speech-synthesis

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (-20%)

Mutual labels: tts, speech-synthesis

Crystal

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

Stars: ✭ 108 (+66.15%)

Mutual labels: tts, speech-synthesis

opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Stars: ✭ 21 (-67.69%)

Mutual labels: speech, tts

Tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Stars: ✭ 2,581 (+3870.77%)

Mutual labels: tts, speech-synthesis

Marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java

Stars: ✭ 1,699 (+2513.85%)

Mutual labels: tts, speech-synthesis

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (+36.92%)

Mutual labels: speech, tts

Glow Tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Stars: ✭ 284 (+336.92%)

Mutual labels: tts, speech-synthesis

Pytorch Dc Tts

Text to Speech with PyTorch (English and Mongolian)

Stars: ✭ 122 (+87.69%)

Mutual labels: tts, speech-synthesis

melgan

MelGAN implementation with Multi-Band and Full Band supports...

Stars: ✭ 54 (-16.92%)

Mutual labels: speech, speech-synthesis

Voice2Mesh

CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?

Stars: ✭ 67 (+3.08%)

Mutual labels: speech, speech-synthesis

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+733.85%)

Mutual labels: tts, speech-synthesis

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (+398.46%)

Mutual labels: tts, speech-synthesis

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (+29.23%)

Mutual labels: speech, speech-synthesis

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (+400%)

Mutual labels: tts, speech-synthesis

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (+58.46%)

Mutual labels: tts, speech-synthesis

Cnn vocoder

A fast cnn-based vocoder

Stars: ✭ 74 (+13.85%)

Mutual labels: tts, speech-synthesis

Deepvoice3 pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Stars: ✭ 1,654 (+2444.62%)

Mutual labels: tts, speech-synthesis

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (+380%)

Mutual labels: tts, speech-synthesis

Awesome Speech Recognition Speech Synthesis Papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+3107.69%)

Mutual labels: tts, speech-synthesis

Tensorflowtts

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Stars: ✭ 2,382 (+3564.62%)

Mutual labels: tts, speech-synthesis

Tacotron pytorch

PyTorch implementation of Tacotron speech synthesis model.

Stars: ✭ 242 (+272.31%)

Mutual labels: speech, speech-synthesis

Tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Stars: ✭ 305 (+369.23%)

Mutual labels: speech, tts

Cboard

AAC communication system with text-to-speech for the browser

Stars: ✭ 437 (+572.31%)

Mutual labels: speech, tts

Tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

Stars: ✭ 493 (+658.46%)

Mutual labels: speech, tts

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+8249.23%)

Mutual labels: speech, tts

Android Speech

Android speech recognition and text to speech made easy

Stars: ✭ 310 (+376.92%)

Mutual labels: speech, tts

Pysptk

A python wrapper for Speech Signal Processing Toolkit (SPTK).

Stars: ✭ 297 (+356.92%)

Mutual labels: speech, speech-synthesis

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+653.85%)

Mutual labels: speech, speech-synthesis

Dc tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

Stars: ✭ 1,017 (+1464.62%)

Mutual labels: speech, tts

Tts

Tools to convert text to speech 📚💬

Stars: ✭ 84 (+29.23%)

Mutual labels: speech, tts

Tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Stars: ✭ 1,756 (+2601.54%)

Mutual labels: speech, tts

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+2863.08%)

Mutual labels: speech, speech-synthesis

Neural Voice Cloning With Few Samples

Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu

Stars: ✭ 211 (+224.62%)

Mutual labels: speech, speech-synthesis

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (+112.31%)

Mutual labels: speech, speech-synthesis

Diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Stars: ✭ 139 (+113.85%)

Mutual labels: speech, speech-synthesis

Tts

Text-to-Speech for Arduino

Stars: ✭ 118 (+81.54%)

Mutual labels: speech, tts

Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-66.15%)

Mutual labels: tts, speech-synthesis

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (+329.23%)

Mutual labels: tts, speech-synthesis

MelNet-SpeechGeneration

Implementation of MelNet in PyTorch to generate high-fidelity audio samples

Stars: ✭ 19 (-70.77%)

Mutual labels: speech, speech-synthesis

Gtts

Python library and CLI tool to interface with Google Translate's text-to-speech API