All Categories → Machine Learning → speech-synthesis

Top 141 speech-synthesis open source projects

Wavegrad
Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Tacotron pytorch
PyTorch implementation of Tacotron speech synthesis model.
Normit
Translations with speech synthesis in your terminal as a node package
Tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Neural Voice Cloning With Few Samples
Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu
Universalvocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"
Expressive tacotron
Tensorflow Implementation of Expressive Tacotron
Naomi
The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Cyclegan Vc2
Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
Vocgan
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Tacotron 2
DeepMind's Tacotron-2 Tensorflow implementation
Tensorflowtts
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Prosody
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Zerospeech
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
Xva Synth
Machine learning based speech synthesis Electron app, with voices from specific characters from video games
Cotatron
Official code for Cotatron @ INTERSPEECH 2020
Legacy straight
A vocoder framework which had been widely used in research community since 1999.
Marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
Deepvoice3 pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Durian
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Crystal
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
Tacotron Pytorch
A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Waveflow
A PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio"
Cross vc
Cross-lingual Voice Conversion
Cnn vocoder
A fast cnn-based vocoder
Merlin
This is now the official location of the Merlin project.
Speech ai
Simple speech linguistic AI with Python
Tf Wavenet vocoder
Wavenet and its applications with Tensorflow
Pink Trombone
A programmable version of Neil Thapen's Pink Trombone
Cs224n Gpu That Talks
Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
Tacotron2
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Artyom.js
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Lightspeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Jsut Lab
HTS-style full-context labels for JSUT v1.1
Pororo
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
Espeak Ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
World
A high-quality speech analysis, manipulation and synthesis system
Rhvoice
a free and open source speech synthesizer for Russian and other languages
Parallelwavegan
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Sam
Software Automatic Mouth - Tiny Speech Synthesizer
Fastspeech
The Implementation of FastSpeech based on pytorch.
Melgan Neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Flowtron
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
Athena
an open-source implementation of sequence-to-sequence based speech processing engine
Termit
Translations with speech synthesis in your terminal as a ruby gem
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Gantts
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
1-60 of 141 speech-synthesis projects