All Categories → Text Processing → text-to-speech

Top 175 text-to-speech open source projects

Wavegrad
Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Hantts
Chinese Text-to-Speech web service
Go Astibob
Golang framework to build an AI that can understand and speak back to you, and everything else you want
Tts Cube
End-2-end speech synthesis with recurrent neural networks
Ronor
Sonos smart speaker controller API and command-line tools
Waveglow
A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis
Vonage Ruby Sdk
Vonage REST API client for Ruby. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
Tacotron Pytorch
Pytorch implementation of Tacotron
Hms Ml Demo
HMS ML Demo provides an example of integrating Huawei ML Kit service into applications. This example demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, asr, and tts.
Google Tts
Google TTS (Text-To-Speech) for node.js
Doc2audiobook
Convert text documents to high fidelity audio(books).
Naomi
The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Vocgan
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Tacotron 2
DeepMind's Tacotron-2 Tensorflow implementation
Nonparaseq2seqvc code
Implementation code of non-parallel sequence-to-sequence VC
Aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Tensorflowtts
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Amazon Polly Sample
Sample application for Amazon Polly. Allows to convert any blog into an audio podcast.
Diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Vonage Python Sdk
Vonage Server SDK for Python. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
Androidmarytts
Android MARY TTS - an open-source, offline HMM-Based text-to-speech synthesis system based on MaryTTS
Talkify
Javascript Text to speech library
Alan Sdk Pcf
Alan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.
Marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
Articulate.js
A jQuery plugin that lets the browser speak to you.
Durian
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Crystal
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
Cross Lingual Voice Cloning
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
Tacotron Pytorch
A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Speech And Text
Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Zerospeech Tts Without T
A Pytorch implementation for the ZeroSpeech 2019 challenge.
Joytan
Creative Audio/Textbook Maker 🎵 📖 See our YouTube channel
Gtts
Python library and CLI tool to interface with Google Translate's text-to-speech API
Serverless Medium Text To Speech
🔊 Serverless-based, text-to-speech service for Medium articles
Speaker
A PHP library to convert text to speech using various web services
Bvae Tts
Official implementation of BVAE-TTS
Merlin
This is now the official location of the Merlin project.
Watbot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Voicenet
Speech synthesis platform based on tensorflow and sonnet
Cs224n Gpu That Talks
Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
Textnormalizationcoveringgrammars
Covering grammars for English and Russian text normalization
Tacotron2
pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf
Tacotron2
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Friend.ly
A social media platform with a friend recommendation engine based on personality trait extraction
Asrgen
Attacking Speaker Recognition with Deep Generative Models
Lightspeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Jsut Lab
HTS-style full-context labels for JSUT v1.1
Vonage Php Sdk Core
Vonage REST API client for PHP. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
1-60 of 175 text-to-speech projects