Wavegrad: Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Hantts: Chinese Text-to-Speech web service
Go Astibob: Golang framework to build an AI that can understand and speak back to you, and everything else you want
Tts Cube: End-to-end speech synthesis with recurrent neural networks
Ronor: Sonos smart speaker controller API and command-line tools
Waveglow: A PyTorch implementation of "WaveGlow: A Flow-based Generative Network for Speech Synthesis"
Vonage Ruby Sdk: Vonage REST API client for Ruby. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
Hms Ml Demo: HMS ML Demo provides an example of integrating Huawei ML Kit services into applications, such as face detection, text recognition, image segmentation, ASR, and TTS.
Doc2audiobook: Convert text documents to high-fidelity audio(books).
Naomi: The Naomi Project is an open-source, technology-agnostic platform for developing always-on, voice-controlled applications!
Vocgan: VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Tacotron 2: DeepMind's Tacotron-2 TensorFlow implementation
Aeneas: aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Tensorflowtts: 😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
Amazon Polly Sample: Sample application for Amazon Polly. Converts any blog into an audio podcast.
Wavegrad: A fast, high-quality neural vocoder.
Diffwave: DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Vonage Python Sdk: Vonage Server SDK for Python. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
Androidmarytts: Android MARY TTS - an open-source, offline HMM-based text-to-speech synthesis system based on MaryTTS
Talkify: JavaScript text-to-speech library
Alan Sdk Pcf: Alan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.
Marytts: MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure Java
Tts: Text-to-Speech for Arduino
Durian: Implementation of the paper "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf).
Crystal: Crystal - C++ implementation of a unified framework for a multilingual TTS synthesis engine with the SSML specification as its interface.
Cross Lingual Voice Cloning: Tacotron 2 - PyTorch implementation with faster-than-realtime inference, modified to enable cross-lingual voice cloning.
Tacotron Pytorch: A PyTorch implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Spokestack Python: Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Speech And Text: Speech to text (PocketSphinx, iFlytek API, Baidu API) and text to speech (pyttsx3)
Openseq2seq: Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Joytan: Creative Audio/Textbook Maker 🎵 📖 See our YouTube channel
Gtts: Python library and CLI tool to interface with Google Translate's text-to-speech API
Speaker: A PHP library to convert text to speech using various web services
Bvae Tts: Official implementation of BVAE-TTS
Merlin: This is now the official location of the Merlin project.
Watbot: An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Dragonfire: The open-source virtual assistant for Ubuntu-based Linux distributions
Voicenet: Speech synthesis platform based on TensorFlow and Sonnet
Tacotron2: PyTorch implementation of Tacotron 2 (https://arxiv.org/pdf/1712.05884.pdf)
Tacotron2: A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Friend.ly: A social media platform with a friend recommendation engine based on personality trait extraction
Asrgen: Attacking Speaker Recognition with Deep Generative Models
Lightspeech: LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Jsut Lab: HTS-style full-context labels for JSUT v1.1
Vonage Php Sdk Core: Vonage REST API client for PHP. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.