😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Prosody

Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text

✭ 139

python machine-learning pytorch natural-language-processing dataset speech-synthesis corpus sequence-labeling

Wavegrad

A fast, high-quality neural vocoder.

✭ 138

python machine-learning pytorch neural-network paper speech text-to-speech pretrained-models speech-synthesis

Diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

✭ 139

python machine-learning pytorch neural-network paper speech text-to-speech pretrained-models speech-synthesis

Zerospeech

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

✭ 137

python pytorch speech-synthesis

Xva Synth

Machine learning based speech synthesis Electron app, with voices from specific characters from video games

✭ 136

python machine-learning electron speech-synthesis tacotron

Cotatron

Official code for Cotatron @ INTERSPEECH 2020

✭ 137

python pytorch speech-synthesis

Awesome Ai Services

An overview of the AI-as-a-service landscape

✭ 133

javascript java kotlin nodejs machine-learning computer-vision natural-language-processing artificial-intelligence jvm speech-recognition face-recognition sentiment-analysis text-to-speech speech-to-text speech-synthesis machine-translation text-recognition

Legacy straight

A vocoder framework which had been widely used in research community since 1999.

✭ 130

matlab speech-synthesis

Marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java

✭ 1,699

java XSLT groovy javascript HTML Raku text-to-speech tts speech-synthesis

Pytorch Dc Tts

Text to Speech with PyTorch (English and Mongolian)

✭ 122

python jupyter-notebook deep-learning pytorch convolutional-neural-networks text-to-speech tts speech-synthesis

Deepvoice3 pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

✭ 1,654

python shell machine-learning pytorch tts speech-synthesis speech-processing end-to-end multi-speaker

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

✭ 111

python speech text-to-speech tts speech-synthesis

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

✭ 1,509

python shell Dockerfile HTML linux bot home-automation speech-recognition speech-to-text speech-synthesis raspberry personal-assistant bot-creation jarvis

Crystal

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

✭ 108

text-to-speech tts speech-synthesis

Wavernn

WaveRNN Vocoder + TTS

✭ 1,636

python pytorch text-to-speech tts speech-synthesis tacotron wavernn neural-vocoder

Tacotron Pytorch

A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model

✭ 104

python deep-learning pytorch seq2seq text-to-speech speech-synthesis end-to-end tacotron

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

✭ 103

python deep-learning machine-learning tensorflow nlp bot natural-language-processing raspberry-pi neural-networks embedded speech-recognition text-to-speech speech-to-text tts speech-synthesis natural-language-understanding nlu smart-home voice-recognition

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

✭ 1,378

python deep-learning tensorflow speech-recognition seq2seq language-model text-to-speech speech-to-text speech-synthesis neural-machine-translation sequence-to-sequence

Waveflow

A PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio"

✭ 95

jupyter-notebook pytorch speech-synthesis

Cross vc

Cross-lingual Voice Conversion

✭ 91

python speech-recognition speech-synthesis

Cnn vocoder

A fast cnn-based vocoder

✭ 74

python pytorch tts speech-synthesis

Merlin

This is now the official location of the Merlin project.

✭ 1,168

python deep-learning tensorflow keras text-to-speech speech-synthesis theano

Speech ai

Simple speech linguistic AI with Python

✭ 66

python machine-learning speech-recognition speech-synthesis

Tf Wavenet vocoder

Wavenet and its applications with Tensorflow

✭ 58

jupyter-notebook tensorflow speech-synthesis wavenet

Pink Trombone

A programmable version of Neil Thapen's Pink Trombone

✭ 54

javascript api voice speech-synthesis web-audio web-component

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

✭ 52

python3 jupyter-notebook tensorflow text-to-speech tts speech-synthesis

Tacotron2

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

✭ 43

python pytorch text-to-speech speech-synthesis tacotron

Artyom.js

A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.

✭ 1,011

javascript speech-recognition speech-to-text speech-synthesis recognition voice-commands

Wsay

Windows "say"

✭ 36

windows command-line-tool speech text-to-speech tts speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

✭ 31

python pytorch speech text-to-speech tts speech-synthesis

Jsut Lab

HTS-style full-context labels for JSUT v1.1

✭ 28

dataset text-to-speech tts speech-synthesis

Pororo

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

✭ 812

python deep-learning natural-language-processing speech-synthesis

Espeak Ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

✭ 799

c android text-to-speech speech-synthesis

World

A high-quality speech analysis, manipulation and synthesis system

✭ 769

speech-synthesis

Rhvoice

a free and open source speech synthesizer for Russian and other languages

✭ 750

android linux windows text-to-speech speech-synthesis english russian

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

✭ 682

jupyter-notebook pytorch realtime text-to-speech tts speech-synthesis wavenet

Sam

Software Automatic Mouth - Tiny Speech Synthesizer

✭ 667

c speech-synthesis

Parrot

RNN-based generative models for speech.

✭ 601

python deep-learning recurrent-neural-networks speech-synthesis theano

Fastspeech

The Implementation of FastSpeech based on pytorch.

✭ 600

python deep-learning pytorch speech-synthesis

Melgan Neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

✭ 592

python deep-learning pytorch gans speech-synthesis

Flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

✭ 546

jupyter-notebook speech-synthesis

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

✭ 542

python tensorflow deployment speech-recognition transformer unsupervised-learning tts speech-synthesis asr sequence-to-sequence ctc

Termit

Translations with speech synthesis in your terminal as a ruby gem

✭ 505

ruby terminal translation ruby-gem speech-synthesis translations

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

✭ 490

java api google speech-recognition speech speech-to-text speech-synthesis recognition

Autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

✭ 485

python unsupervised-learning speech-synthesis

Gantts

PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)

✭ 460

jupyter-notebook gan speech-synthesis

1-60 of 141 speech-synthesis projects

›