A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

Stars: ✭ 43 (-68.38%)

Mutual labels: speech-synthesis, tacotron

Wavernn

WaveRNN Vocoder + TTS

Stars: ✭ 1,636 (+1102.94%)

Mutual labels: speech-synthesis, tacotron

mimic2

Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.

Stars: ✭ 537 (+294.85%)

Mutual labels: speech-synthesis, tacotron

Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-83.82%)

Mutual labels: speech-synthesis, tacotron

Merlin

This is now the official location of the Merlin project.

Stars: ✭ 1,168 (+758.82%)

Mutual labels: speech-synthesis

Sam

Software Automatic Mouth - Tiny Speech Synthesizer

Stars: ✭ 667 (+390.44%)

Mutual labels: speech-synthesis

Fastspeech

The Implementation of FastSpeech based on pytorch.

Stars: ✭ 600 (+341.18%)

Mutual labels: speech-synthesis

Flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Stars: ✭ 546 (+301.47%)

Mutual labels: speech-synthesis

Crystal

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

Stars: ✭ 108 (-20.59%)

Mutual labels: speech-synthesis

Tf Wavenet vocoder

Wavenet and its applications with Tensorflow

Stars: ✭ 58 (-57.35%)

Mutual labels: speech-synthesis

Termit

Translations with speech synthesis in your terminal as a ruby gem

Stars: ✭ 505 (+271.32%)

Mutual labels: speech-synthesis

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+260.29%)

Mutual labels: speech-synthesis

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (-61.76%)

Mutual labels: speech-synthesis

Gantts

PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)

Stars: ✭ 460 (+238.24%)

Mutual labels: speech-synthesis

Tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Stars: ✭ 305 (+124.26%)

Mutual labels: tacotron

Pytorch Dc Tts

Text to Speech with PyTorch (English and Mongolian)

Stars: ✭ 122 (-10.29%)

Mutual labels: speech-synthesis

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+3233.09%)

Mutual labels: speech-synthesis

Artyom.js

A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.

Stars: ✭ 1,011 (+643.38%)

Mutual labels: speech-synthesis

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+166.18%)

Mutual labels: speech-synthesis

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (+401.47%)

Mutual labels: speech-synthesis

Cnn vocoder

A fast cnn-based vocoder

Stars: ✭ 74 (-45.59%)

Mutual labels: speech-synthesis

Parrot

RNN-based generative models for speech.

Stars: ✭ 601 (+341.91%)

Mutual labels: speech-synthesis

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Stars: ✭ 1,509 (+1009.56%)

Mutual labels: speech-synthesis

Melgan Neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Stars: ✭ 592 (+335.29%)

Mutual labels: speech-synthesis

Speech ai

Simple speech linguistic AI with Python

Stars: ✭ 66 (-51.47%)

Mutual labels: speech-synthesis

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+298.53%)

Mutual labels: speech-synthesis

Marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java

Stars: ✭ 1,699 (+1149.26%)

Mutual labels: speech-synthesis

Tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

Stars: ✭ 493 (+262.5%)

Mutual labels: tacotron

Pink Trombone

A programmable version of Neil Thapen's Pink Trombone

Stars: ✭ 54 (-60.29%)

Mutual labels: speech-synthesis

Autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Stars: ✭ 485 (+256.62%)

Mutual labels: speech-synthesis

Espeak

eSpeak NG is an open source speech synthesizer that supports 101 languages and accents.

Stars: ✭ 339 (+149.26%)

Mutual labels: speech-synthesis

Sprocket

Voice Conversion Tool Kit

Stars: ✭ 425 (+212.5%)

Mutual labels: speech-synthesis

Wsay

Windows "say"

Stars: ✭ 36 (-73.53%)

Mutual labels: speech-synthesis

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (+138.24%)

Mutual labels: speech-synthesis

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (-2.21%)

Mutual labels: speech-synthesis

Libfaceid

libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.

Stars: ✭ 354 (+160.29%)

Mutual labels: speech-synthesis

Tacotron Wavernn

TTS (Tacotron + WaveRNN)

Stars: ✭ 40 (-70.59%)

Mutual labels: tacotron

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+3890.44%)

Mutual labels: tacotron

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (-24.26%)

Mutual labels: speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-77.21%)

Mutual labels: speech-synthesis

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (+138.97%)

Mutual labels: speech-synthesis

Gst Tacotron

A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"

Stars: ✭ 313 (+130.15%)

Mutual labels: tacotron

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (+129.41%)

Mutual labels: speech-synthesis

Deepvoice3 pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Stars: ✭ 1,654 (+1116.18%)

Mutual labels: speech-synthesis

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Stars: ✭ 1,378 (+913.24%)

Mutual labels: speech-synthesis

Jsut Lab

HTS-style full-context labels for JSUT v1.1

Stars: ✭ 28 (-79.41%)

Mutual labels: speech-synthesis

Nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Stars: ✭ 308 (+126.47%)

Mutual labels: speech-synthesis

Pysptk

A python wrapper for Speech Signal Processing Toolkit (SPTK).

Stars: ✭ 297 (+118.38%)

Mutual labels: speech-synthesis

Pororo

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

Stars: ✭ 812 (+497.06%)

Mutual labels: speech-synthesis

Glow Tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Stars: ✭ 284 (+108.82%)

Mutual labels: speech-synthesis

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (+105.15%)

Mutual labels: speech-synthesis

Waveflow

A PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio"

Stars: ✭ 95 (-30.15%)

Mutual labels: speech-synthesis

Espeak Ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

Stars: ✭ 799 (+487.5%)

Mutual labels: speech-synthesis

1-60 of 157 similar projects

›