A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型，适用于英语、普通话/中文、日语、韩语、俄语和藏语（当前已测试）。

Stars: ✭ 154 (-71.32%)

Mutual labels: tacotron

GlottDNN

GlottDNN vocoder and tools for training DNN excitation models

Stars: ✭ 30 (-94.41%)

Mutual labels: speech-synthesis

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Stars: ✭ 107 (-80.07%)

Mutual labels: speech-synthesis

Lingvo

Stars: ✭ 2,361 (+339.66%)

Mutual labels: speech-synthesis

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (-39.48%)

Mutual labels: speech-synthesis

Artyom.js

A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.

Stars: ✭ 1,011 (+88.27%)

Mutual labels: speech-synthesis

spoken-word

Spoken Word

Stars: ✭ 46 (-91.43%)

Mutual labels: speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-94.23%)

Mutual labels: speech-synthesis

sam

Software Automatic Mouth - Tiny Speech Synthesizer

Stars: ✭ 316 (-41.15%)

Mutual labels: speech-synthesis

Pororo

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

Stars: ✭ 812 (+51.21%)

Mutual labels: speech-synthesis

ml-with-audio

HF's ML for Audio study group

Stars: ✭ 104 (-80.63%)

Mutual labels: speech-synthesis

World

A high-quality speech analysis, manipulation and synthesis system

Stars: ✭ 769 (+43.2%)

Mutual labels: speech-synthesis

voder

An emulation of the Voder Speech Synthesizer.

Stars: ✭ 19 (-96.46%)

Mutual labels: speech-synthesis

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (+27%)

Mutual labels: speech-synthesis

WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Stars: ✭ 55 (-89.76%)

Mutual labels: speech-synthesis

Parrot

RNN-based generative models for speech.

Stars: ✭ 601 (+11.92%)

Mutual labels: speech-synthesis

Melgan Neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Stars: ✭ 592 (+10.24%)

Mutual labels: speech-synthesis

few-shot-transformer-tts

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

Stars: ✭ 60 (-88.83%)

Mutual labels: speech-synthesis

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+0.93%)

Mutual labels: speech-synthesis

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-8.75%)

Mutual labels: speech-synthesis

Sinsy-NG

(discontinued) 🎵The Formant-Based All Language Singing Voice Syntheis System: Sinsy-NG

Stars: ✭ 15 (-97.21%)

Mutual labels: speech-synthesis

Gantts

PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)

Stars: ✭ 460 (-14.34%)

Mutual labels: speech-synthesis

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (-41.9%)

Mutual labels: speech-synthesis

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+744.13%)

Mutual labels: speech-synthesis

MediumVC

Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

Stars: ✭ 46 (-91.43%)

Mutual labels: speech-synthesis

Libfaceid

libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.

Stars: ✭ 354 (-34.08%)

Mutual labels: speech-synthesis

Universalvocoding

A PyTorch implementation of "Robust Universal Neural Vocoding"

Stars: ✭ 197 (-63.31%)

Mutual labels: speech-synthesis

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (-39.66%)

Mutual labels: speech-synthesis

melgan

MelGAN implementation with Multi-Band and Full Band supports...