This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.

Stars: ✭ 51 (-62.5%)

Mutual labels: tacotron

Speech ai

Simple speech linguistic AI with Python

Stars: ✭ 66 (-51.47%)

Mutual labels: speech-synthesis

LVCNet

LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation

Stars: ✭ 67 (-50.74%)

Mutual labels: speech-synthesis

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+298.53%)

Mutual labels: speech-synthesis

Text-to-Speech-Landscape

No description or website provided.

Stars: ✭ 31 (-77.21%)

Mutual labels: tacotron

Marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java

Stars: ✭ 1,699 (+1149.26%)

Mutual labels: speech-synthesis

Glow Tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Stars: ✭ 284 (+108.82%)

Mutual labels: speech-synthesis

SingleVC

Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.

Stars: ✭ 25 (-81.62%)

Mutual labels: speech-synthesis

Tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

Stars: ✭ 493 (+262.5%)

Mutual labels: tacotron

spoken-word

Spoken Word

Stars: ✭ 46 (-66.18%)

Mutual labels: speech-synthesis

Pink Trombone

A programmable version of Neil Thapen's Pink Trombone

Stars: ✭ 54 (-60.29%)

Mutual labels: speech-synthesis

ExtensibleTTS-PyTorch

An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery

Stars: ✭ 25 (-81.62%)

Mutual labels: speech-synthesis

Autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Stars: ✭ 485 (+256.62%)

Mutual labels: speech-synthesis

Sinsy-NG

(discontinued) 🎵The Formant-Based All Language Singing Voice Syntheis System: Sinsy-NG

Stars: ✭ 15 (-88.97%)

Mutual labels: speech-synthesis

Tacotron2-PyTorch

Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.

Stars: ✭ 118 (-13.24%)

Mutual labels: tacotron

Sprocket

Voice Conversion Tool Kit

Stars: ✭ 425 (+212.5%)

Mutual labels: speech-synthesis

TinyCog

Small Robot, Toy Robot platform

Stars: ✭ 29 (-78.68%)

Mutual labels: speech-synthesis

Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Stars: ✭ 139 (+2.21%)

Mutual labels: speech-synthesis

WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Stars: ✭ 55 (-59.56%)

Mutual labels: speech-synthesis

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+3233.09%)

Mutual labels: speech-synthesis

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (-2.21%)

Mutual labels: speech-synthesis

Espeak Ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

Stars: ✭ 799 (+487.5%)

Mutual labels: speech-synthesis

Pytorchwavenetvocoder

WaveNet-Vocoder implementation with pytorch.

Stars: ✭ 269 (+97.79%)

Mutual labels: speech-synthesis

audioslides.io

Use Amazon Polly, Google Slides and FFMpeg to create videos that can be updated at anytime by anyone. This project is written in Elixir.

Stars: ✭ 19 (-86.03%)

Mutual labels: speech-synthesis

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+518.38%)

Mutual labels: speech-synthesis

Libfaceid

libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.

Stars: ✭ 354 (+160.29%)

Mutual labels: speech-synthesis

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (-20.59%)

Mutual labels: speech-synthesis

Tacotron Wavernn

TTS (Tacotron + WaveRNN)

Stars: ✭ 40 (-70.59%)

Mutual labels: tacotron

tacotron2

Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow

Stars: ✭ 102 (-25%)

Mutual labels: tacotron

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+3890.44%)

Mutual labels: tacotron

FCH-TTS

A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型，适用于英语、普通话/中文、日语、韩语、俄语和藏语（当前已测试）。

Stars: ✭ 154 (+13.24%)

Mutual labels: tacotron

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (-24.26%)

Mutual labels: speech-synthesis

ml-with-audio

HF's ML for Audio study group

Stars: ✭ 104 (-23.53%)

Mutual labels: speech-synthesis

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (+138.97%)

Mutual labels: speech-synthesis

few-shot-transformer-tts

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

Stars: ✭ 60 (-55.88%)

Mutual labels: speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-77.21%)

Mutual labels: speech-synthesis

MediumVC

Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

Stars: ✭ 46 (-66.18%)

Mutual labels: speech-synthesis

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (+129.41%)

Mutual labels: speech-synthesis

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-75.74%)

Mutual labels: speech-synthesis

Deepvoice3 pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Stars: ✭ 1,654 (+1116.18%)

Mutual labels: speech-synthesis

vietTTS

Vietnamese Text to Speech library

Stars: ✭ 78 (-42.65%)

Mutual labels: tacotron

Pysptk

A python wrapper for Speech Signal Processing Toolkit (SPTK).

Stars: ✭ 297 (+118.38%)

Mutual labels: speech-synthesis

Khronos

The open source intelligent personal assistant

Stars: ✭ 25 (-81.62%)

Mutual labels: speech-synthesis

Pororo

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

Stars: ✭ 812 (+497.06%)

Mutual labels: speech-synthesis

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-74.26%)

Mutual labels: speech-synthesis

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (+105.15%)

Mutual labels: speech-synthesis

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Stars: ✭ 1,604 (+1079.41%)

Mutual labels: speech-synthesis

Waveflow

A PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio"

Stars: ✭ 95 (-30.15%)

Mutual labels: speech-synthesis

Catch-A-Waveform

Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)

Stars: ✭ 117 (-13.97%)

Mutual labels: speech-synthesis

World

A high-quality speech analysis, manipulation and synthesis system

Stars: ✭ 769 (+465.44%)

Mutual labels: speech-synthesis

voice-conversion

an tutorial implement of voice conversion using pytorch

Stars: ✭ 26 (-80.88%)

Mutual labels: speech-synthesis

esp32-flite

Speech synthesis running on ESP32 based on Flite engine.

Stars: ✭ 28 (-79.41%)

Mutual labels: speech-synthesis

Cotatron

Official code for Cotatron @ INTERSPEECH 2020

Stars: ✭ 137 (+0.74%)

Mutual labels: speech-synthesis

61-120 of 157 similar projects

‹

›