Cnn vocoderA fast cnn-based vocoder
Stars: ✭ 74 (-45.59%)
Fre-GAN-pytorchFre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (-46.32%)
ParrotRNN-based generative models for speech.
Stars: ✭ 601 (+341.91%)
Meta-TTSOfficial repository of https://arxiv.org/abs/2111.04040v1
Stars: ✭ 69 (-49.26%)
KalliopeKalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (+1009.56%)
YourTTSYourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Stars: ✭ 217 (+59.56%)
Melgan NeuripsGAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Stars: ✭ 592 (+335.29%)
ExpressiveTacotronThis repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.
Stars: ✭ 51 (-62.5%)
Speech aiSimple speech linguistic AI with Python
Stars: ✭ 66 (-51.47%)
LVCNetLVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
Stars: ✭ 67 (-50.74%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+298.53%)
MaryttsMARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
Stars: ✭ 1,699 (+1149.26%)
Glow TtsA Generative Flow for Text-to-Speech via Monotonic Alignment Search
Stars: ✭ 284 (+108.82%)
SingleVCAny-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.
Stars: ✭ 25 (-81.62%)
TacotronAudio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Stars: ✭ 493 (+262.5%)
Pink TromboneA programmable version of Neil Thapen's Pink Trombone
Stars: ✭ 54 (-60.29%)
ExtensibleTTS-PyTorchAn extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery
Stars: ✭ 25 (-81.62%)
AutovcAutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Stars: ✭ 485 (+256.62%)
Sinsy-NG(discontinued) 🎵The Formant-Based All Language Singing Voice Syntheis System: Sinsy-NG
Stars: ✭ 15 (-88.97%)
Tacotron2-PyTorchYet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
Stars: ✭ 118 (-13.24%)
SprocketVoice Conversion Tool Kit
Stars: ✭ 425 (+212.5%)
TinyCogSmall Robot, Toy Robot platform
Stars: ✭ 29 (-78.68%)
Expressive-FastSpeech2PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Stars: ✭ 139 (+2.21%)
WaveGrad2PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Stars: ✭ 55 (-59.56%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+3233.09%)
Espeak NgeSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Stars: ✭ 799 (+487.5%)
audioslides.ioUse Amazon Polly, Google Slides and FFMpeg to create videos that can be updated at anytime by anyone. This project is written in Elixir.
Stars: ✭ 19 (-86.03%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+518.38%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (+160.29%)
AdaSpeechAdaSpeech: Adaptive Text to Speech for Custom Voice
Stars: ✭ 108 (-20.59%)
tacotron2Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow
Stars: ✭ 102 (-25%)
Tts🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Stars: ✭ 5,427 (+3890.44%)
FCH-TTSA fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型,适用于英语、普通话/中文、日语、韩语、俄语和藏语(当前已测试)。
Stars: ✭ 154 (+13.24%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-24.26%)
ml-with-audioHF's ML for Audio study group
Stars: ✭ 104 (-23.53%)
Hifi GanHiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Stars: ✭ 325 (+138.97%)
few-shot-transformer-ttsByte-based multilingual transformer TTS for low-resource/few-shot language adaptation.
Stars: ✭ 60 (-55.88%)
LightspeechLightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-77.21%)
MediumVCAny-to-any voice conversion using synthetic specific-speaker speeches as intermedium features
Stars: ✭ 46 (-66.18%)
Cognitive Speech TtsMicrosoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Stars: ✭ 312 (+129.41%)
Zero-Shot-TTSUnofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (-75.74%)
Deepvoice3 pytorchPyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (+1116.18%)
vietTTSVietnamese Text to Speech library
Stars: ✭ 78 (-42.65%)
PysptkA python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+118.38%)
KhronosThe open source intelligent personal assistant
Stars: ✭ 25 (-81.62%)
PororoPORORO: Platform Of neuRal mOdels for natuRal language prOcessing
Stars: ✭ 812 (+497.06%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-74.26%)
ParakeetPAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)
Stars: ✭ 279 (+105.15%)
vitsVITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Stars: ✭ 1,604 (+1079.41%)
WaveflowA PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio"
Stars: ✭ 95 (-30.15%)
Catch-A-WaveformOfficial pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
Stars: ✭ 117 (-13.97%)
WorldA high-quality speech analysis, manipulation and synthesis system
Stars: ✭ 769 (+465.44%)
voice-conversionan tutorial implement of voice conversion using pytorch
Stars: ✭ 26 (-80.88%)
esp32-fliteSpeech synthesis running on ESP32 based on Flite engine.
Stars: ✭ 28 (-79.41%)
CotatronOfficial code for Cotatron @ INTERSPEECH 2020
Stars: ✭ 137 (+0.74%)