Cross vcCross-lingual Voice Conversion
Stars: ✭ 91 (-71.2%)
Artyom.jsA voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+219.94%)
ZerospeechVQ-VAE for Acoustic Unit Discovery and Voice Conversion
Stars: ✭ 137 (-56.65%)
Tacotron PytorchA Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Stars: ✭ 104 (-67.09%)
ParallelwaveganUnofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Stars: ✭ 682 (+115.82%)
Tensorflowtts😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Stars: ✭ 2,382 (+653.8%)
UniversalvocodingA PyTorch implementation of "Robust Universal Neural Vocoding"
Stars: ✭ 197 (-37.66%)
PororoPORORO: Platform Of neuRal mOdels for natuRal language prOcessing
Stars: ✭ 812 (+156.96%)
Legacy straightA vocoder framework which had been widely used in research community since 1999.
Stars: ✭ 130 (-58.86%)
CrystalCrystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
Stars: ✭ 108 (-65.82%)
Melgan NeuripsGAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Stars: ✭ 592 (+87.34%)
Tacotron 2DeepMind's Tacotron-2 Tensorflow implementation
Stars: ✭ 1,968 (+522.78%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+336.08%)
TacotronA TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Stars: ✭ 2,581 (+716.77%)
MerlinThis is now the official location of the Merlin project.
Stars: ✭ 1,168 (+269.62%)
WavegradA fast, high-quality neural vocoder.
Stars: ✭ 138 (-56.33%)
Cs224n Gpu That TalksAttention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
Stars: ✭ 52 (-83.54%)
tacotron2Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.
Stars: ✭ 17 (-94.62%)
LightspeechLightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-90.19%)
CotatronOfficial code for Cotatron @ INTERSPEECH 2020
Stars: ✭ 137 (-56.65%)
WorldA high-quality speech analysis, manipulation and synthesis system
Stars: ✭ 769 (+143.35%)
ParrotRNN-based generative models for speech.
Stars: ✭ 601 (+90.19%)
Pytorch Dc TtsText to Speech with PyTorch (English and Mongolian)
Stars: ✭ 122 (-61.39%)
KalliopeKalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (+377.53%)
FlowtronFlowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
Stars: ✭ 546 (+72.78%)
VocganVocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Stars: ✭ 158 (-50%)
WavernnWaveRNN Vocoder + TTS
Stars: ✭ 1,636 (+417.72%)
NormitTranslations with speech synthesis in your terminal as a node package
Stars: ✭ 219 (-30.7%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-67.41%)
WaveflowA PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio"
Stars: ✭ 95 (-69.94%)
ttsflowtensorflow speech synthesis c++ inference for voicenet
Stars: ✭ 17 (-94.62%)
Cnn vocoderA fast cnn-based vocoder
Stars: ✭ 74 (-76.58%)
ProsodyHelsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-56.01%)
Speech aiSimple speech linguistic AI with Python
Stars: ✭ 66 (-79.11%)
Pink TromboneA programmable version of Neil Thapen's Pink Trombone
Stars: ✭ 54 (-82.91%)
DiffwaveDiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (-56.01%)
Tacotron2A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Stars: ✭ 43 (-86.39%)
voderAn emulation of the Voder Speech Synthesizer.
Stars: ✭ 19 (-93.99%)
WsayWindows "say"
Stars: ✭ 36 (-88.61%)
Xva SynthMachine learning based speech synthesis Electron app, with voices from specific characters from video games
Stars: ✭ 136 (-56.96%)
Jsut LabHTS-style full-context labels for JSUT v1.1
Stars: ✭ 28 (-91.14%)
LingvoLingvo
Stars: ✭ 2,361 (+647.15%)
Espeak NgeSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Stars: ✭ 799 (+152.85%)
Rhvoicea free and open source speech synthesizer for Russian and other languages
Stars: ✭ 750 (+137.34%)
WavegradImplementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Stars: ✭ 245 (-22.47%)
SamSoftware Automatic Mouth - Tiny Speech Synthesizer
Stars: ✭ 667 (+111.08%)
MaryttsMARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
Stars: ✭ 1,699 (+437.66%)
FastspeechThe Implementation of FastSpeech based on pytorch.
Stars: ✭ 600 (+89.87%)
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (-45.89%)
Deepvoice3 pytorchPyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (+423.42%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (-73.42%)
resid-rsPort of reSID, a MOS6581 SID emulator engine, to Rust
Stars: ✭ 25 (-92.09%)
Tacotron pytorchPyTorch implementation of Tacotron speech synthesis model.
Stars: ✭ 242 (-23.42%)
Cyclegan Vc2Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
Stars: ✭ 158 (-50%)
DurianImplementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (-64.87%)