Cs224n Gpu That TalksAttention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
Stars: ✭ 52 (+13.04%)
wiki2ssmlWiki2SSML provides the WikiVoice markup language used for fine-tuning synthesised voice.
Stars: ✭ 31 (-32.61%)
Artyom.jsA voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+2097.83%)
LightspeechLightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-32.61%)
Expressive-FastSpeech2PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Stars: ✭ 139 (+202.17%)
PororoPORORO: Platform Of neuRal mOdels for natuRal language prOcessing
Stars: ✭ 812 (+1665.22%)
Cyclegan Vc2Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
Stars: ✭ 158 (+243.48%)
WorldA high-quality speech analysis, manipulation and synthesis system
Stars: ✭ 769 (+1571.74%)
GlottDNNGlottDNN vocoder and tools for training DNN excitation models
Stars: ✭ 30 (-34.78%)
ParallelwaveganUnofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Stars: ✭ 682 (+1382.61%)
Tacotron 2DeepMind's Tacotron-2 Tensorflow implementation
Stars: ✭ 1,968 (+4178.26%)
ParrotRNN-based generative models for speech.
Stars: ✭ 601 (+1206.52%)
StyleSpeechOfficial implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (+250%)
Melgan NeuripsGAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Stars: ✭ 592 (+1186.96%)
Tensorflowtts😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Stars: ✭ 2,382 (+5078.26%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+1078.26%)
samSoftware Automatic Mouth - Tiny Speech Synthesizer
Stars: ✭ 316 (+586.96%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+965.22%)
WavegradA fast, high-quality neural vocoder.
Stars: ✭ 138 (+200%)
GanttsPyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
Stars: ✭ 460 (+900%)
audioslides.ioUse Amazon Polly, Google Slides and FFMpeg to create videos that can be updated at anytime by anyone. This project is written in Elixir.
Stars: ✭ 19 (-58.7%)
ZerospeechVQ-VAE for Acoustic Unit Discovery and Voice Conversion
Stars: ✭ 137 (+197.83%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (+669.57%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (+82.61%)
Multilingual text to speechAn implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Stars: ✭ 324 (+604.35%)
CotatronOfficial code for Cotatron @ INTERSPEECH 2020
Stars: ✭ 137 (+197.83%)
Cognitive Speech TtsMicrosoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Stars: ✭ 312 (+578.26%)
Zero-Shot-TTSUnofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (-28.26%)
PysptkA python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+545.65%)
Legacy straightA vocoder framework which had been widely used in research community since 1999.
Stars: ✭ 130 (+182.61%)
ParakeetPAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)
Stars: ✭ 279 (+506.52%)
voderAn emulation of the Voder Speech Synthesizer.
Stars: ✭ 19 (-58.7%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+7910.87%)
Pytorch Dc TtsText to Speech with PyTorch (English and Mongolian)
Stars: ✭ 122 (+165.22%)
Catch-A-WaveformOfficial pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
Stars: ✭ 117 (+154.35%)
mimic2Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.
Stars: ✭ 537 (+1067.39%)
esp32-fliteSpeech synthesis running on ESP32 based on Flite engine.
Stars: ✭ 28 (-39.13%)
DurianImplementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (+141.3%)
tacotron2Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.
Stars: ✭ 17 (-63.04%)
edittsOfficial implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (+60.87%)
CrystalCrystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
Stars: ✭ 108 (+134.78%)
Fre-GAN-pytorchFre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (+58.7%)
TacotronA TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Stars: ✭ 2,581 (+5510.87%)
Cnn vocoderA fast cnn-based vocoder
Stars: ✭ 74 (+60.87%)
Meta-TTSOfficial repository of https://arxiv.org/abs/2111.04040v1
Stars: ✭ 69 (+50%)
Tacotron PytorchA Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Stars: ✭ 104 (+126.09%)
Tacotron pytorchPyTorch implementation of Tacotron speech synthesis model.
Stars: ✭ 242 (+426.09%)
ttslearnttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+243.48%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+2895.65%)
MelNet-SpeechGenerationImplementation of MelNet in PyTorch to generate high-fidelity audio samples
Stars: ✭ 19 (-58.7%)
MerlinThis is now the official location of the Merlin project.
Stars: ✭ 1,168 (+2439.13%)
Cross vcCross-lingual Voice Conversion
Stars: ✭ 91 (+97.83%)
Cross-Speaker-Emotion-TransferPyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Stars: ✭ 107 (+132.61%)
TensorVoxDesktop application for neural speech synthesis written in C++
Stars: ✭ 140 (+204.35%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-23.91%)
NanoFlowPyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)
Stars: ✭ 63 (+36.96%)