MelnetImplementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
Stars: ✭ 161 (+631.82%)
Tts Papers🐸 collection of TTS papers
Stars: ✭ 160 (+627.27%)
TacotronA TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Stars: ✭ 1,756 (+7881.82%)
DlaDeep learning for audio processing
Stars: ✭ 142 (+545.45%)
Midi2voiceSinging synthesis from MIDI file
Stars: ✭ 102 (+363.64%)
My AppdaemonMy apps, my helpfiles, all about AppDaemon for Home Assistant
Stars: ✭ 94 (+327.27%)
PitchtronTTS for pitch-accented language. Korean dialect DB.
Stars: ✭ 91 (+313.64%)
TtsTools to convert text to speech 📚💬
Stars: ✭ 84 (+281.82%)
Hassio AddonsThe repository for my Home Assistant Supervisor Add-ons.
Stars: ✭ 71 (+222.73%)
Drachtio Freeswitch ModulesA collection of open-sourced freeswitch modules that I use in various drachtio applications
Stars: ✭ 73 (+231.82%)
Parrots Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese.
Stars: ✭ 48 (+118.18%)
Py NltoolsA collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (+109.09%)
Dc ttsA TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (+4522.73%)
MtransMulti-source Translation
Stars: ✭ 711 (+3131.82%)
EkhoChinese text-to-speech engine
Stars: ✭ 690 (+3036.36%)
Real Time Voice CloningClone a voice in 5 seconds to generate arbitrary speech in real-time
Stars: ✭ 32,095 (+145786.36%)
MelganMelGAN vocoder (compatible with NVIDIA/tacotron2)
Stars: ✭ 444 (+1918.18%)
Facemoji😆 A voice chatbot that can imitate your expression. OpenCV+Dlib+Live2D+Moments Recorder+Turing Robot+Iflytek IAT+Iflytek TTS
Stars: ✭ 320 (+1354.55%)
Android SpeechAndroid speech recognition and text to speech made easy
Stars: ✭ 310 (+1309.09%)
Flutter ttsFlutter Text to Speech package
Stars: ✭ 263 (+1095.45%)
NormitTranslations with speech synthesis in your terminal as a node package
Stars: ✭ 219 (+895.45%)
UniversalvocodingA PyTorch implementation of "Robust Universal Neural Vocoding"
Stars: ✭ 197 (+795.45%)
Cyclegan Vc2Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
Stars: ✭ 158 (+618.18%)
ProsodyHelsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (+531.82%)
ZerospeechVQ-VAE for Acoustic Unit Discovery and Voice Conversion
Stars: ✭ 137 (+522.73%)
CotatronOfficial code for Cotatron @ INTERSPEECH 2020
Stars: ✭ 137 (+522.73%)
Legacy straightA vocoder framework which had been widely used in research community since 1999.
Stars: ✭ 130 (+490.91%)
KalliopeKalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (+6759.09%)
WaveflowA PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio"
Stars: ✭ 95 (+331.82%)
Cross vcCross-lingual Voice Conversion
Stars: ✭ 91 (+313.64%)
Speech aiSimple speech linguistic AI with Python
Stars: ✭ 66 (+200%)
Pink TromboneA programmable version of Neil Thapen's Pink Trombone
Stars: ✭ 54 (+145.45%)
Artyom.jsA voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+4495.45%)
PororoPORORO: Platform Of neuRal mOdels for natuRal language prOcessing
Stars: ✭ 812 (+3590.91%)
WorldA high-quality speech analysis, manipulation and synthesis system
Stars: ✭ 769 (+3395.45%)
SamSoftware Automatic Mouth - Tiny Speech Synthesizer
Stars: ✭ 667 (+2931.82%)
ParrotRNN-based generative models for speech.
Stars: ✭ 601 (+2631.82%)
FastspeechThe Implementation of FastSpeech based on pytorch.
Stars: ✭ 600 (+2627.27%)
Melgan NeuripsGAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Stars: ✭ 592 (+2590.91%)
FlowtronFlowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
Stars: ✭ 546 (+2381.82%)
TermitTranslations with speech synthesis in your terminal as a ruby gem
Stars: ✭ 505 (+2195.45%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+2127.27%)
AutovcAutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Stars: ✭ 485 (+2104.55%)
GanttsPyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
Stars: ✭ 460 (+1990.91%)
SprocketVoice Conversion Tool Kit
Stars: ✭ 425 (+1831.82%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+20504.55%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (+1509.09%)
PysptkA python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+1250%)
Tacotron asrSpeech Recognition Using Tacotron
Stars: ✭ 165 (+650%)
Gst TacotronA tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
Stars: ✭ 313 (+1322.73%)