Parrot: RNN-based generative models for speech.
Stars: ✭ 601 (-63.66%)
porfir: Voice assistant Порфирьевич (Porfirevich)
Stars: ✭ 23 (-98.61%)
Naomi: The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (-89.66%)
Melgan Neurips: GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Stars: ✭ 592 (-64.21%)
AmazonSpeechTranslator: End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-96.98%)
Espeak Ng: eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
Stars: ✭ 799 (-51.69%)
Flutter tts: Flutter Text to Speech package
Stars: ✭ 263 (-84.1%)
few-shot-transformer-tts: Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.
Stars: ✭ 60 (-96.37%)
Prosody: Helsinki Prosody Corpus and a System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-91.6%)
py-espeak-ng: Some simple wrappers around eSpeak NG intended to make using this excellent TTS for waveform and IPA generation as convenient as possible.
Stars: ✭ 27 (-98.37%)
Diffwave: DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (-91.6%)
Real Time Voice Cloning: Clone a voice in 5 seconds to generate arbitrary speech in real-time
Stars: ✭ 32,095 (+1840.45%)
Xva Synth: Machine-learning-based speech synthesis Electron app, with voices of specific characters from video games
Stars: ✭ 136 (-91.78%)
wav2letter: Facebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 6,026 (+264.33%)
Pink Trombone: A programmable version of Neil Thapen's Pink Trombone
Stars: ✭ 54 (-96.74%)
Awesome Speech Enhancement: A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Stars: ✭ 257 (-84.46%)
magphase: MagPhase Vocoder, a speech analysis/synthesis system for TTS and related applications.
Stars: ✭ 76 (-95.41%)
MediumVC: Any-to-any voice conversion using synthetic speaker-specific speech as intermediate features
Stars: ✭ 46 (-97.22%)
Vq Vae Speech: PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Stars: ✭ 187 (-88.69%)
Joytan: Creative Audio/Textbook Maker 🎵 📖 See our YouTube channel
Stars: ✭ 91 (-94.5%)
Sdk Js: Tanker client-side encryption SDK for JavaScript
Stars: ✭ 786 (-52.48%)
SpeechTransProgress: Tracking the progress in end-to-end speech translation
Stars: ✭ 139 (-91.6%)
FastSpeech2: Multi-Speaker PyTorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊
Stars: ✭ 64 (-96.13%)
Tutorial separation: This repo summarizes the tutorials, datasets, papers, code, and tools for the speech separation and speaker extraction tasks. Pull requests are welcome.
Stars: ✭ 151 (-90.87%)
pygtrans: Google Translate, with APIKEY support for translating 100,000 items in one go
Stars: ✭ 60 (-96.37%)
Zzz Retired openstt: RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (-91.17%)
Inspectit: inspectIT is the leading Open Source APM (Application Performance Management) tool for analyzing your Java (EE) applications.
Stars: ✭ 513 (-68.98%)
DiscordEncryption: 🔐 Configurable end-to-end encryption for Discord
Stars: ✭ 30 (-98.19%)
Sstd: Single Shot Text Detector with Regional Attention
Stars: ✭ 221 (-86.64%)
Fullsubnet: PyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Stars: ✭ 51 (-96.92%)
Kospeech: Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (-88.51%)
text-to-speech: ⚡️ Capacitor plugin for synthesizing speech from text.
Stars: ✭ 50 (-96.98%)
Lanedetection end2end: End-to-end Lane Detection for Self-Driving Cars (ICCV 2019 Workshop)
Stars: ✭ 500 (-69.77%)
Listen Attend Spell: A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
Stars: ✭ 147 (-91.11%)
Mtts: A Demo of Mandarin/Chinese TTS frontend
Stars: ✭ 229 (-86.15%)
shairport-sync: AirPlay audio player. Shairport Sync adds multi-room capability with Audio Synchronisation
Stars: ✭ 5,532 (+234.46%)
Speaker adapted tts: Making a TTS model with 1 minute of speech samples within 10 minutes
Stars: ✭ 183 (-88.94%)
Java Speech Api: The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-70.37%)
Gst Tacotron: A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Stars: ✭ 175 (-89.42%)
Melnet: Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
Stars: ✭ 161 (-90.27%)
Keras Sincnet: Keras (TensorFlow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-97.16%)
mimic2: Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.
Stars: ✭ 537 (-67.53%)
Tts: Text-to-Speech for Arduino
Stars: ✭ 118 (-92.87%)
Tf Kaldi Speaker: Neural speaker recognition/verification system based on Kaldi and TensorFlow
Stars: ✭ 117 (-92.93%)
Kalliope: Kalliope is a framework that helps you create your own personal assistant.
Stars: ✭ 1,509 (-8.77%)
Openseq2seq: Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (-16.69%)
Drachtio Freeswitch Modules: A collection of open-sourced freeswitch modules that I use in various drachtio applications
Stars: ✭ 73 (-95.59%)
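Zhrtvc: Chinese real-time voice cloning (VC) and Chinese text to speech (TTS). An easy-to-use Chinese voice cloning and speech synthesis system that includes a speech encoder, a speech synthesizer, a vocoder, and a visualization module.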
Stars: ✭ 771 (-53.39%)
hifigan-denoiser: HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-94.68%)