Parallel-Tacotron2PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Stars: ✭ 149 (-89.19%)
Alan Sdk CordovaAlan AI Cordova SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 269 (-80.48%)
Glow TtsA Generative Flow for Text-to-Speech via Monotonic Alignment Search
Stars: ✭ 284 (-79.39%)
DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (-11.54%)
speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-97.46%)
Angle⦠ Angle: new speakable syntax for python 💡
Stars: ✭ 61 (-95.57%)
porfirГолосовой ассистент Порфирьевич
Stars: ✭ 23 (-98.33%)
Alan Sdk IosAlan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.
Stars: ✭ 318 (-76.92%)
DeepspeechDeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+1255.59%)
Cs224n Gpu That TalksAttention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
Stars: ✭ 52 (-96.23%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-91.07%)
Comprehensive-Tacotron2PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
Stars: ✭ 22 (-98.4%)
ParakeetPAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)
Stars: ✭ 279 (-79.75%)
Alan Sdk IonicAlan AI Ionic SDK adds a voice assistant or chatbot to your app. Supports React, Angular.
Stars: ✭ 287 (-79.17%)
edittsOfficial implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (-94.63%)
Alan Sdk FlutterAlan AI Flutter SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 309 (-77.58%)
Cognitive Speech TtsMicrosoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Stars: ✭ 312 (-77.36%)
Wav2letterSpeech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (-94.34%)
NnmnkwiiLibrary to build speech synthesis systems designed for easy and fast prototyping.
Stars: ✭ 308 (-77.65%)
Multilingual text to speechAn implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Stars: ✭ 324 (-76.49%)
Neural-HMMNeural HMMs are all you need (for high-quality attention-free TTS)
Stars: ✭ 69 (-94.99%)
BrevitasBrevitas: quantization-aware training in PyTorch
Stars: ✭ 343 (-75.11%)
Hifi GanHiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Stars: ✭ 325 (-76.42%)
EspeakeSpeak NG is an open source speech synthesizer that supports 101 languages and accents.
Stars: ✭ 339 (-75.4%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (-1.52%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (-72.86%)
Alan Sdk WebAlan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.
Stars: ✭ 368 (-73.29%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+228.96%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (-72.21%)
NeuralmonkeyAn open-source tool for sequence learning in NLP built on TensorFlow.
Stars: ✭ 400 (-70.97%)
CtcwordbeamsearchConnectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (-71.12%)
Tensorflowasr⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (-70.97%)
Voice BuilderAn opensource text-to-speech (TTS) voice building tool
Stars: ✭ 362 (-73.73%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-71.48%)
RhinoOn-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (-70.54%)
Asrt speechrecognitionA Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+258.71%)
Nlp Librarycurated collection of papers for the nlp practitioner 📖👩🔬
Stars: ✭ 1,025 (-25.62%)
Voice Overlay Ios🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (-68.07%)
Nmt KerasNeural Machine Translation with Keras
Stars: ✭ 501 (-63.64%)
CtcdecoderConnectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Stars: ✭ 529 (-61.61%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (-62.12%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-61.39%)
Nmt ListA list of Neural MT implementations
Stars: ✭ 359 (-73.95%)
JoeynmtMinimalist NMT for educational purposes
Stars: ✭ 420 (-69.52%)
Seq2seq.pytorchSequence-to-Sequence learning using PyTorch
Stars: ✭ 514 (-62.7%)
Tacotron2A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Stars: ✭ 43 (-96.88%)