wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 6,026 (+2.01%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (-23.26%)
EspressoEspresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (-86.32%)
E2e AsrPyTorch Implementations for End-to-End Automatic Speech Recognition
Stars: ✭ 106 (-98.21%)
Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-99.64%)
KospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (-96.78%)
Transformer-TransducerPyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
Stars: ✭ 61 (-98.97%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (-92.28%)
Rnn TransducerMXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Stars: ✭ 114 (-98.07%)
kosrKorean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (-99.58%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-97.92%)
NmtpytorchSequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (-93.36%)
Mycroft PreciseA lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (-91.86%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (-93.67%)
Alan Sdk WebAlan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.
Stars: ✭ 368 (-93.77%)
Stn OcrCode for the paper STN-OCR: A single Neural Network for Text Detection and Text Recognition
Stars: ✭ 473 (-91.99%)
BrevitasBrevitas: quantization-aware training in PyTorch
Stars: ✭ 343 (-94.19%)
CodeceptionFull-stack testing PHP framework
Stars: ✭ 4,401 (-25.5%)
J.a.r.v.i.spython powered Intelligent System
Stars: ✭ 325 (-94.5%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-93.35%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-91.7%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (-93.52%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (-90.82%)
SubsyncSubtitle Speech Synchronizer
Stars: ✭ 379 (-93.58%)
Libreasr💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (-89.28%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (-94.01%)
RhasspyOffline private voice assistant for many human languages
Stars: ✭ 458 (-92.25%)
DeepspeechDeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+216.23%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-90.99%)
Alan Sdk IosAlan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.
Stars: ✭ 318 (-94.62%)
Speech recognitionSpeech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+1.56%)
VadVoice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (-89.47%)
CtcdecoderConnectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Stars: ✭ 529 (-91.04%)
UspeechSpeech recognition toolkit for the arduino
Stars: ✭ 448 (-92.42%)
Alan Sdk FlutterAlan AI Flutter SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 309 (-94.77%)
Speech recognitionA Flutter plugin to use speech recognition on iOS & Android (Swift/Java)
Stars: ✭ 302 (-94.89%)
Voice Overlay Ios🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (-92.55%)
Pocketsphinx PythonPython interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (-94.96%)
Alan Sdk IonicAlan AI Ionic SDK adds a voice assistant or chatbot to your app. Supports React, Angular.
Stars: ✭ 287 (-95.14%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (-91.16%)
Asrt speechrecognitionA Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (-16.32%)
Wire Ios📱 Wire for iOS (iPhone and iPad)
Stars: ✭ 3,079 (-47.88%)
Alan Sdk AndroidAlan AI Android SDK adds a voice assistant or chatbot to your app. Supports Java, Kotlin.
Stars: ✭ 278 (-95.29%)
SpecaugmentA Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Stars: ✭ 408 (-93.09%)
Vosk ServerWebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (-95.31%)
Neuraldialog CvaeTensorflow Implementation of Knowledge-Guided CVAE for dialog generation ACL 2017. It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU
Stars: ✭ 279 (-95.28%)
WenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (-89.55%)
RhinoOn-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (-93.13%)