kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (-43.56%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (-8.66%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (-53.71%)
E2e AsrPyTorch Implementations for End-to-End Automatic Speech Recognition
Stars: ✭ 106 (-86.88%)
ZerothKaldi-based Korean ASR (한국어 음성인식) open-source project
Stars: ✭ 248 (-69.31%)
Speech To Text RussianПроект для распознавания речи на русском языке на основе pykaldi.
Stars: ✭ 151 (-81.31%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (-6.44%)
Py Kaldi AsrSome simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (-80.69%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+67.95%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+159.53%)
kosrKorean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (-96.91%)
Vosk Android DemoOffline speech recognition for Android with Vosk library.
Stars: ✭ 271 (-66.46%)
rustfstRust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (-87.13%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+461.01%)
Rnn TransducerMXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Stars: ✭ 114 (-85.89%)
KospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (-76.49%)
Vosk ServerWebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (-65.72%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (-77.85%)
Speech TransformerA PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Stars: ✭ 565 (-30.07%)
asr2424-hour Automatic Speech Recognition
Stars: ✭ 27 (-96.66%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (-74.63%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-97.4%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (-56.19%)
ctc-asrEnd-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (-86.14%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (-32.92%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-97.4%)
Wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 5,907 (+631.06%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (-49.5%)
kaldi ag trainingDocker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-98.27%)
Transformer-TransducerPyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
Stars: ✭ 61 (-92.45%)
syn-speech-samplesAn application that demostrate the usage of Syn.Speech library for Speech Recognition
Stars: ✭ 24 (-97.03%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-97.4%)
wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+195.05%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-51.36%)
NmtpytorchSequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (-51.49%)
torchainWIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
Stars: ✭ 20 (-97.52%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (-35.4%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (-52.6%)
Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-97.4%)
vosk-asteriskSpeech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-93.56%)
speech-transformerTransformer implementation speciaized in speech recognition tasks using Pytorch.
Stars: ✭ 40 (-95.05%)
opensnipsOpen source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (-93.81%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (-95.54%)
wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 6,026 (+645.79%)
kaldi-allignerscripts to align a given wave to its transcription using trained models by Kaldi
Stars: ✭ 24 (-97.03%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-96.66%)
Libreasr💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (-21.66%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (-97.28%)