Top 106 asr open source projects

kaldi-readers-for-tensorflow
readers that enable reading kaldi ark in tensorflow
Speech-Recognition
End-to-End Speech Recognition using Neural Networks.
torch-asg
Auto Segmentation Criterion (ASG) implemented in pytorch
kosr
Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
simple diarizer
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
kaldi-alligner
scripts to align a given wave to its transcription using trained models by Kaldi
lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
vosk-asterisk
Speech Recognition in Asterisk with Vosk Server
speech-transformer
Transformer implementation speciaized in speech recognition tasks using Pytorch.
torchain
WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
syn-speech-samples
An application that demostrate the usage of Syn.Speech library for Speech Recognition
klaam
Arabic speech recognition, classification and text-to-speech.
rustfst
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
spokestack-tray-android
A UI component that makes it easy to add voice interaction to your app.
PCPM
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
KoLM
Korean text normalization and language preparation package for LM in Kaldi-based ASR system
edit-distance-papers
A curated list of papers dedicated to edit-distance as objective function
speech course
YSDA course in Speech Processing.
ctc-asr
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
AESRC2020
Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).
avsr-tf1
Audio-Visual Speech Recognition using Sequence to Sequence Models
myG2P
Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).
opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Speech-Corpus-Collection
A Collection of Speech Corpus for ASR and TTS
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
pie
百度云流式语音识别客户端 SDK
megs
A merged version of multiple open-source German speech datasets.
rasr
The RWTH ASR Toolkit.
61-106 of 106 asr projects