rustfst: Rust re-implementation of OpenFST, a library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+420%)
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (+3940%)
opensnips: Open source projects related to Snips (https://snips.ai/).
Stars: ✭ 50 (+150%)
Vosk Android Demo: Offline speech recognition for Android with the Vosk library.
Stars: ✭ 271 (+1255%)
Py Kaldi Asr: Simple wrappers around kaldi-asr intended to make using Kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (+680%)
asr24: 24-hour Automatic Speech Recognition
Stars: ✭ 27 (+35%)
kaldi-alligner: Scripts to align a given wave file to its transcription using models trained with Kaldi
Stars: ✭ 24 (+20%)
Pykaldi: A Python wrapper for Kaldi
Stars: ✭ 756 (+3680%)
Vosk Server: WebSocket, gRPC and WebRTC speech recognition server based on the Vosk and Kaldi libraries
Stars: ✭ 277 (+1285%)
Eesen: The official repository of the Eesen project
Stars: ✭ 738 (+3590%)
Pytorch Kaldi: pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
Stars: ✭ 2,097 (+10385%)
Vosk Api: Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+6685%)
Zamia Speech: Open tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+1770%)
Zeroth: Kaldi-based Korean ASR (Korean speech recognition) open-source project
Stars: ✭ 248 (+1140%)
Setk: Tools for speech enhancement integrated with Kaldi
Stars: ✭ 227 (+1035%)
AESRC2020: Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).
Stars: ✭ 40 (+100%)
Kaldi Onnx: Kaldi model converter to ONNX
Stars: ✭ 174 (+770%)
Ctc pytorch: CTC end-to-end ASR for the TIMIT and 863 corpora.
Stars: ✭ 161 (+705%)
PCPM: Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (+5%)
myG2P: Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).
Stars: ✭ 43 (+115%)
Pykaldi2: Yet another speech toolkit based on Kaldi and PyTorch
Stars: ✭ 158 (+690%)
Kaldi Active Grammar: Python Kaldi speech recognition with grammars that can be set active or inactive dynamically at decode time
Stars: ✭ 196 (+880%)
kaldi ag training: Docker image and scripts for training fine-tuned or completely personal Kaldi speech models, particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-30%)
avsr-tf1: Audio-Visual Speech Recognition using Sequence to Sequence Models
Stars: ✭ 76 (+280%)
Kaldiio: A pure Python module for reading and writing Kaldi ark files
Stars: ✭ 160 (+700%)
klaam: Arabic speech recognition, classification and text-to-speech.
Stars: ✭ 151 (+655%)
Eend: End-to-End Neural Diarization
Stars: ✭ 153 (+665%)
opensource-voice-tools: A repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (+5%)
kaldi-python-io: A Python I/O interface for accessing data in Kaldi
Stars: ✭ 39 (+95%)
Kaldi: kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+55655%)
Tf Kaldi Speaker: Neural speaker recognition/verification system based on Kaldi and TensorFlow
Stars: ✭ 117 (+485%)
AESRC2020: A deep accent recognition network
Stars: ✭ 35 (+75%)
KoLM: Korean text normalization and language preparation package for LMs in Kaldi-based ASR systems
Stars: ✭ 46 (+130%)
Kaldi Gop: Computes the GMM-based Goodness of Pronunciation (GOP). Based on Kaldi.
Stars: ✭ 104 (+420%)
Elpis: 🙊 WIP software for creating speech recognition models.
Stars: ✭ 101 (+405%)
ASR-Audio-Data-Links: A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+795%)
edit-distance-papers: A curated list of papers dedicated to edit distance as an objective function
Stars: ✭ 49 (+145%)
Factorized Tdnn: PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Stars: ✭ 98 (+390%)
Plda: An LDA/PLDA estimator using Kaldi in Python for speaker verification tasks
Stars: ✭ 85 (+325%)
wav2vec2-live: Live speech recognition using Facebook's wav2vec 2.0 model.
Stars: ✭ 205 (+925%)
Dragonfire: The open-source virtual assistant for Ubuntu-based Linux distributions
Stars: ✭ 1,120 (+5500%)
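Nhyai: AI-powered content review supporting pornography detection, violence and terrorism detection, language identification, sensitive-text detection, and video detection, plus a range of OCR capabilities such as recognition of ID cards, driver's licenses, vehicle licenses, business licenses, bank cards, handwriting, license plates, and business cards. The features can be tried out on the project website.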
Stars: ✭ 60 (+200%)
speech course: YSDA course in Speech Processing.
Stars: ✭ 93 (+365%)
pie: Baidu Cloud streaming speech recognition client SDK
Stars: ✭ 62 (+210%)
Voxceleb Ivector: VoxCeleb1 i-vector based speaker recognition system
Stars: ✭ 36 (+80%)
Theano Kaldi Rnn: THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano code is coupled with the Kaldi decoder.
Stars: ✭ 31 (+55%)