Factorized TdnnPyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Stars: ✭ 98 (-5.77%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+1204.81%)
rustfstRust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+0%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+259.62%)
Kaldi Active GrammarPython Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (+88.46%)
Py Kaldi AsrSome simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (+50%)
KaldiioA pure python module for reading and writing kaldi ark files
Stars: ✭ 160 (+53.85%)
kaldi ag trainingDocker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-86.54%)
Vosk Android DemoOffline speech recognition for Android with Vosk library.
Stars: ✭ 271 (+160.58%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+277.88%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+626.92%)
Dragonfirethe open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+976.92%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+4258.65%)
speech-to-textmixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-41.35%)
Kaldi OnnxKaldi model converter to ONNX
Stars: ✭ 174 (+67.31%)
Vosk ServerWebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (+166.35%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+10622.12%)
Speech To Text RussianПроект для распознавания речи на русском языке на основе pykaldi.
Stars: ✭ 151 (+45.19%)
ZerothKaldi-based Korean ASR (한국어 음성인식) open-source project
Stars: ✭ 248 (+138.46%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+1916.35%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (+609.62%)
EspressoEspresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (+676.92%)
SpeechlessSpeech-to-text based on wav2letter built for transfer learning
Stars: ✭ 89 (-14.42%)
Patterspeech-to-text in pytorch
Stars: ✭ 71 (-31.73%)
OpenasrA pytorch based end2end speech recognition system.
Stars: ✭ 69 (-33.65%)
Ivector XvectorExtract xvector and ivector under kaldi
Stars: ✭ 67 (-35.58%)
AudiomatePython library for handling audio datasets.
Stars: ✭ 99 (-4.81%)
Speech aiSimple speech linguistic AI with Python
Stars: ✭ 66 (-36.54%)
PldaAn LDA/PLDA estimator using KALDI in python for speaker verification tasks
Stars: ✭ 85 (-18.27%)
PapersA list of paper, books and sites for various different topics related to machine learning and deep learning along with various field in which it is implemented
Stars: ✭ 63 (-39.42%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+1225%)
Angle⦠ Angle: new speakable syntax for python 💡
Stars: ✭ 61 (-41.35%)
JuliusOpen-Source Large Vocabulary Continuous Speech Recognition Engine
Stars: ✭ 1,258 (+1109.62%)
NhyaiAI智能审查,支持色情识别、暴恐识别、语言识别、敏感文字检测和视频检测等功能,以及各种OCR识别能力,如身份证、驾照、行驶证、营业执照、银行卡、手写体、车牌和名片识别等功能,可以访问网站体验功能。
Stars: ✭ 60 (-42.31%)
Masr中文语音识别; Mandarin Automatic Speech Recognition;
Stars: ✭ 1,246 (+1098.08%)
Syn SpeechSyn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-45.19%)
B.e.n.j.i.B.E.N.J.I.- The Impossible Missions Force's digital assistant
Stars: ✭ 83 (-20.19%)
BiglittlenetOfficial repository for Big-Little Net
Stars: ✭ 57 (-45.19%)
Iflytek awaken asruse iflytek's technology to realize awaken and order recognition
Stars: ✭ 53 (-49.04%)
Laibot Client开源人工智能,基于开源软硬件构建语音对话机器人、智能音箱……人机对话、自然交互,来宝拥有无限可能。特别说明,来宝运行于Python 3!
Stars: ✭ 81 (-22.12%)
Parrots Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese.
Stars: ✭ 48 (-53.85%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-0.96%)
Ai Study人工智能学习资料超全整理,包含机器学习基础ML、深度学习基础DL、计算机视觉CV、自然语言处理NLP、推荐系统、语音识别、图神经网路、算法工程师面试题
Stars: ✭ 93 (-10.58%)
Deepspeech Websocket ServerServer & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
Stars: ✭ 79 (-24.04%)
Keras SincnetKeras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-54.81%)
Py NltoolsA collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-55.77%)
DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+1072.12%)
Cortex M KwsCortex M KWS example with Tengine Lite.
Stars: ✭ 45 (-56.73%)
Formant AnalyzeriOS application for finding formants in spoken sounds
Stars: ✭ 43 (-58.65%)
KtspeechcrawlerAutomatically constructing corpus for automatic speech recognition from YouTube videos
Stars: ✭ 92 (-11.54%)
Sytodya Flutter "speech to todo" app example
Stars: ✭ 79 (-24.04%)