kaldi-allignerscripts to align a given wave to its transcription using trained models by Kaldi
Stars: ✭ 24 (+50%)
rustfstRust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+550%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (+31.25%)
rasrThe RWTH ASR Toolkit.
Stars: ✭ 43 (+168.75%)
AESRC2020Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).
Stars: ✭ 40 (+150%)
syn-speech-samplesAn application that demostrate the usage of Syn.Speech library for Speech Recognition
Stars: ✭ 24 (+50%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+1181.25%)
KerasdeepspeechA Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+1431.25%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (+31.25%)
KospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (+1087.5%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (+2750%)
myG2PMyanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).
Stars: ✭ 43 (+168.75%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (+31.25%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+1018.75%)
wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+14800%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+2112.5%)
kosrKorean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (+56.25%)
Cn2an📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
Stars: ✭ 249 (+1456.25%)
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+1181.25%)
NTUA-slp-nlp💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA
Stars: ✭ 19 (+18.75%)
Hms Ml DemoHMS ML Demo provides an example of integrating Huawei ML Kit service into applications. This example demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, asr, and tts.
Stars: ✭ 187 (+1068.75%)
KoLMKorean text normalization and language preparation package for LM in Kaldi-based ASR system
Stars: ✭ 46 (+187.5%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+13006.25%)
ctc-asrEnd-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (+600%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (+68.75%)
simple diarizerSimplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Stars: ✭ 26 (+62.5%)
avsr-tf1Audio-Visual Speech Recognition using Sequence to Sequence Models
Stars: ✭ 76 (+375%)
torchainWIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
Stars: ✭ 20 (+25%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (+225%)
AESRC2020a deep accent recognition network
Stars: ✭ 35 (+118.75%)
asr2424-hour Automatic Speech Recognition
Stars: ✭ 27 (+68.75%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (+125%)
pie百度云流式语音识别客户端 SDK
Stars: ✭ 62 (+287.5%)
klaamArabic speech recognition, classification and text-to-speech.
Stars: ✭ 151 (+843.75%)
torch-asgAuto Segmentation Criterion (ASG) implemented in pytorch
Stars: ✭ 42 (+162.5%)
Mrcp Plugin With Freeswitch使用FreeSWITCH接受用户手机呼叫,通过UniMRCP Server集成讯飞开放平台(xfyun)插件将用户语音进行语音识别(ASR),并根据自定义业务逻辑调用语音合成(TTS),构建简单的端到端语音呼叫中心。
Stars: ✭ 168 (+950%)
Wukong Robot🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,还可能是首个支持脑机交互的开源智能音箱项目。
Stars: ✭ 3,110 (+19337.5%)
spokestack-tray-androidA UI component that makes it easy to add voice interaction to your app.
Stars: ✭ 13 (-18.75%)
ZerothKaldi-based Korean ASR (한국어 음성인식) open-source project
Stars: ✭ 248 (+1450%)
opensnipsOpen source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (+212.5%)
kaldi ag trainingDocker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-12.5%)
LingvoLingvo
Stars: ✭ 2,361 (+14656.25%)
Asr EvaluationPython module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Stars: ✭ 190 (+1087.5%)
vosk-asteriskSpeech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (+225%)
edit-distance-papersA curated list of papers dedicated to edit-distance as objective function
Stars: ✭ 49 (+206.25%)
Speech-RecognitionEnd-to-End Speech Recognition using Neural Networks.
Stars: ✭ 31 (+93.75%)
sepia-stt-serverSEPIA server to support open-source speech recognition via WebSocket connection.
Stars: ✭ 45 (+181.25%)
speech-transformerTransformer implementation speciaized in speech recognition tasks using Pytorch.
Stars: ✭ 40 (+150%)
speech courseYSDA course in Speech Processing.
Stars: ✭ 93 (+481.25%)