LingvoLingvo
Stars: ✭ 2,361 (+787.59%)
simple diarizerSimplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Stars: ✭ 26 (-90.23%)
Asr EvaluationPython module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Stars: ✭ 190 (-28.57%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+103.76%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (-32.71%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+53.38%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+43.98%)
Listen Attend SpellA PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
Stars: ✭ 147 (-44.74%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-92.11%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (-91.73%)
leopard-chat-ui-teneoLeopard Chat UI - A Teneo Chat Client based on Vue and Vuetify
Stars: ✭ 65 (-75.56%)
kosrKorean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (-90.6%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (-22.93%)
Rnn TransducerMXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Stars: ✭ 114 (-57.14%)
KoLMKorean text normalization and language preparation package for LM in Kaldi-based ASR system
Stars: ✭ 46 (-82.71%)
E2e AsrPyTorch Implementations for End-to-End Automatic Speech Recognition
Stars: ✭ 106 (-60.15%)
speech courseYSDA course in Speech Processing.
Stars: ✭ 93 (-65.04%)
torch-asgAuto Segmentation Criterion (ASG) implemented in pytorch
Stars: ✭ 42 (-84.21%)
KtspeechcrawlerAutomatically constructing corpus for automatic speech recognition from YouTube videos
Stars: ✭ 92 (-65.41%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (+71.43%)
VoicerAGI-server voice recognizer for #Asterisk
Stars: ✭ 73 (-72.56%)
vosk-asteriskSpeech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-80.45%)
OpenasrA pytorch based end2end speech recognition system.
Stars: ✭ 69 (-74.06%)
AESRC2020Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).
Stars: ✭ 40 (-84.96%)
spinoramaA library to display and compare spinorama (speakers measurements) graphs.
Stars: ✭ 29 (-89.1%)
myG2PMyanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).
Stars: ✭ 43 (-83.83%)
speech-transformerTransformer implementation speciaized in speech recognition tasks using Pytorch.
Stars: ✭ 40 (-84.96%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+33.08%)
kaldi-python-ioA python IO interface for data accessing in kaldi
Stars: ✭ 39 (-85.34%)
WenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (+131.95%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-80.45%)
Speech TransformerA PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Stars: ✭ 565 (+112.41%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+96.24%)
kaldi helpers🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-95.11%)
NmtpytorchSequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (+47.37%)
Kaldi Active GrammarPython Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (-26.32%)
Kaldi OnnxKaldi model converter to ONNX
Stars: ✭ 174 (-34.59%)
rasrThe RWTH ASR Toolkit.
Stars: ✭ 43 (-83.83%)
Ctc pytorchCTC end -to-end ASR for timit and 863 corpus.
Stars: ✭ 161 (-39.47%)
pie百度云流式语音识别客户端 SDK
Stars: ✭ 62 (-76.69%)
KaldiioA pure python module for reading and writing kaldi ark files
Stars: ✭ 160 (-39.85%)
syn-speech-samplesAn application that demostrate the usage of Syn.Speech library for Speech Recognition
Stars: ✭ 24 (-90.98%)
Cn2an📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
Stars: ✭ 249 (-6.39%)
SetkTools for Speech Enhancement integrated with Kaldi
Stars: ✭ 227 (-14.66%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-92.11%)
Wukong Robot🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,还可能是首个支持脑机交互的开源智能音箱项目。
Stars: ✭ 3,110 (+1069.17%)
Speech Alignerspeech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
Stars: ✭ 259 (-2.63%)
UnityASRAutomatic Speech Recognition in Unity.
Stars: ✭ 14 (-94.74%)
dropclass speakerDropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Stars: ✭ 20 (-92.48%)
AESRC2020a deep accent recognition network
Stars: ✭ 35 (-86.84%)
KerasdeepspeechA Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (-7.89%)