Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+1676.74%)
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+1113.95%)
E2e Asr
PyTorch Implementations for End-to-End Automatic Speech Recognition
Stars: ✭ 106 (+146.51%)
Asr
Stars: ✭ 54 (+25.58%)
Vosk Android Demo
Offline speech recognition for Android with the Vosk library.
Stars: ✭ 271 (+530.23%)
Wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (+1334.88%)
Zamia Speech
Open tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+769.77%)
Openasr
A PyTorch-based end-to-end speech recognition system.
Stars: ✭ 69 (+60.47%)
demo vietasr
Vietnamese Speech Recognition
Stars: ✭ 22 (-48.84%)
Listen Attend Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
Stars: ✭ 147 (+241.86%)
Asrgen
Attacking Speaker Recognition with Deep Generative Models
Stars: ✭ 31 (-27.91%)
Asr Evaluation
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Stars: ✭ 190 (+341.86%)
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+1616.28%)
Rnn Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Stars: ✭ 114 (+165.12%)
Speech Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR system with a Transformer network, on Mandarin Chinese.
Stars: ✭ 565 (+1213.95%)
Nmtpytorch
Sequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (+811.63%)
Delta
DELTA is a deep-learning-based natural language and speech processing platform.
Stars: ✭ 1,479 (+3339.53%)
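Mrcp Plugin With Freeswitch
Uses FreeSWITCH to accept calls from users' mobile phones, runs automatic speech recognition (ASR) on the caller's speech through an iFLYTEK Open Platform (xfyun) plugin integrated via UniMRCP Server, and invokes text-to-speech (TTS) according to custom business logic, building a simple end-to-end voice call center.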
Stars: ✭ 168 (+290.7%)
spinorama
A library to display and compare spinorama (speaker measurement) graphs.
Stars: ✭ 29 (-32.56%)
Ktspeechcrawler
Automatically constructing a corpus for automatic speech recognition from YouTube videos
Stars: ✭ 92 (+113.95%)
Asr benchmark
Program to benchmark various speech recognition APIs
Stars: ✭ 71 (+65.12%)
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+186.05%)
Speech To Text Russian
A project for Russian speech recognition based on pykaldi.
Stars: ✭ 151 (+251.16%)
Syn Speech
Syn.Speech is a flexible, speaker-independent continuous speech recognition engine for Mono and the .NET Framework
Stars: ✭ 57 (+32.56%)
Kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (+341.86%)
Keras Sincnet
Keras (TensorFlow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (+9.3%)
Asr audio data links
A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (+197.67%)
Espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (+1779.07%)
Kerasdeepspeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+469.77%)
Pykaldi
A Python wrapper for Kaldi
Stars: ✭ 756 (+1658.14%)
Libreasr
💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (+1372.09%)
Hms Ml Demo
HMS ML Demo provides an example of integrating the Huawei ML Kit service into applications. This example demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, ASR, and TTS.
Stars: ✭ 187 (+334.88%)
Open stt
Open STT
Stars: ✭ 584 (+1258.14%)
Deepspeechrecognition
A Chinese Deep Speech Recognition System, including a deep-learning-based acoustic model and a deep-learning-based language model
Stars: ✭ 1,421 (+3204.65%)
Athena
An open-source implementation of a sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+1160.47%)
Cn2an
📦 Quickly converts between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)
Stars: ✭ 249 (+479.07%)
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+848.84%)
Bigcidian
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
Stars: ✭ 99 (+130.23%)
Cheetah
On-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+790.7%)
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
Stars: ✭ 2,097 (+4776.74%)
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+3055.81%)
Vosk Server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (+544.19%)
Edgedict
Working online speech recognition based on RNN Transducer. (Trained model available in release.)
Stars: ✭ 205 (+376.74%)
UnityASR
Automatic Speech Recognition in Unity.
Stars: ✭ 14 (-67.44%)
Py Kaldi Asr
Some simple wrappers around kaldi-asr intended to make using Kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (+262.79%)
Wav2letter
Speech recognition model based on a FAIR research paper, built using PyTorch.
Stars: ✭ 78 (+81.4%)
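Wukong Robot
🤖 wukong-robot is a simple, flexible, and elegant Chinese voice dialogue robot / smart speaker project, and possibly the first open-source smart speaker project to support brain-computer interaction.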
Stars: ✭ 3,110 (+7132.56%)
Zeroth
Kaldi-based Korean ASR (Korean speech recognition) open-source project
Stars: ✭ 248 (+476.74%)
Lingvo
Lingvo
Stars: ✭ 2,361 (+5390.7%)
Speecht
Open-source speech-to-text software written in TensorFlow
Stars: ✭ 152 (+253.49%)
Voicer
AGI-server voice recognizer for #Asterisk
Stars: ✭ 73 (+69.77%)