spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-43.48%)
Annie👾 Fast and simple video download library and CLI tool written in Go
Stars: ✭ 16,369 (+17692.39%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+489.13%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-70.65%)
J.a.r.v.i.spython powered Intelligent System
Stars: ✭ 325 (+253.26%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+343.48%)
ZerothKaldi-based Korean ASR (한국어 음성인식) open-source project
Stars: ✭ 248 (+169.57%)
Keras SincnetKeras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-48.91%)
Cn2an📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
Stars: ✭ 249 (+170.65%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+94.57%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-77.17%)
rustfstRust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+13.04%)
wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+2491.3%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (-60.87%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (+395.65%)
Vosk Android DemoOffline speech recognition for Android with Vosk library.
Stars: ✭ 271 (+194.57%)
Vosk ServerWebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (+201.09%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+306.52%)
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+730.43%)
EspressoEspresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (+778.26%)
OpenasrA pytorch based end2end speech recognition system.
Stars: ✭ 69 (-25%)
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+122.83%)
Youtube ProjectsThis repository contains all the code I use in my YouTube tutorials.
Stars: ✭ 144 (+56.52%)
LingvoLingvo
Stars: ✭ 2,361 (+2466.3%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+122.83%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+284.78%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-77.17%)
KospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (+106.52%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-77.17%)
syn-speech-samplesAn application that demostrate the usage of Syn.Speech library for Speech Recognition
Stars: ✭ 24 (-73.91%)
ctc-asrEnd-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (+21.74%)
vosk-asteriskSpeech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-43.48%)
kosrKorean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (-72.83%)
Asr EvaluationPython module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Stars: ✭ 190 (+106.52%)
Wav2letterSpeech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (-15.22%)
UnityASRAutomatic Speech Recognition in Unity.
Stars: ✭ 14 (-84.78%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (-76.09%)
NmtpytorchSequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (+326.09%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+316.3%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+467.39%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+33.7%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+721.74%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (+702.17%)
Social ScraperTổng hợp script crawl dữ liệu từ các mạng xã hội & website tiếng Việt
Stars: ✭ 47 (-48.91%)
Libreasr💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (+588.04%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+2179.35%)
download audioset📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (-42.39%)
WenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (+570.65%)
Syn SpeechSyn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-38.04%)
Asr benchmarkProgram to benchmark various speech recognition APIs
Stars: ✭ 71 (-22.83%)