Vosk Api: Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+254.31%)
Rhino: On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (+6.01%)
spokestack-ios: Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-92.95%)
Naomi: The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (-55.35%)
Spokestack Python: Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-73.11%)
leopard: On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (-7.57%)
KeenASR-Android-PoC: A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-94.52%)
open-speech-corpora: 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+119.58%)
Voice Overlay Ios: 🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (+14.88%)
PCPM: Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-94.52%)
Voice Overlay Android: 🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (-50.65%)
ASR-Audio-Data-Links: A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (-53.26%)
octopus: On-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-92.17%)
vosk-asterisk: Speech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-86.42%)
Openasr: A PyTorch-based end-to-end speech recognition system.
Stars: ✭ 69 (-81.98%)
Silero Models: Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+36.29%)
Wav2letter: Speech recognition model based on a FAIR research paper, built using PyTorch.
Stars: ✭ 78 (-79.63%)
Speech To Text Russian: A project for Russian-language speech recognition based on pykaldi.
Stars: ✭ 151 (-60.57%)
voce-browser: Voice Controlled Chromium Web Browser
Stars: ✭ 34 (-91.12%)
sova-asr: SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-67.89%)
demo vietasr: Vietnamese Speech Recognition
Stars: ✭ 22 (-94.26%)
Mycroft Precise: A lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (+25.59%)
Raylib: A simple and easy-to-use library to enjoy videogames programming
Stars: ✭ 8,169 (+2032.9%)
Porcupine: On-device wake word detection powered by deep learning.
Stars: ✭ 2,606 (+580.42%)
Vosk Android Demo: Offline speech recognition for Android with Vosk library.
Stars: ✭ 271 (-29.24%)
wav2vec2-live: Live speech recognition using Facebook's wav2vec 2.0 model.
Stars: ✭ 205 (-46.48%)
picovoice: The end-to-end platform for building voice products at scale
Stars: ✭ 316 (-17.49%)
Deepspeech: DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+4777.28%)
Syn Speech: Syn.Speech is a flexible, speaker-independent, continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-85.12%)
Eesen: The official repository of the Eesen project
Stars: ✭ 738 (+92.69%)
Asr benchmark: Program to benchmark various speech recognition APIs
Stars: ✭ 71 (-81.46%)
Sonus: 💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+38.9%)
Asr audio data links: A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-66.58%)
spokestack-android: Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-86.42%)
Lingvo: Lingvo
Stars: ✭ 2,361 (+516.45%)
leon: 🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+2134.99%)
Edgedict: Working online speech recognition based on RNN Transducer. (Trained model available in the releases.)
Stars: ✭ 205 (-46.48%)
Vosk: VOSK Speech Recognition Toolkit
Stars: ✭ 182 (-52.48%)
megs: A merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-94.52%)
AmazonSpeechTranslator: End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-86.95%)
scripty: Speech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-96.34%)
rustfst: Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (-72.85%)
kaldi ag training: Docker image and scripts for training fine-tuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-96.34%)
wenet: Production First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+522.45%)
syn-speech-samples: An application that demonstrates the usage of the Syn.Speech library for Speech Recognition
Stars: ✭ 24 (-93.73%)
cobra: On-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (-80.16%)
deepspeech.mxnet: An MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-78.59%)
Unity live caption: Use the Google Speech-to-Text API for real-time live-stream captioning in Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-93.21%)
deep avsr: A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (-72.85%)
speech to text: How to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-90.86%)