DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+4777.28%)

Mutual labels: speech-recognition, speech-to-text, offline

Syn Speech

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Stars: ✭ 57 (-85.12%)

Mutual labels: speech-recognition, speech-to-text, asr

Eesen

The official repository of the Eesen project

Stars: ✭ 738 (+92.69%)

Mutual labels: speech-recognition, speech-to-text, asr

Asr benchmark

Program to benchmark various speech recognition APIs

Stars: ✭ 71 (-81.46%)

Mutual labels: speech-recognition, asr, voice-recognition

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+38.9%)

Mutual labels: speech-recognition, speech-to-text, voice-recognition

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-66.58%)

Mutual labels: speech-recognition, speech-to-text, asr

Mongolian Speech Recognition

Mongolian speech recognition with PyTorch

Stars: ✭ 97 (-74.67%)

Mutual labels: speech-recognition, speech-to-text, asr

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-86.42%)

Mutual labels: voice-recognition, speech-recognition, asr

Lingvo

Stars: ✭ 2,361 (+516.45%)

Mutual labels: speech-recognition, speech-to-text, asr

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+2134.99%)

Mutual labels: offline, speech-recognition, speech-to-text

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (-46.48%)

Mutual labels: speech-recognition, speech-to-text, asr

Vosk

VOSK Speech Recognition Toolkit

Stars: ✭ 182 (-52.48%)

Mutual labels: speech-recognition, speech-to-text, voice-recognition

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-94.52%)

Mutual labels: speech-recognition, speech-to-text, asr

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (-93.47%)

Mutual labels: speech-recognition, speech-to-text, asr

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-86.95%)

Mutual labels: voice-recognition, speech-recognition, speech-to-text

scripty

Speech to text bot for Discord using Mozilla's DeepSpeech

Stars: ✭ 14 (-96.34%)

Mutual labels: speech-recognition, speech-to-text

rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (-72.85%)

Mutual labels: speech-recognition, asr

Chinese-automatic-speech-recognition

Chinese speech recognition

Stars: ✭ 147 (-61.62%)

Mutual labels: speech-recognition, speech-to-text

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-96.34%)

Mutual labels: speech-recognition, speech-to-text

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit