End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-85.88%)

Mutual labels: voice-recognition, speech-recognition, speech-to-text

Asr audio data links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-63.84%)

Mutual labels: speech-recognition, speech-to-text, asr

Edgedict

Working online speech recognition based on RNN Transducer. (Trained model available in the releases.)

Stars: ✭ 205 (-42.09%)

Mutual labels: speech-recognition, speech-to-text, asr

vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Stars: ✭ 52 (-85.31%)

Mutual labels: speech-recognition, speech-to-text, asr

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (+47.46%)

Mutual labels: speech-recognition, speech-to-text, asr

Deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+5176.84%)

Mutual labels: speech-recognition, speech-to-text, on-device

deep avsr

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Stars: ✭ 104 (-70.62%)

Mutual labels: speech-recognition, automatic-speech-recognition, speech-to-text

Syn Speech

Syn.Speech is a flexible, speaker-independent continuous speech recognition engine for Mono and the .NET Framework

Stars: ✭ 57 (-83.9%)

Mutual labels: speech-recognition, speech-to-text, asr

Mongolian Speech Recognition

Mongolian speech recognition with PyTorch

Stars: ✭ 97 (-72.6%)

Mutual labels: speech-recognition, speech-to-text, asr

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-94.07%)

Mutual labels: speech-recognition, speech-to-text, asr

Py Kaldi Asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Stars: ✭ 156 (-55.93%)

Mutual labels: speech-recognition, asr

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Stars: ✭ 161 (-54.52%)

Mutual labels: speech-recognition, speech-to-text

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (-53.39%)

Mutual labels: speech-recognition, speech-to-text

Mycroft Precise

A lightweight, simple-to-use, RNN wake word listener

Stars: ✭ 481 (+35.88%)

Mutual labels: voice-recognition, speech-recognition

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

Stars: ✭ 146 (-58.76%)

Mutual labels: speech-recognition, speech-to-text

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+492.37%)

Mutual labels: speech-recognition, asr

Naomi

The Naomi Project is an open source, technology-agnostic platform for developing always-on, voice-controlled applications!

Stars: ✭ 171 (-51.69%)

Mutual labels: speech-recognition, speech-to-text

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (-85.03%)

Mutual labels: speech-recognition, speech-to-text

1-60 of 450 similar projects
