Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (-70.19%)

Mutual labels: speech-recognition, speech-to-text, ctc

Zeroth

Kaldi-based Korean ASR (한국어 음성인식) open-source project

Stars: ✭ 248 (-66.4%)

Mutual labels: speech-recognition, asr, kaldi

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (-52.03%)

Mutual labels: speech-recognition, speech-to-text, asr

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (-83.33%)

Mutual labels: speech-recognition, speech-to-text, asr

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (-72.22%)

Mutual labels: speech-recognition, speech-to-text, asr

speech-recognition

SDKs and docs for Skit's speech to text service

Stars: ✭ 20 (-97.29%)

Mutual labels: speech-recognition, speech-to-text, asr

demo vietasr

Vietnamese Speech Recognition

Stars: ✭ 22 (-97.02%)

Mutual labels: speech-recognition, speech-to-text, asr

Neural sp

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (-44.72%)

Mutual labels: speech-recognition, asr, ctc

PCPM

Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.

Stars: ✭ 21 (-97.15%)

Mutual labels: speech-recognition, speech-to-text, asr

Lingvo

Stars: ✭ 2,361 (+219.92%)

Mutual labels: speech-recognition, speech-to-text, asr

Awesome Kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (-46.75%)

Mutual labels: speech-recognition, speech-to-text, kaldi

Kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Stars: ✭ 245 (-66.8%)

Mutual labels: speech-to-text, asr, ctc

Cheetah

On-device streaming speech-to-text engine powered by deep learning

Stars: ✭ 383 (-48.1%)

Mutual labels: speech-recognition, speech-to-text, asr

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (-75.75%)

Mutual labels: speech-recognition, speech-to-text, asr

Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

Stars: ✭ 1,120 (+51.76%)

Mutual labels: speech-recognition, speech-to-text, kaldi

Syn Speech

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Stars: ✭ 57 (-92.28%)

Mutual labels: speech-recognition, speech-to-text, asr

Mongolian Speech Recognition

Mongolian speech recognition with PyTorch

Stars: ✭ 97 (-86.86%)

Mutual labels: speech-recognition, speech-to-text, asr

Espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Stars: ✭ 808 (+9.49%)

Mutual labels: speech-recognition, asr, kaldi

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+1410.98%)

Mutual labels: speech-recognition, speech-to-text, kaldi

Vosk Android Demo

Offline speech recognition for Android with Vosk library.

Stars: ✭ 271 (-63.28%)

Mutual labels: speech-recognition, asr, kaldi

Vosk Server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Stars: ✭ 277 (-62.47%)

Mutual labels: speech-recognition, asr, kaldi

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Stars: ✭ 4,943 (+569.78%)

Mutual labels: speech-recognition, speech-to-text, ctc

speech-to-text

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

Stars: ✭ 61 (-91.73%)

Mutual labels: speech-recognition, speech-to-text, kaldi

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (-72.22%)

Mutual labels: speech-recognition, speech-to-text, asr

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-97.15%)

Mutual labels: speech-recognition, speech-to-text, asr

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (-73.44%)

Mutual labels: speech-recognition, speech-to-text, kaldi

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+184.15%)

Mutual labels: speech-recognition, asr, kaldi

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (-96.61%)

Mutual labels: speech-recognition, speech-to-text, asr

rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (-85.91%)

Mutual labels: speech-recognition, kaldi, asr

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-96.34%)

Mutual labels: speech-recognition, speech-to-text, asr

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (-27.91%)

Mutual labels: speech-recognition, speech-to-text

Ctcwordbeamsearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.