Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (-27.87%)

Mutual labels: speech-recognition, speech-to-text, ctc

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+16.07%)

Mutual labels: speech-recognition, speech-to-text, asr

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-82.62%)

Mutual labels: speech-recognition, speech-to-text, asr

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+77.7%)

Mutual labels: speech-recognition, asr, ctc

Ctcdecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.

Stars: ✭ 529 (+73.44%)

Mutual labels: beam-search, speech-recognition, ctc

Syn Speech

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Stars: ✭ 57 (-81.31%)

Mutual labels: speech-recognition, speech-to-text, asr

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (-32.79%)

Mutual labels: speech-recognition, speech-to-text, asr

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (-41.31%)

Mutual labels: speech-recognition, speech-to-text, asr

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (-91.8%)

Mutual labels: speech-recognition, speech-to-text, asr

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-88.52%)

Mutual labels: speech-recognition, speech-to-text

TS3000 TheChatBOT

Its a social networking chat-bot trained on Reddit dataset . It supports open bounded queries developed on the concept of Neural Machine Translation. Beware of its being sarcastic just like its creator 😝 BDW it uses Pytorch framework and Python3.

Stars: ✭ 20 (-93.44%)

Mutual labels: beam-search, attention-mechanism

UnityASR

Automatic Speech Recognition in Unity.

Stars: ✭ 14 (-95.41%)

Mutual labels: speech-recognition, asr

speechrec

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-94.1%)

Mutual labels: speech-recognition, speech-to-text

Phonetisaurus

Phonetisaurus G2P

Stars: ✭ 277 (-9.18%)

Mutual labels: speech-recognition, speech-to-text

kim-voice-assistant

Kim，你的私人语音助理。

Stars: ✭ 70 (-77.05%)

Mutual labels: speech-recognition, speech-to-text

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-83.61%)

Mutual labels: speech-recognition, speech-to-text

DeepSpeech-API

The code enables users to use Mozilla's Deep Speech model over the Web Browser.

Stars: ✭ 31 (-89.84%)

Mutual labels: speech-recognition, speech-to-text

Inimesed

An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.

Stars: ✭ 65 (-78.69%)

Mutual labels: speech-recognition, speech-to-text

Transformer-Transducer

PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)

Stars: ✭ 61 (-80%)

Mutual labels: end-to-end, speech-recognition

rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (-65.9%)

Mutual labels: speech-recognition, asr

Vosk Android Demo

Offline speech recognition for Android with Vosk library.

Stars: ✭ 271 (-11.15%)

Mutual labels: speech-recognition, asr

Chinese-automatic-speech-recognition

Chinese speech recognition

Stars: ✭ 147 (-51.8%)

Mutual labels: speech-recognition, speech-to-text

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.