Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (+205.56%)

Mutual labels: speech-recognition, speech-to-text

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (+184.72%)

Mutual labels: speech-recognition, speech-to-text

Speech recognition with tensorflow

Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.

Stars: ✭ 253 (+251.39%)

Mutual labels: speech-recognition, speech-to-text

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (+172.22%)

Mutual labels: speech-recognition, speech-to-text

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-70.83%)

Mutual labels: speech-recognition, speech-to-text

K6nele

An Android app that offers speech-to-text services and user interfaces to other apps

Stars: ✭ 196 (+172.22%)

Mutual labels: speech-recognition, speech-to-text

web-voice-processor

A library for real-time voice processing in web browsers

Stars: ✭ 69 (-4.17%)

Mutual labels: speech-recognition, speech-to-text

revai-java-sdk

Rev.ai Java SDK

Stars: ✭ 16 (-77.78%)

Mutual labels: speech-recognition, speech-to-text

SpeechToText

Speech To Text in Android

Stars: ✭ 53 (-26.39%)

Mutual labels: speech-recognition, speech-to-text

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (+23.61%)

Mutual labels: speech-recognition, speech-to-text

htk

HTK Toolkit with Linux 64 bit and Docker support

Stars: ✭ 14 (-80.56%)

Mutual labels: speech-recognition, speech-to-text

Awesome Kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (+445.83%)

Mutual labels: speech-recognition, speech-to-text

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (+625%)

Mutual labels: speech-recognition, speech-to-text

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Stars: ✭ 1,378 (+1813.89%)

Mutual labels: speech-recognition, speech-to-text

DeepSpeech-API

The code enables users to use Mozilla's Deep Speech model over the Web Browser.

Stars: ✭ 31 (-56.94%)

Mutual labels: speech-recognition, speech-to-text

rnnt decoder cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

Stars: ✭ 60 (-16.67%)

Mutual labels: speech-recognition, speech-to-text

kim-voice-assistant

Kim，你的私人语音助理。

Stars: ✭ 70 (-2.78%)

Mutual labels: speech-recognition, speech-to-text

scripty

Speech to text bot for Discord using Mozilla's DeepSpeech

Stars: ✭ 14 (-80.56%)

Mutual labels: speech-recognition, speech-to-text

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-80.56%)

Mutual labels: speech-recognition, speech-to-text

musicologist

Music advice from a conversational interface powered by Algolia

Stars: ✭ 19 (-73.61%)

Mutual labels: speech-recognition, speech-to-text

demo vietasr

Vietnamese Speech Recognition

Stars: ✭ 22 (-69.44%)

Mutual labels: speech-recognition, speech-to-text

Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

Stars: ✭ 1,120 (+1455.56%)

Mutual labels: speech-recognition, speech-to-text

speechrec

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-75%)

Mutual labels: speech-recognition, speech-to-text

revai-node-sdk

Node.js SDK for the Rev AI API

Stars: ✭ 21 (-70.83%)

Mutual labels: speech-recognition, speech-to-text

cobra

On-device voice activity detection (VAD) powered by deep learning.

Stars: ✭ 76 (+5.56%)

Mutual labels: voice-recognition, speech-recognition

speech-to-text-code-pattern

React app using the Watson Speech to Text service to transform voice audio into written text.

Stars: ✭ 37 (-48.61%)

Mutual labels: speech-recognition, speech-to-text

Deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+25844.44%)

Mutual labels: speech-recognition, speech-to-text

deepspeech

A PyTorch implementation of DeepSpeech and DeepSpeech2.

Stars: ✭ 45 (-37.5%)

Mutual labels: speech-recognition, speech-to-text

houndify-sdk-go

The official Houndify SDK for Go

Stars: ✭ 23 (-68.06%)

Mutual labels: voice-recognition, speech-recognition

speech-to-text

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras