Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (+266.67%)

Mutual labels: speech-recognition, speech-to-text

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (+241.67%)

Mutual labels: speech-recognition, speech-to-text

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+303.33%)

Mutual labels: speech-recognition, speech-to-text

web-voice-processor

A library for real-time voice processing in web browsers

Stars: ✭ 69 (+15%)

Mutual labels: speech-recognition, speech-to-text

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (-58.33%)

Mutual labels: speech-recognition, speech-to-text

Wav2letter

Speech Recognition model based off of FAIR research paper built using Pytorch.

Stars: ✭ 78 (+30%)

Mutual labels: speech-recognition, speech-to-text

Patter

speech-to-text in pytorch

Stars: ✭ 71 (+18.33%)

Mutual labels: speech-recognition, speech-to-text

B.e.n.j.i.

B.E.N.J.I.- The Impossible Missions Force's digital assistant

Stars: ✭ 83 (+38.33%)

Mutual labels: speech-recognition, speech-to-text

Deepspeech Websocket Server

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

Stars: ✭ 79 (+31.67%)

Mutual labels: speech-recognition, speech-to-text

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Stars: ✭ 1,378 (+2196.67%)

Mutual labels: speech-recognition, speech-to-text

Openasr

A pytorch based end2end speech recognition system.

Stars: ✭ 69 (+15%)

Mutual labels: speech-recognition, speech-to-text

Self Supervised Speech Recognition

speech to text with self-supervised learning based on wav2vec 2.0 framework

Stars: ✭ 106 (+76.67%)

Mutual labels: speech-recognition, speech-to-text

Wav2letter.pytorch

A fully convolution-network for speech-to-text, built on pytorch.

Stars: ✭ 104 (+73.33%)

Mutual labels: speech-recognition, speech-to-text

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+18485%)

Mutual labels: speech-recognition, speech-to-text

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (+71.67%)

Mutual labels: speech-recognition, speech-to-text

Go Astideepspeech

Golang bindings for Mozilla's DeepSpeech speech-to-text library

Stars: ✭ 137 (+128.33%)

Mutual labels: speech-recognition, speech-to-text

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (+121.67%)

Mutual labels: speech-recognition, speech-to-text

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-65%)

Mutual labels: speech-recognition, speech-to-text

Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

Stars: ✭ 1,120 (+1766.67%)

Mutual labels: speech-recognition, speech-to-text

React.ai

It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux

Stars: ✭ 38 (-36.67%)

Mutual labels: speech-recognition, speech-to-text

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Stars: ✭ 161 (+168.33%)

Mutual labels: speech-recognition, speech-to-text

octopus

On-device speech-to-index engine powered by deep learning.

Stars: ✭ 30 (-50%)

Mutual labels: speech-recognition, speech-to-text

Speech To Text Russian

Проект для распознавания речи на русском языке на основе pykaldi.

Stars: ✭ 151 (+151.67%)

Mutual labels: speech-recognition, speech-to-text

Vosk

VOSK Speech Recognition Toolkit

Stars: ✭ 182 (+203.33%)

Mutual labels: speech-recognition, speech-to-text

Deepspeech Server

A testing server for a speech to text service based on mozilla deepspeech

Stars: ✭ 176 (+193.33%)

Mutual labels: speech-recognition, speech-to-text

Automatic Speech Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

Stars: ✭ 192 (+220%)

Mutual labels: speech-recognition, speech-to-text

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

Stars: ✭ 146 (+143.33%)

Mutual labels: speech-recognition, speech-to-text

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (+226.67%)

Mutual labels: speech-recognition, speech-to-text

K6nele

An Android app that offers speech-to-text services and user interfaces to other apps

Stars: ✭ 196 (+226.67%)

Mutual labels: speech-recognition, speech-to-text

Nemo

NeMo: a toolkit for conversational AI

Stars: ✭ 3,685 (+6041.67%)

Mutual labels: speech-recognition, speech-to-text

Dictate.js

A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

Stars: ✭ 195 (+225%)

Mutual labels: speech-recognition, speech-to-text

speechrec

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-70%)

Mutual labels: speech-recognition, speech-to-text

Audio Pretrained Model

A collection of Audio and Speech pre-trained models.

Stars: ✭ 61 (+1.67%)

Mutual labels: speech-recognition, speech-to-text

Angle

⦠ Angle: new speakable syntax for python 💡