A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

Stars: ✭ 195 (+83.96%)

Mutual labels: speech-recognition, speech-to-text

Lingvo

Stars: ✭ 2,361 (+2127.36%)

Mutual labels: speech-recognition, speech-to-text

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (+84.91%)

Mutual labels: speech-recognition, speech-to-text

Automatic Speech Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

Stars: ✭ 192 (+81.13%)

Mutual labels: speech-recognition, speech-to-text

Nemo

NeMo: a toolkit for conversational AI

Stars: ✭ 3,685 (+3376.42%)

Mutual labels: speech-recognition, speech-to-text

Rnn ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (+107.55%)

Mutual labels: speech-recognition, speech-to-text

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-80.19%)

Mutual labels: speech-recognition, speech-to-text

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (+20.75%)

Mutual labels: speech-recognition, speech-to-text

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+68.87%)

Mutual labels: speech-recognition, speech-to-text

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (+93.4%)

Mutual labels: speech-recognition, speech-to-text

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-50%)

Mutual labels: speech-recognition, speech-to-text

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+233.96%)

Mutual labels: speech-recognition, speech-to-text

React.ai

It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux

Stars: ✭ 38 (-64.15%)

Mutual labels: speech-recognition, speech-to-text

octopus

On-device speech-to-index engine powered by deep learning.

Stars: ✭ 30 (-71.7%)

Mutual labels: speech-recognition, speech-to-text

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (-16.04%)

Mutual labels: speech-recognition, speech-to-text

revai-python-sdk

Rev AI Python SDK

Stars: ✭ 35 (-66.98%)

Mutual labels: speech-recognition, speech-to-text

speechrec

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-83.02%)

Mutual labels: speech-recognition, speech-to-text

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-66.98%)

Mutual labels: speech-recognition, speech-to-text

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-52.83%)

Mutual labels: speech-recognition, speech-to-text

speechmatics-python

Python library and CLI for Speechmatics

Stars: ✭ 24 (-77.36%)

Mutual labels: speech-recognition, speech-to-text

PCPM

Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.

Stars: ✭ 21 (-80.19%)

Mutual labels: speech-recognition, speech-to-text

Inimesed

An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.

Stars: ✭ 65 (-38.68%)

Mutual labels: speech-recognition, speech-to-text

Chinese-automatic-speech-recognition

Chinese speech recognition

Stars: ✭ 147 (+38.68%)

Mutual labels: speech-recognition, speech-to-text

Tensorflow Ctc Speech Recognition

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

Stars: ✭ 127 (+19.81%)

Mutual labels: speech-recognition, speech-to-text

deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (-22.64%)

Mutual labels: speech-recognition, speech-to-text

deep avsr

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Stars: ✭ 104 (-1.89%)

Mutual labels: speech-recognition, speech-to-text

speech-to-text-code-pattern

React app using the Watson Speech to Text service to transform voice audio into written text.

Stars: ✭ 37 (-65.09%)

Mutual labels: speech-recognition, speech-to-text

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-74.53%)

Mutual labels: speech-recognition, speech-to-text

deepspeech

A PyTorch implementation of DeepSpeech and DeepSpeech2.

Stars: ✭ 45 (-57.55%)

Mutual labels: speech-recognition, speech-to-text

speech to text

how to use the Google Cloud Speech API to transcribe audio/video files.

Stars: ✭ 35 (-66.98%)

Mutual labels: speech-recognition, speech-to-text

Artyom.js

A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.

Stars: ✭ 1,011 (+853.77%)

Mutual labels: speech-recognition, speech-to-text

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (-80.19%)

Mutual labels: speech-recognition, speech-to-text

htk

HTK Toolkit with Linux 64 bit and Docker support

Stars: ✭ 14 (-86.79%)

Mutual labels: speech-recognition, speech-to-text

SpeechToText

Speech To Text in Android

Stars: ✭ 53 (-50%)

Mutual labels: speech-recognition, speech-to-text

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (+16.04%)

Mutual labels: speech-recognition, speech-to-text

speech-to-text

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

Stars: ✭ 61 (-42.45%)

Mutual labels: speech-recognition, speech-to-text

Phonetisaurus

Phonetisaurus G2P

Stars: ✭ 277 (+161.32%)

Mutual labels: speech-recognition, speech-to-text

L2c

Learning to Cluster. A deep clustering strategy.

Stars: ✭ 262 (+147.17%)

Mutual labels: unsupervised-learning, semi-supervised-learning

Deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+17522.64%)

Mutual labels: speech-recognition, speech-to-text

Unity live caption

Use Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!

Stars: ✭ 26 (-75.47%)

Mutual labels: speech-recognition, speech-to-text

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Stars: ✭ 4,943 (+4563.21%)

Mutual labels: speech-recognition, speech-to-text

Rhino

On-device speech-to-intent engine powered by deep learning