A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

Stars: ✭ 195 (+786.36%)

Mutual labels: speech-recognition, speech-to-text

rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (+372.73%)

Mutual labels: speech-recognition, asr

lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Stars: ✭ 36 (+63.64%)

Mutual labels: speech-recognition, asr

Deepspeech Websocket Server

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

Stars: ✭ 79 (+259.09%)

Mutual labels: speech-recognition, speech-to-text

Nemo

NeMo: a toolkit for conversational AI

Stars: ✭ 3,685 (+16650%)

Mutual labels: speech-recognition, speech-to-text

Cn2an

📦 快速转化「中文数字」和「阿拉伯数字」～ (最新特性：分数，日期、温度等转化）

Stars: ✭ 249 (+1031.82%)

Mutual labels: speech-recognition, asr

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (+140.91%)

Mutual labels: speech-recognition, speech-to-text

musicologist

Music advice from a conversational interface powered by Algolia

Stars: ✭ 19 (-13.64%)

Mutual labels: speech-recognition, speech-to-text

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+1000%)

Mutual labels: speech-recognition, speech-to-text

Rnn ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (+900%)

Mutual labels: speech-recognition, speech-to-text

Automatic speech recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Stars: ✭ 2,751 (+12404.55%)

Mutual labels: speech-recognition, automatic-speech-recognition

Speech recognition with tensorflow

Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.

Stars: ✭ 253 (+1050%)

Mutual labels: speech-recognition, speech-to-text

wave2vec-recognize-docker

Wave2vec 2.0 Recognize pipeline

Stars: ✭ 30 (+36.36%)

Mutual labels: automatic-speech-recognition, asr

speech to text

how to use the Google Cloud Speech API to transcribe audio/video files.

Stars: ✭ 35 (+59.09%)

Mutual labels: speech-recognition, speech-to-text

revai-python-sdk

Rev AI Python SDK

Stars: ✭ 35 (+59.09%)

Mutual labels: speech-recognition, speech-to-text

kosr

Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)

Stars: ✭ 25 (+13.64%)

Mutual labels: speech-recognition, asr

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-36.36%)

Mutual labels: speech-recognition, speech-to-text

Inimesed

An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.

Stars: ✭ 65 (+195.45%)

Mutual labels: speech-recognition, speech-to-text

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (+790.91%)

Mutual labels: speech-recognition, speech-to-text

obvi

A Polymer 3+ webcomponent / button for doing speech recognition

Stars: ✭ 54 (+145.45%)

Mutual labels: speech-recognition, automatic-speech-recognition

DeepSpeech-API

The code enables users to use Mozilla's Deep Speech model over the Web Browser.

Stars: ✭ 31 (+40.91%)

Mutual labels: speech-recognition, speech-to-text

End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

Stars: ✭ 20 (-9.09%)

Mutual labels: speech-recognition, asr

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (+127.27%)

Mutual labels: speech-recognition, speech-to-text

octopus

On-device speech-to-index engine powered by deep learning.

Stars: ✭ 30 (+36.36%)

Mutual labels: speech-recognition, speech-to-text

React.ai

It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux

Stars: ✭ 38 (+72.73%)

Mutual labels: speech-recognition, speech-to-text

2018-dlsl

UPC Deep Learning for Speech and Language 2018

Stars: ✭ 18 (-18.18%)

Mutual labels: speech-recognition, automatic-speech-recognition

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-4.55%)

Mutual labels: speech-recognition, speech-to-text

rnnt decoder cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.