Go AstideepspeechGolang bindings for Mozilla's DeepSpeech speech-to-text library
Stars: ✭ 137 (+29.25%)
Tensorflow Speech Recognition🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Stars: ✭ 2,118 (+1898.11%)
OpenasrA pytorch based end2end speech recognition system.
Stars: ✭ 69 (-34.91%)
Dragonfirethe open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+956.6%)
Dictate.jsA small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Stars: ✭ 195 (+83.96%)
LingvoLingvo
Stars: ✭ 2,361 (+2127.36%)
Kaldi Active GrammarPython Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (+84.91%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+3376.42%)
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (+107.55%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-80.19%)
Asr audio data linksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (+20.75%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+68.87%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+93.4%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+233.96%)
React.aiIt recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (-64.15%)
octopusOn-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-71.7%)
simple-obs-sttSpeech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (-16.04%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-83.02%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-66.98%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-52.83%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-80.19%)
InimesedAn Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (-38.68%)
Tensorflow Ctc Speech RecognitionApplication of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
Stars: ✭ 127 (+19.81%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-22.64%)
deep avsrA PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (-1.89%)
speech-to-text-code-patternReact app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-65.09%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-74.53%)
deepspeechA PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-57.55%)
speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-66.98%)
Artyom.jsA voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+853.77%)
htkHTK Toolkit with Linux 64 bit and Docker support
Stars: ✭ 14 (-86.79%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+16.04%)
speech-to-textmixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-42.45%)
L2cLearning to Cluster. A deep clustering strategy.
Stars: ✭ 262 (+147.17%)
DeepspeechDeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+17522.64%)
Unity live captionUse Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-75.47%)
Asrt speechrecognitionA Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+4563.21%)
RhinoOn-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (+283.02%)
Tensorflowasr⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (+277.36%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+401.89%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+392.45%)
Speech recognitionSpeech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+5559.43%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+270.75%)
KalliopeKalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (+1323.58%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+10419.81%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+693.4%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+261.32%)
AdaptAdapt Intent Parser
Stars: ✭ 690 (+550.94%)
KurDescriptive Deep Learning
Stars: ✭ 811 (+665.09%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-2.83%)