Eesen: The official repository of the Eesen project
Stars: ✭ 738 (-85.07%)
Rnn ctc: Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a toy training example.
Stars: ✭ 220 (-95.55%)
Tensorflowasr: ⚡️ TensorFlowASR: Almost state-of-the-art automatic speech recognition in TensorFlow 2. Supports languages that can use characters or subwords.
Stars: ✭ 400 (-91.91%)
Tensorflow Ctc Speech Recognition: Application of Connectionist Temporal Classification (CTC) to speech recognition (TensorFlow 1.0, but compatible with 2.0).
Stars: ✭ 127 (-97.43%)
Cheetah: On-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (-92.25%)
Ctcwordbeamsearch: Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (-91.95%)
Inimesed: An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (-98.69%)
web-voice-processor: A library for real-time voice processing in web browsers
Stars: ✭ 69 (-98.6%)
octopus: On-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-99.39%)
React.ai: Recognizes your speech, and a trained AI bot responds (e.g. customer service, personal assistant) using a machine learning API (DialogFlow, apiai), speech recognition, GraphQL, Next.js, React, and Redux
Stars: ✭ 38 (-99.23%)
AmazonSpeechTranslator: End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-98.99%)
ctc-asr: End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (-97.73%)
scripty: Speech-to-text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-99.72%)
PCPM: Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-99.58%)
open-speech-corpora: 💎 A list of accessible speech corpora for ASR, TTS, and other speech technologies
Stars: ✭ 841 (-82.99%)
Rhino: On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (-91.79%)
Awesome Kaldi: A list of features, scripts, blogs, and resources for making better use of Kaldi (http://kaldi-asr.org/)
Stars: ✭ 393 (-92.05%)
deepspeech: A PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-99.09%)
vosk-asterisk: Speech recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-98.95%)
voce-browser: Voice-controlled Chromium web browser
Stars: ✭ 34 (-99.31%)
Neural sp: End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (-91.75%)
ASR-Audio-Data-Links: A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (-96.38%)
simple-obs-stt: Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (-98.2%)
KeenASR-Android-PoC: A proof-of-concept app using the KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-99.58%)
wav2vec2-live: Live speech recognition using Facebook's wav2vec 2.0 model.
Stars: ✭ 205 (-95.85%)
rnnt decoder cuda: An efficient implementation of RNN-T prefix beam search in C++/CUDA.
Stars: ✭ 60 (-98.79%)
speechrec: A simple speech recognition app using the Web Speech API interfaces
Stars: ✭ 18 (-99.64%)
DeepSpeech-API: Code that lets users run Mozilla's DeepSpeech model from a web browser.
Stars: ✭ 31 (-99.37%)
web-speech-cognitive-services: Polyfill for the Web Speech API using Cognitive Services Bing Speech for both speech-to-text and text-to-speech.
Stars: ✭ 35 (-99.29%)
kaldi ag training: Docker image and scripts for training fine-tuned or completely personal Kaldi speech models, particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-99.72%)
Unity live caption: Use the Google Speech-to-Text API for real-time live-stream captioning in Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-99.47%)
leopard: On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (-92.84%)
Deepspeech: DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
Stars: ✭ 18,680 (+277.91%)
deepspeech.mxnet: An MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-98.34%)
speech-to-text-code-pattern: React app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-99.25%)
deep avsr: A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (-97.9%)
speech to text: How to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-99.29%)
spokestack-ios: Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-99.45%)
htk: HTK Toolkit with Linux 64-bit and Docker support
Stars: ✭ 14 (-99.72%)
leon: 🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+73.17%)
sova-asr: SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-97.51%)
musicologist: Music advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-99.62%)
Nmtpytorch: Sequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (-92.07%)
demo vietasr: Vietnamese Speech Recognition
Stars: ✭ 22 (-99.55%)
anycontrol: Voice control for your websites and applications
Stars: ✭ 53 (-98.93%)