deepspeechA PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-2.17%)
simple diarizerSimplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Stars: ✭ 26 (-43.48%)
deepspeech.mxnetAn MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+78.26%)
vspeech📢 Complete V bindings for Mozilla's DeepSpeech TensorFlow based Speech-to-Text library. 📜
Stars: ✭ 38 (-17.39%)
DeepSpeech-APICode that lets users run Mozilla's DeepSpeech model from a web browser.
Stars: ✭ 31 (-32.61%)
DeepspeechDeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
Stars: ✭ 18,680 (+40508.7%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+669.57%)
kaldi helpers🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-71.74%)
speech-to-textPython helper for Google and IBM Watson speech-to-text cloud APIs.
Stars: ✭ 14 (-69.57%)
leon🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+18508.7%)
simple-obs-sttSpeech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+93.48%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-54.35%)
React.aiRecognizes your speech and has a trained AI bot respond (e.g. customer service, personal assistant) using a machine learning API (DialogFlow, apiai), speech recognition, GraphQL, Next.js, React, and Redux
Stars: ✭ 38 (-17.39%)
scriptySpeech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-69.57%)
ASR-Audio-Data-LinksA list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+289.13%)
glaemscribeGlaemscribe, the transcription engine for Tolkienian languages and writing systems.
Stars: ✭ 29 (-36.96%)
wav2vec2-liveLive speech recognition using Facebook's wav2vec 2.0 model.
Stars: ✭ 205 (+345.65%)
aws-transcribe-demoA simple AWS demo that uses Amazon Transcribe to convert audio to text and analyse it.
Stars: ✭ 39 (-15.22%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-41.3%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-54.35%)
vave🌊 A crazy simple library for reading/writing WAV files in V. Zero dependencies, 100% cross-platform.
Stars: ✭ 35 (-23.91%)
benchmarksttOpen Source AI Benchmarking toolkit for benchmarking speech to text services
Stars: ✭ 43 (-6.52%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+1728.26%)
octopusOn-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-34.78%)
Generate-Live-TranscriptionThis extension helps to get a real-time transcription of audio playing in the browser using Deep Speech.
Stars: ✭ 16 (-65.22%)
parlatypeGNOME audio player for transcription
Stars: ✭ 151 (+228.26%)
aws-content-analysisThis project is a fully automated video search engine which uses AWS AI services for computer vision and speech recognition to catalog video archives.
Stars: ✭ 67 (+45.65%)
asr2424-hour Automatic Speech Recognition
Stars: ✭ 27 (-41.3%)
deep avsrA PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (+126.09%)
Speech recognition with tensorflowImplementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Stars: ✭ 253 (+450%)
kaldi ag trainingDocker image and scripts for training fine-tuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-69.57%)
anycontrolVoice control for your websites and applications
Stars: ✭ 53 (+15.22%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-54.35%)
InimesedAn Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (+41.3%)
Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+426.09%)
KerasdeepspeechA Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+432.61%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (+8.7%)
Stt🐸STT - a deep learning toolkit for Speech-to-Text, battle-tested in research and production
Stars: ✭ 197 (+328.26%)
Go AstibobGolang framework to build an AI that can understand and speak back to you, and everything else you want
Stars: ✭ 222 (+382.61%)
dataflow-contact-center-speech-analysisSpeech Analysis Framework, a collection of components and code from Google Cloud that you can use to transcribe audio files to create analytics.
Stars: ✭ 46 (+0%)
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (+378.26%)
rnnt decoder cudaAn efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (+30.43%)
EdgedictWorking online speech recognition based on RNN Transducer. (A trained model is available in the releases.)
Stars: ✭ 205 (+345.65%)
Kaldi Active GrammarPython Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (+326.09%)
speechrecA simple speech recognition app using the Web Speech API interfaces
Stars: ✭ 18 (-60.87%)
K6neleAn Android app that offers speech-to-text services and user interfaces to other apps
Stars: ✭ 196 (+326.09%)
Dictate.jsA small JavaScript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Stars: ✭ 195 (+323.91%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-23.91%)
LingvoLingvo
Stars: ✭ 2,361 (+5032.61%)