All Projects → deep_avsr → Similar Projects or Alternatives

369 Open source projects that are alternatives of or similar to deep_avsr

leopard
On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+240.38%)
demo vietasr
Vietnamese Speech Recognition
Stars: ✭ 22 (-78.85%)
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+18.27%)
kaldi-long-audio-alignment
Long audio alignment using Kaldi
Stars: ✭ 21 (-79.81%)
Vosk
VOSK Speech Recognition Toolkit
Stars: ✭ 182 (+75%)
AmazonSpeechTranslator
End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-51.92%)
Chinese-automatic-speech-recognition
Chinese speech recognition
Stars: ✭ 147 (+41.35%)
scripty
Speech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-86.54%)
K6nele
An Android app that offers speech-to-text services and user interfaces to other apps
Stars: ✭ 196 (+88.46%)
Rnn ctc
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (+111.54%)
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-74.04%)
DeepSpeech-API
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (-70.19%)
Speech To Text Russian
Проект для распознавания речи на русском языке на основе pykaldi.
Stars: ✭ 151 (+45.19%)
2018-dlsl
UPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (-82.69%)
Tacotron asr
Speech Recognition Using Tacotron
Stars: ✭ 165 (+58.65%)
Lingvo
Lingvo
Stars: ✭ 2,361 (+2170.19%)
Dictate.js
A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Stars: ✭ 195 (+87.5%)
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+97.12%)
Tensorflow Speech Recognition
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Stars: ✭ 2,118 (+1936.54%)
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (-49.04%)
speechmatics-python
Python library and CLI for Speechmatics
Stars: ✭ 24 (-76.92%)
obvi
A Polymer 3+ webcomponent / button for doing speech recognition
Stars: ✭ 54 (-48.08%)
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (-14.42%)
revai-python-sdk
Rev AI Python SDK
Stars: ✭ 35 (-66.35%)
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+72.12%)
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+708.65%)
Zzz Retired openstt
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (+40.38%)
Speechrecognizerbutton
UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.
Stars: ✭ 144 (+38.46%)
Hey Jetson
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (+54.81%)
Go Astideepspeech
Golang bindings for Mozilla's DeepSpeech speech-to-text library
Stars: ✭ 137 (+31.73%)
Deepspeech Server
A testing server for a speech to text service based on mozilla deepspeech
Stars: ✭ 176 (+69.23%)
Naomi
The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (+64.42%)
rnnt decoder cuda
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (-42.31%)
Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (+27.88%)
speechrec
a simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-82.69%)
Automatic Speech Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
Stars: ✭ 192 (+84.62%)
Unity live caption
Use Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-75%)
Voice Overlay Android
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+81.73%)
hf-experiments
Experiments with Hugging Face 🔬 🤗
Stars: ✭ 37 (-64.42%)
Kaldi Active Grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (+88.46%)
web-speech-cognitive-services
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-66.35%)
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (+23.08%)
megs
A merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-79.81%)
Automatic speech recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 2,751 (+2545.19%)
speech-recognition-evaluation
Evaluate results from ASR/Speech-to-Text quickly
Stars: ✭ 25 (-75.96%)
Speech recognition with tensorflow
Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Stars: ✭ 253 (+143.27%)
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+97.12%)
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-79.81%)
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (-49.04%)
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+132.69%)
kaldi helpers
🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-87.5%)
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+2192.31%)
web-voice-processor
A library for real-time voice processing in web browsers
Stars: ✭ 69 (-33.65%)
React.ai
It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (-63.46%)
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-86.54%)
octopus
On-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-71.15%)
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+10622.12%)
Tensorflow Ctc Speech Recognition
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
Stars: ✭ 127 (+22.12%)
Nemo
NeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+3443.27%)
revai-java-sdk
Rev.ai Java SDK
Stars: ✭ 16 (-84.62%)
1-60 of 369 similar projects