Wav2letter.pytorchA fully convolution-network for speech-to-text, built on pytorch.
Stars: ✭ 104 (+44.44%)
Go AstideepspeechGolang bindings for Mozilla's DeepSpeech speech-to-text library
Stars: ✭ 137 (+90.28%)
SwiftspeechA speech recognition framework designed for SwiftUI.
Stars: ✭ 149 (+106.94%)
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (+137.5%)
Tacotron asrSpeech Recognition Using Tacotron
Stars: ✭ 165 (+129.17%)
Hey JetsonDeep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (+123.61%)
vosk-asteriskSpeech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-27.78%)
AdaptAdapt Intent Parser
Stars: ✭ 690 (+858.33%)
speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-51.39%)
Tensorflowasr⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (+455.56%)
LingvoLingvo
Stars: ✭ 2,361 (+3179.17%)
Speech And TextSpeech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Stars: ✭ 102 (+41.67%)
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (+205.56%)
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+184.72%)
Speech recognition with tensorflowImplementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Stars: ✭ 253 (+251.39%)
Kaldi Active GrammarPython Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (+172.22%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-70.83%)
K6neleAn Android app that offers speech-to-text services and user interfaces to other apps
Stars: ✭ 196 (+172.22%)
web-voice-processorA library for real-time voice processing in web browsers
Stars: ✭ 69 (-4.17%)
simple-obs-sttSpeech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+23.61%)
htkHTK Toolkit with Linux 64 bit and Docker support
Stars: ✭ 14 (-80.56%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+445.83%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+625%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+1813.89%)
DeepSpeech-APIThe code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (-56.94%)
rnnt decoder cudaAn efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (-16.67%)
scriptySpeech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-80.56%)
kaldi ag trainingDocker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-80.56%)
musicologistMusic advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-73.61%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (-69.44%)
Dragonfirethe open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+1455.56%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-75%)
cobraOn-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (+5.56%)
speech-to-text-code-patternReact app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-48.61%)
DeepspeechDeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+25844.44%)
deepspeechA PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-37.5%)
speech-to-textmixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-15.28%)
leon🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+11788.89%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-27.78%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+70.83%)
React MicRecord audio from a user's microphone and display a cool visualization.
Stars: ✭ 323 (+348.61%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-51.39%)
OpenasrA pytorch based end2end speech recognition system.
Stars: ✭ 69 (-4.17%)
Patterspeech-to-text in pytorch
Stars: ✭ 71 (-1.39%)
Angle⦠ Angle: new speakable syntax for python 💡
Stars: ✭ 61 (-15.28%)