leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+2428.57%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (+57.14%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+485.71%)
Tensorflow Speech Recognition🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Stars: ✭ 2,118 (+15028.57%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+5907.14%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+778.57%)
simple-obs-sttSpeech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+535.71%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (+50%)
InimesedAn Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (+364.29%)
rnnt decoder cudaAn efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (+328.57%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (+635.71%)
octopusOn-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (+114.29%)
Wav2letter.pytorchA fully convolution-network for speech-to-text, built on pytorch.
Stars: ✭ 104 (+642.86%)
Go AstideepspeechGolang bindings for Mozilla's DeepSpeech speech-to-text library
Stars: ✭ 137 (+878.57%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (+50%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+79550%)
SpeechrecognizerbuttonUIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.
Stars: ✭ 144 (+928.57%)
Hey JetsonDeep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (+1050%)
DeepSpeech-APIThe code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (+121.43%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+1178.57%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+1250%)
LingvoLingvo
Stars: ✭ 2,361 (+16764.29%)
Speech And TextSpeech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Stars: ✭ 102 (+628.57%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+9742.86%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (+50%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+9592.86%)
React.aiIt recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (+171.43%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (+28.57%)
KalliopeKalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (+10678.57%)
Asr audio data linksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (+814.29%)
web-voice-processorA library for real-time voice processing in web browsers
Stars: ✭ 69 (+392.86%)
Tensorflow Ctc Speech RecognitionApplication of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
Stars: ✭ 127 (+807.14%)
Speech To Text RussianПроект для распознавания речи на русском языке на основе pykaldi.
Stars: ✭ 151 (+978.57%)
Zzz Retired opensttRETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (+942.86%)
Tacotron asrSpeech Recognition Using Tacotron
Stars: ✭ 165 (+1078.57%)
B.e.n.j.i.B.E.N.J.I.- The Impossible Missions Force's digital assistant
Stars: ✭ 83 (+492.86%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (+150%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+1364.29%)
VoskVOSK Speech Recognition Toolkit
Stars: ✭ 182 (+1200%)
K6neleAn Android app that offers speech-to-text services and user interfaces to other apps
Stars: ✭ 196 (+1300%)
Dictate.jsA small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Stars: ✭ 195 (+1292.86%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (+257.14%)
Deepspeech ServerA testing server for a speech to text service based on mozilla deepspeech
Stars: ✭ 176 (+1157.14%)
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (+1471.43%)
anycontrolVoice control for your websites and applications
Stars: ✭ 53 (+278.57%)
Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+1628.57%)
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+1364.29%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+26221.43%)
Speech recognition with tensorflowImplementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Stars: ✭ 253 (+1707.14%)
DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+8607.14%)
Deepspeech Websocket ServerServer & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
Stars: ✭ 79 (+464.29%)