Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (-34.94%)
Kaldi Active GrammarPython Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (-90.75%)
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (-89.61%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (-91.08%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-99.01%)
Speech recognition with tensorflowImplementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Stars: ✭ 253 (-88.05%)
AdaptAdapt Intent Parser
Stars: ✭ 690 (-67.42%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (-91.55%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-99.01%)
Tensorflowasr⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (-81.11%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (-75.35%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+193.48%)
Hey JetsonDeep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (-92.4%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-98.35%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-99.01%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-97.64%)
rnnt decoder cudaAn efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (-97.17%)
DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (-42.45%)
React.aiIt recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (-98.21%)
Wav2letterSpeech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (-96.32%)
kaldi ag trainingDocker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-99.34%)
KurDescriptive Deep Learning
Stars: ✭ 811 (-61.71%)
Stephanie VaStephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: ✭ 772 (-63.55%)
deep avsrA PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (-95.09%)
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (-91.93%)
Artyom.jsA voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: ✭ 1,011 (-52.27%)
Patterspeech-to-text in pytorch
Stars: ✭ 71 (-96.65%)
speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-98.35%)
voce-browserVoice Controlled Chromium Web Browser
Stars: ✭ 34 (-98.39%)
vosk-asteriskSpeech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-97.54%)
speech-to-textmixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-97.12%)
Angle⦠ Angle: new speakable syntax for python 💡
Stars: ✭ 61 (-97.12%)
speech-to-text-code-patternReact app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-98.25%)
Syn SpeechSyn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-97.31%)
musicologistMusic advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-99.1%)
VoskVOSK Speech Recognition Toolkit
Stars: ✭ 182 (-91.41%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (-81.92%)
DeepspeechDeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+781.96%)
Tacotron asrSpeech Recognition Using Tacotron
Stars: ✭ 165 (-92.21%)
Voice Overlay Ios🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (-79.23%)
Asrt speechrecognitionA Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+133.38%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-76.86%)
RhinoOn-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (-80.83%)
Speech recognitionSpeech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+183.24%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-74.88%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-81.44%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (-65.16%)
DiscordspeechbotA speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-98.35%)
Go AstideepspeechGolang bindings for Mozilla's DeepSpeech speech-to-text library
Stars: ✭ 137 (-93.53%)
AllosaurusAllosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (-93.63%)
Kaldi GopComputes the GMM-based Goodness of Pronunciation (GOP). Bases on Kaldi.
Stars: ✭ 104 (-95.09%)