Speech To Text Russian
A project for Russian speech recognition based on pykaldi.
Stars: ✭ 151 (-87.61%)
Vad
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (-48.97%)
Artyom.js
A JavaScript library for voice control, voice commands, speech recognition, and speech synthesis. Create your own Siri, Google Now, or Cortana with Google Chrome within your website.
Stars: ✭ 1,011 (-17.06%)
Zzz Retired openstt
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (-88.02%)
Voice Overlay Android
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (-84.5%)
Tensorflow Speech Recognition
🎙 Speech recognition using the TensorFlow deep learning framework and sequence-to-sequence neural networks
Stars: ✭ 2,118 (+73.75%)
kaldi helpers
🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-98.93%)
Vosk
VOSK Speech Recognition Toolkit
Stars: ✭ 182 (-85.07%)
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (-57.18%)
Speechrecognizerbutton
UIButton subclass with push-to-talk recording, speech recognition, and a Siri-style waveform view.
Stars: ✭ 144 (-88.19%)
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-97.79%)
Stephanie Va
Stephanie is an open-source platform built specifically for voice-controlled applications and for automating daily tasks, imitating much of a virtual assistant's work.
Stars: ✭ 772 (-36.67%)
Speech recognition with tensorflow
Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Stars: ✭ 253 (-79.25%)
speech-to-text-code-pattern
React app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-96.96%)
Nemo
NeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+202.3%)
vosk-asterisk
Speech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-95.73%)
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (-81.62%)
Pykaldi
A Python wrapper for Kaldi
Stars: ✭ 756 (-37.98%)
Voice Overlay Ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (-63.9%)
Rnn ctc
Recurrent Neural Network and Long Short-Term Memory (LSTM) with Connectionist Temporal Classification, implemented in Theano. Includes a toy training example.
Stars: ✭ 220 (-81.95%)
Asrt speechrecognition
A Deep-Learning-Based Chinese Speech Recognition System
Stars: ✭ 4,943 (+305.5%)
Rhino
On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (-66.69%)
Go Astideepspeech
Golang bindings for Mozilla's DeepSpeech speech-to-text library
Stars: ✭ 137 (-88.76%)
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-95.73%)
speech-to-text
Mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-95%)
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (-39.46%)
web-speech-cognitive-services
Polyfill for the Web Speech API using Cognitive Services Bing Speech for both speech-to-text and text-to-speech.
Stars: ✭ 35 (-97.13%)
rnnt decoder cuda
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (-95.08%)
Inimesed
An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (-94.67%)
Soloud
Free, easy, portable audio engine for games
Stars: ✭ 1,048 (-14.03%)
Watbot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-94.75%)
Unity live caption
Use the Google Speech-to-Text API for real-time live-stream captioning in Unity! Works best when combined with your virtual character!
Stars: ✭ 26 (-97.87%)
deep avsr
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (-91.47%)
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (-31.01%)
htk
HTK Toolkit with 64-bit Linux and Docker support
Stars: ✭ 14 (-98.85%)
Adapt
Adapt Intent Parser
Stars: ✭ 690 (-43.4%)
Css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
Stars: ✭ 302 (-75.23%)
Deepspeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
Stars: ✭ 18,680 (+1432.4%)
voce-browser
Voice Controlled Chromium Web Browser
Stars: ✭ 34 (-97.21%)
leon
🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+602.21%)
deepspeech
A PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-96.31%)
demo vietasr
Vietnamese Speech Recognition
Stars: ✭ 22 (-98.2%)
Allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (-88.93%)
React.ai
Recognizes your speech, and a trained AI bot responds (e.g., customer service, personal assistant) using a machine learning API (DialogFlow, apiai), speech recognition, GraphQL, Next.js, React, and Redux.
Stars: ✭ 38 (-96.88%)
Pocketsphinx Python
Python interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (-75.55%)
musicologist
Music advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-98.44%)