Ctcdecoder: Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Stars: ✭ 529 (+828.07%)
Voice Overlay Ios: 🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (+671.93%)
Stephanie Va: Stephanie is an open-source platform built specifically for voice-controlled applications and for automating daily tasks, imitating much of a virtual assistant's work.
Stars: ✭ 772 (+1254.39%)
Wenet: Production First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (+982.46%)
Awesome Kaldi: This is a list of features, scripts, blogs and resources for better using Kaldi (http://kaldi-asr.org/)
Stars: ✭ 393 (+589.47%)
Wavenet Stt: An end-to-end speech recognition system with Wavenet. Built using C++ and Python.
Stars: ✭ 18 (-68.42%)
Pncc: An implementation of Power Normalized Cepstral Coefficients (PNCC)
Stars: ✭ 40 (-29.82%)
Neural sp: End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+615.79%)
Annyang: 💬 Speech recognition for your site
Stars: ✭ 6,216 (+10805.26%)
Libreasr: 💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (+1010.53%)
Subsync: Subtitle Speech Synchronizer
Stars: ✭ 379 (+564.91%)
Kaldi Gstreamer Server: Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.
Stars: ✭ 935 (+1540.35%)
Avsr Deep Speech: Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab
Stars: ✭ 43 (-24.56%)
Java Speech Api: The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+759.65%)
Kur: Descriptive Deep Learning
Stars: ✭ 811 (+1322.81%)
Keras Sincnet: Keras (TensorFlow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-17.54%)
Specaugment: An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain
Stars: ✭ 408 (+615.79%)
Pykaldi: A Python wrapper for Kaldi
Stars: ✭ 756 (+1226.32%)
Ctcwordbeamsearch: Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (+598.25%)
Discordspeechbot: A speech-to-text bot for Discord with music commands and more, using Node.js. Ideal for controlling your Discord server with voice commands; can also be useful for hearing-impaired people.
Stars: ✭ 35 (-38.6%)
Cheetah: On-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+571.93%)
Wav2letter: Facebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 5,907 (+10263.16%)
Speech Emotion Analyzer: The neural network model is capable of detecting five different male/female emotions from audio speech. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+1010.53%)
Alan Sdk Web: Alan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.
Stars: ✭ 368 (+545.61%)
Rhasspy: Rhasspy voice assistant for offline home automation
Stars: ✭ 851 (+1392.98%)
Vad: Voice activity detection (VAD) toolkit including DNN-, bDNN-, LSTM- and ACAM-based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (+991.23%)
Formant Analyzer: iOS application for finding formants in spoken sounds
Stars: ✭ 43 (-24.56%)
Athena: An open-source implementation of a sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+850.88%)
Assistant Client: A tool for testing and debugging CanvasApps with the "Salute" family of virtual assistants
Stars: ✭ 26 (-54.39%)
Sonus: 💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+833.33%)
Silero Models: Pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+815.79%)
Speechpy: 💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Stars: ✭ 833 (+1361.4%)
Mycroft Precise: A lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (+743.86%)
Artyom.js: A voice control, voice commands, speech recognition and speech synthesis JavaScript library. Create your own Siri, Google Now or Cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+1673.68%)
Rhasspy: Offline private voice assistant for many human languages
Stars: ✭ 458 (+703.51%)
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (+1317.54%)
Uspeech: Speech recognition toolkit for the Arduino
Stars: ✭ 448 (+685.96%)
Iflytek awaken asr: Use iFlytek's technology to implement wake-up and command recognition
Stars: ✭ 53 (-7.02%)
Asrt speechrecognition: A Deep-Learning-Based Chinese Speech Recognition System
Stars: ✭ 4,943 (+8571.93%)
Sincnet: SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+1240.35%)
Rhino: On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (+612.28%)
Voice: 🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
Stars: ✭ 993 (+1642.11%)
Tensorflowasr: ⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in TensorFlow 2. Supports languages that can use characters or subwords
Stars: ✭ 400 (+601.75%)
Eesen: The official repository of the Eesen project
Stars: ✭ 738 (+1194.74%)
Py Nltools: A collection of basic Python modules for spoken natural language processing
Stars: ✭ 46 (-19.3%)
Nmtpytorch: Sequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (+587.72%)
Adapt: Adapt Intent Parser
Stars: ✭ 690 (+1110.53%)
Zamia Speech: Open tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+556.14%)
Awesome Diarization: A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+1080.7%)
Biglittlenet: Official repository for Big-Little Net
Stars: ✭ 57 (+0%)
Parrots: Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engine for Chinese.
Stars: ✭ 48 (-15.79%)
Cortex M Kws: Cortex M KWS example with Tengine Lite.
Stars: ✭ 45 (-21.05%)
Arvutaja: An Android app for voice actions in Estonian and English
Stars: ✭ 28 (-50.88%)
Speech recognition: Speech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+10424.56%)