Avsr Deep SpeechGoogle Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab
Artyom.jsA voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
PnccA implementation of Power Normalized Cepstral Coefficients: PNCC
Voice🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
DiscordspeechbotA speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
ArvutajaAn Android app for voice actions in Estonian and English
RhasspyRhasspy voice assistant for offline home automation
Kaldi Gstreamer ServerReal-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
Assistant ClientИнструмент для тестирования и отладки СanvasApps c семейством Виртуальных Ассистентов "Салют"
Wavenet SttAn end-to-end speech recognition system with Wavenet. Built using C++ and python.
Speechpy💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
KurDescriptive Deep Learning
EspressoEspresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stephanie VaStephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
EesenThe official repository of the Eesen project
Annyang💬 Speech recognition for your site
Wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Awesome DiarizationA curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Speech recognitionSpeech recognition module for Python, supporting several engines and APIs, online and offline.
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Libreasr💬 An On-Premises, Streaming Speech Recognition System
VadVoice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
WenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
CtcdecoderConnectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
RhasspyOffline private voice assistant for many human languages
UspeechSpeech recognition toolkit for the arduino
Voice Overlay Ios🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
SpecaugmentA Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
RhinoOn-device speech-to-intent engine powered by deep learning
Neural spEnd-to-end ASR/LM implementation with PyTorch
Tensorflowasr⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
CtcwordbeamsearchConnectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
NmtpytorchSequence-to-Sequence Framework in PyTorch
CheetahOn-device streaming speech-to-text engine powered by deep learning
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
SubsyncSubtitle Speech Synchronizer
Alan Sdk WebAlan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.
EspnetEnd-to-End Speech Processing Toolkit
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
BrevitasBrevitas: quantization-aware training in PyTorch
DeepspeechDeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.