All Categories → Machine Learning → speech-recognition

Top 326 speech-recognition open source projects

Cortex M Kws
Cortex M KWS example with Tengine Lite.
Formant Analyzer
iOS application for finding formants in spoken sounds
Avsr Deep Speech
Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab
Artyom.js
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Pncc
A implementation of Power Normalized Cepstral Coefficients: PNCC
Voice
🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
Discordspeechbot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Arvutaja
An Android app for voice actions in Estonian and English
Rhasspy
Rhasspy voice assistant for offline home automation
Kaldi Gstreamer Server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
Assistant Client
Инструмент для тестирования и отладки СanvasApps c семейством Виртуальных Ассистентов "Салют"
Wavenet Stt
An end-to-end speech recognition system with Wavenet. Built using C++ and python.
Speechpy
💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stephanie Va
Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Eesen
The official repository of the Eesen project
Wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Awesome Diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Speech recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Speech Emotion Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Libreasr
💬 An On-Premises, Streaming Speech Recognition System
Vad
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Athena
an open-source implementation of sequence-to-sequence based speech processing engine
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Ctcdecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Rhasspy
Offline private voice assistant for many human languages
Uspeech
Speech recognition toolkit for the arduino
Voice Overlay Ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Asrt speechrecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Specaugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Tensorflowasr
⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Ctcwordbeamsearch
Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Free Spoken Digit Dataset
A free audio dataset of spoken digits. Think MNIST for audio.
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Zamia Speech
Open tools and data for cloudless automatic speech recognition
Subsync
Subtitle Speech Synchronizer
Alan Sdk Web
Alan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.
Deepspeech Examples
Examples of how to use or integrate DeepSpeech
Libfaceid
libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Deepspeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
121-180 of 326 speech-recognition projects