Speech recognition with tensorflow
Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
📦 快速转化「中文数字」和「阿拉伯数字」～ (最新特性：分数，日期、温度等转化）
Kaldi-based Korean ASR (한국어 음성인식) open-source project
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
NeMo: a toolkit for conversational AI
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Kaldi Active Grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
An Android app that offers speech-to-text services and user interfaces to other apps
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Voice Overlay Android
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
VOSK Speech Recognition Toolkit
The CIDLib general purpose C++ development environment
A testing server for a speech to text service based on mozilla deepspeech
The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
A pure python module for reading and writing kaldi ark files
Py Kaldi Asr
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
A speech recognition framework designed for SwiftUI.
Speech Recognition Neural Network
This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.
Zzz Retired openstt
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.
Deep learning for audio processing
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
A tool for automatic phoneme transcription
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Alan Sdk Pcf
Alan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.
kaldi-asr/kaldi is the official location of the Kaldi project.
Wer are we
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.