Speech recognition with tensorflowImplementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Cn2an📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
ZerothKaldi-based Korean ASR (한국어 음성인식) open-source project
Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
NemoNeMo: a toolkit for conversational AI
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
DragonflySpeech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Kaldi Active GrammarPython Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
K6neleAn Android app that offers speech-to-text services and user interfaces to other apps
Dictate.jsA small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
KospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Asr EvaluationPython module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
VoskVOSK Speech Recognition Toolkit
CidlibThe CIDLib general purpose C++ development environment
Deepspeech ServerA testing server for a speech to text service based on mozilla deepspeech
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Hey JetsonDeep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
KaldiioA pure python module for reading and writing kaldi ark files
Py Kaldi AsrSome simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
ClovacallClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
SwiftspeechA speech recognition framework designed for SwiftUI.
Speech Recognition Neural NetworkThis is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.
Zzz Retired opensttRETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
SpeechrecognizerbuttonUIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.
DlaDeep learning for audio processing
AllosaurusAllosaurus is a pretrained universal phone recognizer for more than 2000 languages
PersephoneA tool for automatic phoneme transcription
Asr audio data linksA list of publically available audio data that anyone can download for ASR or other speech activities
Alan Sdk PcfAlan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Wer are weAttempt at tracking states of the arts and recent results (bibliography) on speech recognition.