All Categories → Machine Learning → speech-recognition

Top 326 speech-recognition open source projects

Speech recognition with tensorflow
Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Cn2an
📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
Zeroth
Kaldi-based Korean ASR (한국어 음성인식) open-source project
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Chinese text normalization
Chinese text normalization for speech processing
Rnn ctc
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Dragonfly
Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Kaldi Active Grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
K6nele
An Android app that offers speech-to-text services and user interfaces to other apps
Dictate.js
A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Asr Evaluation
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Tensorflow Speech Recognition
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Kaldi Offline Transcriber
Offline transcription system for Estonian using Kaldi
Deepspeech German
Automatic Speech Recognition (ASR) - German
Deepspeech Server
A testing server for a speech to text service based on mozilla deepspeech
End2end Asr Pytorch
End-to-End Automatic Speech Recognition on PyTorch
Kaldi Onnx
Kaldi model converter to ONNX
Gst Kaldi Nnet2 Online
GStreamer plugin around Kaldi's online neural network decoder
Naomi
The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Kaldiio
A pure python module for reading and writing kaldi ark files
Rnnt Speech Recognition
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Py Kaldi Asr
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Clovacall
ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
Speech To Text Russian
Проект для распознавания речи на русском языке на основе pykaldi.
Swiftspeech
A speech recognition framework designed for SwiftUI.
Speech Recognition Neural Network
This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.
Zzz Retired openstt
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Speechrecognizerbutton
UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.
Aimybox Android Assistant
Embeddable custom voice assistant for Android applications
Go Astideepspeech
Golang bindings for Mozilla's DeepSpeech speech-to-text library
Allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Alan Sdk Pcf
Alan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.
Tensorflow Ctc Speech Recognition
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Lip Reading Deeplearning
🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
Keras Kaldi
Keras Interface for Kaldi ASR
Wer are we
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
1-60 of 326 speech-recognition projects