🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目，还可能是首个支持脑机交互的开源智能音箱项目。
📦 快速转化「中文数字」和「阿拉伯数字」～ (最新特性：分数，日期、温度等转化）
Kaldi-based Korean ASR (한국어 음성인식) open-source project
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Hms Ml Demo
HMS ML Demo provides an example of integrating Huawei ML Kit service into applications. This example demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, asr, and tts.
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Mrcp Plugin With Freeswitch
Py Kaldi Asr
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
An opensource speech-to-text software written in tensorflow
Listen Attend Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
PyTorch Implementations for End-to-End Automatic Speech Recognition
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
DELTA is a deep learning based natural language and speech processing platform.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Automatically constructing corpus for automatic speech recognition from YouTube videos
Speech Recognition model based off of FAIR research paper built using Pytorch.
AGI-server voice recognizer for #Asterisk
Program to benchmark various speech recognition APIs
A pytorch based end2end speech recognition system.
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Attacking Speaker Recognition with Deep Generative Models
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
SincNet is a neural architecture for efficiently processing raw audio samples.
The official repository of the Eesen project
💬 An On-Premises, Streaming Speech Recognition System
Production First and Production Ready End-to-End Speech Recognition Toolkit
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
an open-source implementation of sequence-to-sequence based speech processing engine
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
End-to-end ASR/LM implementation with PyTorch
Sequence-to-Sequence Framework in PyTorch
On-device streaming speech-to-text engine powered by deep learning
Open tools and data for cloudless automatic speech recognition
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
A library to display and compare spinorama (speakers measurements) graphs.
Automatic Speech Recognition in Unity.
SOVA ASR (Automatic Speech Recognition)