Top 106 asr open source projects

Wukong Robot
🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,还可能是首个支持脑机交互的开源智能音箱项目。
Cn2an
📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
Zeroth
Kaldi-based Korean ASR (한국어 음성인식) open-source project
Chinese text normalization
Chinese text normalization for speech processing
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Asr Evaluation
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Hms Ml Demo
HMS ML Demo provides an example of integrating Huawei ML Kit service into applications. This example demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, asr, and tts.
End2end Asr Pytorch
End-to-End Automatic Speech Recognition on PyTorch
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Mrcp Plugin With Freeswitch
使用FreeSWITCH接受用户手机呼叫,通过UniMRCP Server集成讯飞开放平台(xfyun)插件将用户语音进行语音识别(ASR),并根据自定义业务逻辑调用语音合成(TTS),构建简单的端到端语音呼叫中心。
✭ 168
ttsasr
Py Kaldi Asr
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Speecht
An opensource speech-to-text software written in tensorflow
Speech To Text Russian
Проект для распознавания речи на русском языке на основе pykaldi.
Listen Attend Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Asr syllable
基于卷积神经网络的语音识别声学模型的研究
Rnn Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Deepspeechrecognition
A Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型
E2e Asr
PyTorch Implementations for End-to-End Automatic Speech Recognition
Bigcidian
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Zerospeech Tts Without T
A Pytorch implementation for the ZeroSpeech 2019 challenge.
Ktspeechcrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos
Wav2letter
Speech Recognition model based off of FAIR research paper built using Pytorch.
Asr benchmark
Program to benchmark various speech recognition APIs
Openasr
A pytorch based end2end speech recognition system.
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Asrgen
Attacking Speaker Recognition with Deep Generative Models
Espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Eesen
The official repository of the Eesen project
Libreasr
💬 An On-Premises, Streaming Speech Recognition System
Wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Speech Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Athena
an open-source implementation of sequence-to-sequence based speech processing engine
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Zamia Speech
Open tools and data for cloudless automatic speech recognition
Asr theory
语音识别理论,论文和PPT
Tensorflow end2end speech recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Vosk Server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Vosk Android Demo
Offline speech recognition for Android with Vosk library.
Docker Kaldi Gstreamer Server
Dockerfile for kaldi-gstreamer-server.
spinorama
A library to display and compare spinorama (speakers measurements) graphs.
UnityASR
Automatic Speech Recognition in Unity.
1-60 of 106 asr projects