pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Stars: ✭ 2,097 (+231.28%)

Mutual labels: speech-recognition, asr

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (-96.05%)

Mutual labels: speech-recognition, asr

kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Stars: ✭ 456 (-27.96%)

Mutual labels: speech-recognition, asr

Cheetah

On-device streaming speech-to-text engine powered by deep learning

Stars: ✭ 383 (-39.49%)

Mutual labels: speech-recognition, asr

End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

Stars: ✭ 20 (-96.84%)

Mutual labels: speech-recognition, asr

vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Stars: ✭ 52 (-91.79%)

Mutual labels: speech-recognition, asr

vosk-model-ru-adaptation

No description or website provided.

Stars: ✭ 19 (-97%)

Mutual labels: speech-recognition, asr

Vosk Api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Stars: ✭ 1,357 (+114.38%)

Mutual labels: speech-recognition, asr

kosr

Transformer-based Korean speech recognition

Stars: ✭ 25 (-96.05%)

Mutual labels: speech-recognition, asr

Mongolian Speech Recognition

Mongolian speech recognition with PyTorch

Stars: ✭ 97 (-84.68%)

Mutual labels: speech-recognition, asr

E2e Asr

PyTorch Implementations for End-to-End Automatic Speech Recognition

Stars: ✭ 106 (-83.25%)

Mutual labels: speech-recognition, asr

Ktspeechcrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Stars: ✭ 92 (-85.47%)

Mutual labels: speech-recognition, asr

Asr audio data links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-79.78%)

Mutual labels: speech-recognition, asr

Pytorch Asr

ASR with PyTorch

Stars: ✭ 124 (-80.41%)

Mutual labels: speech-recognition, asr

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (-17.54%)

Mutual labels: speech-recognition, asr

Wav2letter

Speech recognition model based on a FAIR research paper, built using PyTorch.

Stars: ✭ 78 (-87.68%)

Mutual labels: speech-recognition, asr

Kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.

Stars: ✭ 190 (-69.98%)

Mutual labels: speech-recognition, asr

Asr Evaluation

Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).

Stars: ✭ 190 (-69.98%)

Mutual labels: speech-recognition, asr

Edgedict

Working online speech recognition based on RNN Transducer. (Trained model available in the releases.)

Stars: ✭ 205 (-67.61%)

Mutual labels: speech-recognition, asr

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (-72.35%)

Mutual labels: speech-recognition, asr

Vosk Server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Stars: ✭ 277 (-56.24%)

Mutual labels: speech-recognition, asr

Cn2an

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)

Stars: ✭ 249 (-60.66%)

Mutual labels: speech-recognition, asr

Vosk Android Demo

Offline speech recognition for Android with Vosk library.

Stars: ✭ 271 (-57.19%)

Mutual labels: speech-recognition, asr

Asr benchmark

Program to benchmark various speech recognition APIs

Stars: ✭ 71 (-88.78%)

Mutual labels: speech-recognition, asr

opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Stars: ✭ 21 (-96.68%)

Mutual labels: speech-recognition, asr

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-91.63%)

Mutual labels: speech-recognition, asr

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Stars: ✭ 112 (-82.31%)

Mutual labels: speech-recognition, asr

ASR-Audio-Data-Links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (-71.72%)

Mutual labels: speech-recognition, asr

rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (-83.57%)

Mutual labels: speech-recognition, asr

PCPM

Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.

Stars: ✭ 21 (-96.68%)

Mutual labels: speech-recognition, asr

UnityASR

Automatic Speech Recognition in Unity.

Stars: ✭ 14 (-97.79%)

Mutual labels: speech-recognition, asr

wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.