pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Stars: ✭ 2,097 (+14878.57%)

Mutual labels: speech-recognition, asr

Kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.

Stars: ✭ 190 (+1257.14%)

Mutual labels: speech-recognition, asr

Edgedict

Working online speech recognition based on RNN Transducer. (A trained model is available in the releases.)

Stars: ✭ 205 (+1364.29%)

Mutual labels: speech-recognition, asr

Openasr

A PyTorch-based end-to-end speech recognition system.

Stars: ✭ 69 (+392.86%)

Mutual labels: speech-recognition, asr

Asr benchmark

Program to benchmark various speech recognition APIs

Stars: ✭ 71 (+407.14%)

Mutual labels: speech-recognition, asr

Bigcidian

Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.

Stars: ✭ 99 (+607.14%)

Mutual labels: speech-recognition, asr

Mongolian Speech Recognition

Mongolian speech recognition with PyTorch

Stars: ✭ 97 (+592.86%)

Mutual labels: speech-recognition, asr

lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Stars: ✭ 36 (+157.14%)

Mutual labels: speech-recognition, asr

Py Kaldi Asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Stars: ✭ 156 (+1014.29%)

Mutual labels: speech-recognition, asr

Asr Evaluation

Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).

Stars: ✭ 190 (+1257.14%)

Mutual labels: speech-recognition, asr

Rnn Transducer

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

Stars: ✭ 114 (+714.29%)

Mutual labels: speech-recognition, asr

kosr

Transformer-based Korean speech recognition

Stars: ✭ 25 (+78.57%)

Mutual labels: speech-recognition, asr

wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

Stars: ✭ 205 (+1364.29%)

Mutual labels: speech-recognition, asr

ASR-Audio-Data-Links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+1178.57%)

Mutual labels: speech-recognition, asr

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (+535.71%)

Mutual labels: speech-recognition, stt

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (+278.57%)

Mutual labels: speech-recognition, asr

kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Stars: ✭ 456 (+3157.14%)

Mutual labels: speech-recognition, asr

speech-recognition

SDKs and docs for Skit's speech to text service

Stars: ✭ 20 (+42.86%)

Mutual labels: speech-recognition, asr

Syn Speech

Syn.Speech is a flexible, speaker-independent continuous speech recognition engine for Mono and the .NET Framework

Stars: ✭ 57 (+307.14%)

Mutual labels: speech-recognition, asr

Keras Sincnet

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Stars: ✭ 47 (+235.71%)

Mutual labels: speech-recognition, asr

Wav2letter

Speech recognition model based on the FAIR research paper, built using PyTorch.

Stars: ✭ 78 (+457.14%)

Mutual labels: speech-recognition, asr

Espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Stars: ✭ 808 (+5671.43%)

Mutual labels: speech-recognition, asr

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+10464.29%)

Mutual labels: speech-recognition, asr

Vosk Api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Stars: ✭ 1,357 (+9592.86%)

Mutual labels: speech-recognition, asr

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (+5357.14%)

Mutual labels: speech-recognition, asr

Speech To Text Russian

A project for Russian speech recognition based on pykaldi.

Stars: ✭ 151 (+978.57%)

Mutual labels: speech-recognition, asr

Asr audio data links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (+814.29%)

Mutual labels: speech-recognition, asr

vosk-model-ru-adaptation

No description or website provided.

Stars: ✭ 19 (+35.71%)

Mutual labels: speech-recognition, asr

Pytorch Asr

ASR with PyTorch

Stars: ✭ 124 (+785.71%)

Mutual labels: speech-recognition, asr

Tensorflow Speech Recognition

🎙 Speech recognition using the TensorFlow deep learning framework and sequence-to-sequence neural networks

Stars: ✭ 2,118 (+15028.57%)

Mutual labels: speech-recognition, stt

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (+1150%)

Mutual labels: speech-recognition, asr

Lingvo

Stars: ✭ 2,361 (+16764.29%)

Mutual labels: speech-recognition, asr

Pykaldi

A Python wrapper for Kaldi

Stars: ✭ 756 (+5300%)

Mutual labels: speech-recognition, asr

vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Stars: ✭ 52 (+271.43%)

Mutual labels: speech-recognition, asr

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (+50%)

Mutual labels: speech-recognition, asr

Cn2an

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)

Stars: ✭ 249 (+1678.57%)

Mutual labels: speech-recognition, asr

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 2,384 (+16928.57%)

Mutual labels: speech-recognition, asr

syn-speech-samples

An application that demonstrates the usage of the Syn.Speech library for speech recognition

Stars: ✭ 24 (+71.43%)

Mutual labels: speech-recognition, asr

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (+50%)

Mutual labels: speech-recognition, asr

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Stars: ✭ 112 (+700%)

Mutual labels: speech-recognition, asr

Zeroth

Kaldi-based Korean ASR (Korean speech recognition) open-source project

Stars: ✭ 248 (+1671.43%)

Mutual labels: speech-recognition, asr

scripty

Speech to text bot for Discord using Mozilla's DeepSpeech

Stars: ✭ 14 (+0%)

Mutual labels: speech-recognition, stt

rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (+642.86%)

Mutual labels: speech-recognition, asr

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+5907.14%)

Mutual labels: speech-recognition, stt

PCPM

Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.

Stars: ✭ 21 (+50%)

Mutual labels: speech-recognition, asr

Libreasr

💬 An On-Premises, Streaming Speech Recognition System