pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+10385%)

Mutual labels: speech-recognition, asr

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Stars: ✭ 4,943 (+24615%)

Mutual labels: speech-recognition, chinese-speech-recognition

Cn2an

📦 快速转化「中文数字」和「阿拉伯数字」～ (最新特性：分数，日期、温度等转化）

Stars: ✭ 249 (+1145%)

Mutual labels: speech-recognition, asr

Zeroth

Kaldi-based Korean ASR (한국어 음성인식) open-source project

Stars: ✭ 248 (+1140%)

Mutual labels: speech-recognition, asr

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+795%)

Mutual labels: speech-recognition, asr

Chinese text normalization

Chinese text normalization for speech processing

Stars: ✭ 242 (+1110%)

Mutual labels: speech-recognition, asr

wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Stars: ✭ 6,026 (+30030%)

Mutual labels: end-to-end, speech-recognition

Rus-SpeechRecognition-LSTM-CTC-VoxForge

Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge

Stars: ✭ 50 (+150%)

Mutual labels: end-to-end, speech-recognition

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+22565%)

Mutual labels: end-to-end, speech-recognition

Speech Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Stars: ✭ 565 (+2725%)

Mutual labels: end-to-end, asr

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (+925%)

Mutual labels: speech-recognition, asr

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (+925%)

Mutual labels: speech-recognition, asr

Speech Transformer Tf2.0

transformer for ASR-systerm (via tensorflow2.0)

Stars: ✭ 90 (+350%)

Mutual labels: end-to-end, speech-recognition

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+1670%)

Mutual labels: speech-recognition, asr

Nmtpytorch

Sequence-to-Sequence Framework in PyTorch

Stars: ✭ 392 (+1860%)

Mutual labels: speech-recognition, asr

Neural sp

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (+1940%)

Mutual labels: speech-recognition, asr

Lingvo

Stars: ✭ 2,361 (+11705%)

Mutual labels: speech-recognition, asr

Wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Stars: ✭ 5,907 (+29435%)

Mutual labels: end-to-end, speech-recognition

Listen Attend Spell

A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.

Stars: ✭ 147 (+635%)

Mutual labels: end-to-end, asr

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (+5%)

Mutual labels: speech-recognition, asr

1-60 of 409 similar projects

›

next*5