pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+239.87%)

Mutual labels: speech-recognition, asr

E2e Asr

PyTorch Implementations for End-to-End Automatic Speech Recognition

Stars: ✭ 106 (-82.82%)

Mutual labels: speech-recognition, asr

Deepspeechrecognition

A Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型

Stars: ✭ 1,421 (+130.31%)

Mutual labels: speech-recognition, asr

Speech To Text Russian

Проект для распознавания речи на русском языке на основе pykaldi.

Stars: ✭ 151 (-75.53%)

Mutual labels: speech-recognition, asr

Pytorch Asr

ASR with PyTorch

Stars: ✭ 124 (-79.9%)

Mutual labels: speech-recognition, asr

Zeroth

Kaldi-based Korean ASR (한국어 음성인식) open-source project

Stars: ✭ 248 (-59.81%)

Mutual labels: speech-recognition, asr

Chinese text normalization

Chinese text normalization for speech processing

Stars: ✭ 242 (-60.78%)

Mutual labels: speech-recognition, asr

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-96.6%)

Mutual labels: speech-recognition, asr

Nmtpytorch

Sequence-to-Sequence Framework in PyTorch

Stars: ✭ 392 (-36.47%)

Mutual labels: speech-recognition, asr

Asr Evaluation

Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).

Stars: ✭ 190 (-69.21%)

Mutual labels: speech-recognition, asr

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (-42.63%)

Mutual labels: speech-recognition, asr

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-91.41%)

Mutual labels: speech-recognition, asr

Speech Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Stars: ✭ 565 (-8.43%)

Mutual labels: asr, transformer

PCPM

Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.

Stars: ✭ 21 (-96.6%)

Mutual labels: speech-recognition, asr

rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (-83.14%)

Mutual labels: speech-recognition, asr

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (-15.4%)

Mutual labels: speech-recognition, asr

Bigcidian

Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.

Stars: ✭ 99 (-83.95%)

Mutual labels: speech-recognition, asr

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+139.71%)

Mutual labels: speech-recognition, asr

Vosk Api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Stars: ✭ 1,357 (+119.94%)

Mutual labels: speech-recognition, asr

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (-96.6%)

Mutual labels: speech-recognition, asr

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-79.25%)

Mutual labels: speech-recognition, asr

Transformer-Transducer

PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)

Stars: ✭ 61 (-90.11%)

Mutual labels: transformer, speech-recognition

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-95.62%)

Mutual labels: speech-recognition, asr

vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Stars: ✭ 52 (-91.57%)

Mutual labels: speech-recognition, asr

Mongolian Speech Recognition

Mongolian speech recognition with PyTorch

Stars: ✭ 97 (-84.28%)

Mutual labels: speech-recognition, asr

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (-66.77%)

Mutual labels: speech-recognition, asr

Lingvo

Stars: ✭ 2,361 (+282.66%)

Mutual labels: speech-recognition, asr

Cn2an

📦 快速转化「中文数字」和「阿拉伯数字」～ (最新特性：分数，日期、温度等转化）

Stars: ✭ 249 (-59.64%)

Mutual labels: speech-recognition, asr

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (-70.99%)

Mutual labels: speech-recognition, asr

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (-66.77%)

Mutual labels: speech-recognition, asr

opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Stars: ✭ 21 (-96.6%)

Mutual labels: speech-recognition, asr

Ktspeechcrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Stars: ✭ 92 (-85.09%)

Mutual labels: speech-recognition, asr

Cheetah

On-device streaming speech-to-text engine powered by deep learning

Stars: ✭ 383 (-37.93%)

Mutual labels: speech-recognition, asr

End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

Stars: ✭ 20 (-96.76%)

Mutual labels: speech-recognition, asr

Zamia Speech

Open tools and data for cloudless automatic speech recognition

Stars: ✭ 374 (-39.38%)

Mutual labels: speech-recognition, asr

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Stars: ✭ 112 (-81.85%)

Mutual labels: speech-recognition, asr

syn-speech-samples

An application that demostrate the usage of Syn.Speech library for Speech Recognition

Stars: ✭ 24 (-96.11%)

Mutual labels: speech-recognition, asr

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-91.57%)

Mutual labels: speech-recognition, asr

speech-transformer

Transformer implementation speciaized in speech recognition tasks using Pytorch.

Stars: ✭ 40 (-93.52%)

Mutual labels: transformer, asr

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (-80.06%)

Mutual labels: speech-recognition, asr

UnityASR

Automatic Speech Recognition in Unity.

Stars: ✭ 14 (-97.73%)

Mutual labels: speech-recognition, asr

speech-recognition

SDKs and docs for Skit's speech to text service

Stars: ✭ 20 (-96.76%)

Mutual labels: speech-recognition, asr

demo vietasr

Vietnamese Speech Recognition

Stars: ✭ 22 (-96.43%)

Mutual labels: speech-recognition, asr

lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Stars: ✭ 36 (-94.17%)

Mutual labels: speech-recognition, asr

Vosk Server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Stars: ✭ 277 (-55.11%)

Mutual labels: speech-recognition, asr

Wav2letter

Speech Recognition model based off of FAIR research paper built using Pytorch.

Stars: ✭ 78 (-87.36%)

Mutual labels: speech-recognition, asr

Speech Transformer Tf2.0

transformer for ASR-systerm (via tensorflow2.0)

Stars: ✭ 90 (-85.41%)

Mutual labels: speech-recognition, transformer

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (-95.95%)

Mutual labels: speech-recognition, asr

vosk-model-ru-adaptation

No description or website provided.

Stars: ✭ 19 (-96.92%)

Mutual labels: speech-recognition, asr

Vosk Android Demo

Offline speech recognition for Android with Vosk library.

Stars: ✭ 271 (-56.08%)

Mutual labels: speech-recognition, asr

1-60 of 712 similar projects

›

next*5