pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+1916.35%)

Mutual labels: speech-recognition, kaldi, asr

kaldi-alligner

scripts to align a given wave to its transcription using trained models by Kaldi

Stars: ✭ 24 (-76.92%)

Mutual labels: kaldi, asr, kaldi-asr

NTUA-slp-nlp

💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA

Stars: ✭ 19 (-81.73%)

Mutual labels: automata, openfst, kaldi-asr

Pykaldi

A Python wrapper for Kaldi

Stars: ✭ 756 (+626.92%)

Mutual labels: speech-recognition, kaldi, asr

Zeroth

Kaldi-based Korean ASR (한국어 음성인식) open-source project

Stars: ✭ 248 (+138.46%)

Mutual labels: speech-recognition, kaldi, asr

vosk-model-ru-adaptation

No description or website provided.

Stars: ✭ 19 (-81.73%)

Mutual labels: speech-recognition, kaldi, asr

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (-79.81%)

Mutual labels: speech-recognition, kaldi, asr

Vosk Android Demo

Offline speech recognition for Android with Vosk library.

Stars: ✭ 271 (+160.58%)

Mutual labels: speech-recognition, kaldi, asr

Zamia Speech

Open tools and data for cloudless automatic speech recognition

Stars: ✭ 374 (+259.62%)

Mutual labels: speech-recognition, kaldi, asr

Libreasr

💬 An On-Premises, Streaming Speech Recognition System

Stars: ✭ 633 (+508.65%)

Mutual labels: speech-recognition, asr

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (+634.62%)

Mutual labels: speech-recognition, asr

Keras Sincnet

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Stars: ✭ 47 (-54.81%)

Mutual labels: speech-recognition, asr

Asr benchmark

Program to benchmark various speech recognition APIs

Stars: ✭ 71 (-31.73%)

Mutual labels: speech-recognition, asr

Syn Speech

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Stars: ✭ 57 (-45.19%)

Mutual labels: speech-recognition, asr

Wav2letter

Speech Recognition model based off of FAIR research paper built using Pytorch.

Stars: ✭ 78 (-25%)

Mutual labels: speech-recognition, asr

Mongolian Speech Recognition

Mongolian speech recognition with PyTorch

Stars: ✭ 97 (-6.73%)

Mutual labels: speech-recognition, asr

Bigcidian

Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.

Stars: ✭ 99 (-4.81%)

Mutual labels: speech-recognition, asr

E2e Asr

PyTorch Implementations for End-to-End Automatic Speech Recognition

Stars: ✭ 106 (+1.92%)

Mutual labels: speech-recognition, asr

End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

Stars: ✭ 20 (-80.77%)

Mutual labels: speech-recognition, asr

Factorized Tdnn

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Stars: ✭ 98 (-5.77%)

Mutual labels: speech-recognition, kaldi

Rnn Transducer

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

Stars: ✭ 114 (+9.62%)

Mutual labels: speech-recognition, asr

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+10622.12%)

Mutual labels: speech-recognition, kaldi

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (+23.08%)

Mutual labels: speech-recognition, asr

Wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 617 (+493.27%)

Mutual labels: speech-recognition, asr

Openasr

A pytorch based end2end speech recognition system.

Stars: ✭ 69 (-33.65%)

Mutual labels: speech-recognition, asr

Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

Stars: ✭ 1,120 (+976.92%)

Mutual labels: speech-recognition, kaldi

Ktspeechcrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Stars: ✭ 92 (-11.54%)

Mutual labels: speech-recognition, asr

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+421.15%)

Mutual labels: speech-recognition, asr

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+1322.12%)

Mutual labels: speech-recognition, asr

Kaldi Gop

Computes the GMM-based Goodness of Pronunciation (GOP). Bases on Kaldi.

Stars: ✭ 104 (+0%)

Mutual labels: speech-recognition, kaldi

Deepspeechrecognition

A Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型

Stars: ✭ 1,421 (+1266.35%)

Mutual labels: speech-recognition, asr

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (+401.92%)

Mutual labels: speech-recognition, asr

Kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.

Stars: ✭ 190 (+82.69%)

Mutual labels: speech-recognition, asr

Asr Evaluation

Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).

Stars: ✭ 190 (+82.69%)

Mutual labels: speech-recognition, asr

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (+68.27%)

Mutual labels: speech-recognition, asr

Kaldi Onnx

Kaldi model converter to ONNX

Stars: ✭ 174 (+67.31%)

Mutual labels: speech-recognition, kaldi

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (+88.46%)

Mutual labels: speech-recognition, kaldi

Lingvo

Stars: ✭ 2,361 (+2170.19%)

Mutual labels: speech-recognition, asr

rnnt decoder cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

Stars: ✭ 60 (-42.31%)

Mutual labels: speech-recognition, transducer

Chinese text normalization

Chinese text normalization for speech processing

Stars: ✭ 242 (+132.69%)

Mutual labels: speech-recognition, asr

Cn2an

📦 快速转化「中文数字」和「阿拉伯数字」～ (最新特性：分数，日期、温度等转化）

Stars: ✭ 249 (+139.42%)

Mutual labels: speech-recognition, asr

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Stars: ✭ 112 (+7.69%)

Mutual labels: speech-recognition, asr

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (+97.12%)

Mutual labels: speech-recognition, asr

Umbrella

"A collection of functional programming libraries that can be composed together. Unlike a framework, thi.ng is a suite of instruments and you (the user) must be the composer of. Geared towards versatility, not any specific type of music." — @loganpowell via Twitter

Stars: ✭ 2,186 (+2001.92%)

Mutual labels: composition, transducers

PCPM

Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.

Stars: ✭ 21 (-79.81%)

Mutual labels: speech-recognition, asr

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+240.38%)

Mutual labels: speech-recognition, asr

megs

A merged version of multiple open-source German speech datasets.