pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+177.38%)

Mutual labels: speech-recognition, speech, asr, kaldi

PCPM

Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.

Stars: ✭ 21 (-97.22%)

Mutual labels: speech-recognition, language-model, asr

Openasr

A pytorch based end2end speech recognition system.

Stars: ✭ 69 (-90.87%)

Mutual labels: speech-recognition, speech, asr

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (-97.22%)

Mutual labels: speech-recognition, kaldi, asr

opensnips

Open source projects related to Snips https://snips.ai/.

Stars: ✭ 50 (-93.39%)

Mutual labels: speech, kaldi, asr

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-98.15%)

Mutual labels: speech, speech-recognition, kaldi

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+95.63%)

Mutual labels: speech-recognition, speech, asr

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-93.12%)

Mutual labels: speech, speech-recognition, asr

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+1375%)

Mutual labels: speech-recognition, speech, kaldi

Eesen

The official repository of the Eesen project

Stars: ✭ 738 (-2.38%)

Mutual labels: speech-recognition, asr, kaldi

Espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Stars: ✭ 808 (+6.88%)

Mutual labels: speech-recognition, asr, kaldi

vosk-model-ru-adaptation

No description or website provided.

Stars: ✭ 19 (-97.49%)

Mutual labels: speech-recognition, kaldi, asr

Vosk Server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Stars: ✭ 277 (-63.36%)

Mutual labels: speech-recognition, asr, kaldi

Vosk Android Demo

Offline speech recognition for Android with Vosk library.

Stars: ✭ 271 (-64.15%)

Mutual labels: speech-recognition, asr, kaldi

Vosk Api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Stars: ✭ 1,357 (+79.5%)

Mutual labels: speech-recognition, asr, kaldi

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-83.07%)

Mutual labels: speech-recognition, speech, asr

Speech To Text Russian

Проект для распознавания речи на русском языке на основе pykaldi.

Stars: ✭ 151 (-80.03%)

Mutual labels: speech-recognition, asr, kaldi

Awesome Kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (-48.02%)

Mutual labels: speech-recognition, speech, kaldi

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (-72.88%)

Mutual labels: speech-recognition, speech, asr

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (-72.88%)

Mutual labels: speech, speech-recognition, asr

asr24

24-hour Automatic Speech Recognition

Stars: ✭ 27 (-96.43%)

Mutual labels: kaldi, language-model, asr

rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (-86.24%)

Mutual labels: speech-recognition, kaldi, asr

Syn Speech

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Stars: ✭ 57 (-92.46%)

Mutual labels: speech-recognition, speech, asr

opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Stars: ✭ 21 (-97.22%)

Mutual labels: speech, speech-recognition, asr

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (-76.85%)

Mutual labels: speech-recognition, speech, asr

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (-76.32%)

Mutual labels: speech, speech-recognition, asr

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (-83.73%)

Mutual labels: speech, speech-recognition, asr

Wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 617 (-18.39%)

Mutual labels: speech-recognition, asr

syn-speech-samples

An application that demostrate the usage of Syn.Speech library for Speech Recognition

Stars: ✭ 24 (-96.83%)

Mutual labels: speech-recognition, asr

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 2,384 (+215.34%)

Mutual labels: speech-recognition, asr

mongolian-nlp

Useful resources for Mongolian NLP

Stars: ✭ 119 (-84.26%)

Mutual labels: speech-recognition, language-model

srvk-eesen-offline-transcriber

Top level code to transcribe English audio/video files into text/subtitles

Stars: ✭ 22 (-97.09%)

Mutual labels: speech-recognition, kaldi

kaldi helpers

🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.

Stars: ✭ 13 (-98.28%)

Mutual labels: speech, kaldi

deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (-89.15%)

Mutual labels: speech, speech-recognition

End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

Stars: ✭ 20 (-97.35%)

Mutual labels: speech-recognition, asr

torchain

WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)

Stars: ✭ 20 (-97.35%)

Mutual labels: kaldi, asr

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-96.43%)

Mutual labels: speech-recognition, asr

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Stars: ✭ 13,870 (+1734.66%)

Mutual labels: numpy, speech

vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Stars: ✭ 52 (-93.12%)

Mutual labels: speech-recognition, asr

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (-70.37%)

Mutual labels: speech, speech-recognition

speech to text

how to use the Google Cloud Speech API to transcribe audio/video files.

Stars: ✭ 35 (-95.37%)

Mutual labels: speech, speech-recognition

Speech Feature Extraction

Feature extraction of speech signal is the initial stage of any speech recognition system.

Stars: ✭ 78 (-89.68%)

Mutual labels: speech, feature-extraction

lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Stars: ✭ 36 (-95.24%)

Mutual labels: speech-recognition, asr

speech-recognition

SDKs and docs for Skit's speech to text service

Stars: ✭ 20 (-97.35%)

Mutual labels: speech-recognition, asr

kosr

Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)

Stars: ✭ 25 (-96.69%)

Mutual labels: speech-recognition, asr

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Stars: ✭ 112 (-85.19%)

Mutual labels: speech-recognition, asr

speech-transformer

Transformer implementation speciaized in speech recognition tasks using Pytorch.

Stars: ✭ 40 (-94.71%)

Mutual labels: speech, asr

kaldi-alligner

scripts to align a given wave to its transcription using trained models by Kaldi

Stars: ✭ 24 (-96.83%)

Mutual labels: kaldi, asr

speech-to-text

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

Stars: ✭ 61 (-91.93%)

Mutual labels: speech-recognition, kaldi

torch-asg

Auto Segmentation Criterion (ASG) implemented in pytorch

Stars: ✭ 42 (-94.44%)

Mutual labels: speech, asr

Speech Aligner

speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

Stars: ✭ 259 (-65.74%)

Mutual labels: speech, kaldi

Docker Kaldi Gstreamer Server

Dockerfile for kaldi-gstreamer-server.

Stars: ✭ 266 (-64.81%)

Mutual labels: asr, kaldi

UnityASR

Automatic Speech Recognition in Unity.

Stars: ✭ 14 (-98.15%)

Mutual labels: speech-recognition, asr

Pocketsphinx Python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Stars: ✭ 298 (-60.58%)

Mutual labels: speech-recognition, speech

1-60 of 1560 similar projects

›

next*5