Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (+13.04%)

Mutual labels: speech-recognition, asr

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 2,384 (+2491.3%)

Mutual labels: speech-recognition, asr

lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Stars: ✭ 36 (-60.87%)

Mutual labels: speech-recognition, asr

kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Stars: ✭ 456 (+395.65%)

Mutual labels: speech-recognition, asr

Vosk Android Demo

Offline speech recognition for Android with Vosk library.

Stars: ✭ 271 (+194.57%)

Mutual labels: speech-recognition, asr

Vosk Server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Stars: ✭ 277 (+201.09%)

Mutual labels: speech-recognition, asr

Newpipeextractor

Core part of NewPipe

Stars: ✭ 400 (+334.78%)

Mutual labels: crawler, youtube

Zamia Speech

Open tools and data for cloudless automatic speech recognition

Stars: ✭ 374 (+306.52%)

Mutual labels: speech-recognition, asr

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (+730.43%)

Mutual labels: speech-recognition, asr

Espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Stars: ✭ 808 (+778.26%)

Mutual labels: speech-recognition, asr

Chinese text normalization

Chinese text normalization for speech processing

Stars: ✭ 242 (+163.04%)

Mutual labels: speech-recognition, asr

Openasr

A pytorch based end2end speech recognition system.

Stars: ✭ 69 (-25%)

Mutual labels: speech-recognition, asr

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (+122.83%)

Mutual labels: speech-recognition, asr

Youtube Projects

This repository contains all the code I use in my YouTube tutorials.

Stars: ✭ 144 (+56.52%)

Mutual labels: crawler, youtube

Lingvo

Stars: ✭ 2,361 (+2466.3%)

Mutual labels: speech-recognition, asr

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (+122.83%)

Mutual labels: speech-recognition, asr

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+284.78%)

Mutual labels: speech-recognition, asr

opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Stars: ✭ 21 (-77.17%)

Mutual labels: speech-recognition, asr

Kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.

Stars: ✭ 190 (+106.52%)

Mutual labels: speech-recognition, asr

PCPM

Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.

Stars: ✭ 21 (-77.17%)

Mutual labels: speech-recognition, asr

End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

Stars: ✭ 20 (-78.26%)

Mutual labels: speech-recognition, asr

syn-speech-samples

An application that demostrate the usage of Syn.Speech library for Speech Recognition

Stars: ✭ 24 (-73.91%)

Mutual labels: speech-recognition, asr

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Stars: ✭ 112 (+21.74%)

Mutual labels: speech-recognition, asr

vosk-model-ru-adaptation

No description or website provided.

Stars: ✭ 19 (-79.35%)

Mutual labels: speech-recognition, asr

vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Stars: ✭ 52 (-43.48%)

Mutual labels: speech-recognition, asr

kosr

Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)

Stars: ✭ 25 (-72.83%)

Mutual labels: speech-recognition, asr

Asr Evaluation

Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).

Stars: ✭ 190 (+106.52%)

Mutual labels: speech-recognition, asr

Wav2letter

Speech Recognition model based off of FAIR research paper built using Pytorch.

Stars: ✭ 78 (-15.22%)

Mutual labels: speech-recognition, asr

UnityASR

Automatic Speech Recognition in Unity.

Stars: ✭ 14 (-84.78%)

Mutual labels: speech-recognition, asr

Tensorflow end2end speech recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Stars: ✭ 305 (+231.52%)

Mutual labels: speech-recognition, asr

demo vietasr

Vietnamese Speech Recognition

Stars: ✭ 22 (-76.09%)

Mutual labels: speech-recognition, asr

Nmtpytorch

Sequence-to-Sequence Framework in PyTorch

Stars: ✭ 392 (+326.09%)

Mutual labels: speech-recognition, asr

Cheetah

On-device streaming speech-to-text engine powered by deep learning

Stars: ✭ 383 (+316.3%)

Mutual labels: speech-recognition, asr

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (+467.39%)

Mutual labels: speech-recognition, asr

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (+33.7%)

Mutual labels: speech-recognition, asr

Pykaldi

A Python wrapper for Kaldi

Stars: ✭ 756 (+721.74%)

Mutual labels: speech-recognition, asr

Eesen

The official repository of the Eesen project

Stars: ✭ 738 (+702.17%)

Mutual labels: speech-recognition, asr

Social Scraper

Tổng hợp script crawl dữ liệu từ các mạng xã hội & website tiếng Việt

Stars: ✭ 47 (-48.91%)

Mutual labels: crawler, youtube

Libreasr

💬 An On-Premises, Streaming Speech Recognition System

Stars: ✭ 633 (+588.04%)

Mutual labels: speech-recognition, asr

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+2179.35%)

Mutual labels: speech-recognition, asr

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (+90.22%)

Mutual labels: speech-recognition, asr

download audioset

📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).

Stars: ✭ 53 (-42.39%)

Mutual labels: youtube, speech-recognition

Wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit