Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → srvk → srvk-eesen-offline-transcriber

srvk / srvk-eesen-offline-transcriber

Licence: other

Top level code to transcribe English audio/video files into text/subtitles

Programming Languages

77523 projects

139335 projects - #7 most used programming language

30231 projects

Labels

speech-recognition kaldi eesen

Projects that are alternatives of or similar to srvk-eesen-offline-transcriber

Computes the GMM-based Goodness of Pronunciation (GOP). Bases on Kaldi.

Stars: ✭ 104 (+372.73%)

Mutual labels: speech-recognition, kaldi

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Stars: ✭ 156 (+609.09%)

Mutual labels: speech-recognition, kaldi

ASR with PyTorch

Stars: ✭ 124 (+463.64%)

Mutual labels: speech-recognition, kaldi

Factorized Tdnn

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Stars: ✭ 98 (+345.45%)

Mutual labels: speech-recognition, kaldi

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (+790.91%)

Mutual labels: speech-recognition, kaldi

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Stars: ✭ 1,357 (+6068.18%)

Mutual labels: speech-recognition, kaldi

Speech To Text Russian

Проект для распознавания речи на русском языке на основе pykaldi.

Stars: ✭ 151 (+586.36%)

Mutual labels: speech-recognition, kaldi

The official repository of the Eesen project

Stars: ✭ 738 (+3254.55%)

Mutual labels: speech-recognition, kaldi

Kaldi model converter to ONNX

Stars: ✭ 174 (+690.91%)

Mutual labels: speech-recognition, kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+9431.82%)

Mutual labels: speech-recognition, kaldi

the open-source virtual assistant for Ubuntu based Linux distributions

Stars: ✭ 1,120 (+4990.91%)

Mutual labels: speech-recognition, kaldi

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-36.36%)

Mutual labels: speech-recognition, kaldi

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Stars: ✭ 808 (+3572.73%)

Mutual labels: speech-recognition, kaldi

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (+372.73%)

Mutual labels: speech-recognition, kaldi

A Python wrapper for Kaldi

Stars: ✭ 756 (+3336.36%)

Mutual labels: speech-recognition, kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+50586.36%)

Mutual labels: speech-recognition, kaldi

Open tools and data for cloudless automatic speech recognition

Stars: ✭ 374 (+1600%)

Mutual labels: speech-recognition, kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (+1686.36%)

Mutual labels: speech-recognition, kaldi

A pure python module for reading and writing kaldi ark files

Stars: ✭ 160 (+627.27%)

Mutual labels: speech-recognition, kaldi

Kaldi-based Korean ASR (한국어 음성인식) open-source project

Stars: ✭ 248 (+1027.27%)

Mutual labels: speech-recognition, kaldi

View All Similar Projects ➔

srvk-eesen-offline-transcriber

srvk/eesen customized version of Tanel Alumae's kaldi-offline-transcriber

You probably want to use this inside the SRVK's Eesen Transcriber, not on its own.

Files in this folder

speech2text.sh - Transcribe audio/video file and produce several output formats at once (plaintext, subtitles, NIST CTM scoring input, Audacity labels)
vids2web.sh - Transcribe and create video subtitles and searchable index in a web page
run-segmented.sh - If you have your own segmentation file this may improve transcription accuracy
run-scored.sh - If you have STM ground truth as well as audio/video, produce NIST SCLITE scoring results in build/trans/<videoname>/eesen/decode/score_*
run-scored-8k.sh - Same but for 8khz audio such as Switchboard corpus
batch.sh - Queue several files for transcription
slurm.sh - for batch processing, edit to change which transcribe script is used (speech2text.sh by default)
mkpages.sh - Make/update web pages from video and transcription output
watch.sh - Run this to start watching a shared folder for files to be transcribed
path.sh - set up the PATH environment variable for the above
Makefile - master control for transcriber

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 22

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (2) 🔗