All Projects → srvk → srvk-eesen-offline-transcriber

srvk / srvk-eesen-offline-transcriber

Licence: other
Top level code to transcribe English audio/video files into text/subtitles

Programming Languages

shell
77523 projects
python
139335 projects - #7 most used programming language
Makefile
30231 projects

Projects that are alternatives of or similar to srvk-eesen-offline-transcriber

Kaldi Gop
Computes the GMM-based Goodness of Pronunciation (GOP). Bases on Kaldi.
Stars: ✭ 104 (+372.73%)
Mutual labels:  speech-recognition, kaldi
Py Kaldi Asr
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (+609.09%)
Mutual labels:  speech-recognition, kaldi
Pytorch Asr
ASR with PyTorch
Stars: ✭ 124 (+463.64%)
Mutual labels:  speech-recognition, kaldi
Factorized Tdnn
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Stars: ✭ 98 (+345.45%)
Mutual labels:  speech-recognition, kaldi
Kaldi Active Grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (+790.91%)
Mutual labels:  speech-recognition, kaldi
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+6068.18%)
Mutual labels:  speech-recognition, kaldi
Speech To Text Russian
Проект для распознавания речи на русском языке на основе pykaldi.
Stars: ✭ 151 (+586.36%)
Mutual labels:  speech-recognition, kaldi
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+3254.55%)
Mutual labels:  speech-recognition, kaldi
Kaldi Onnx
Kaldi model converter to ONNX
Stars: ✭ 174 (+690.91%)
Mutual labels:  speech-recognition, kaldi
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+9431.82%)
Mutual labels:  speech-recognition, kaldi
Dragonfire
the open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+4990.91%)
Mutual labels:  speech-recognition, kaldi
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-36.36%)
Mutual labels:  speech-recognition, kaldi
Espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (+3572.73%)
Mutual labels:  speech-recognition, kaldi
rustfst
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+372.73%)
Mutual labels:  speech-recognition, kaldi
Pykaldi
A Python wrapper for Kaldi
Stars: ✭ 756 (+3336.36%)
Mutual labels:  speech-recognition, kaldi
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+50586.36%)
Mutual labels:  speech-recognition, kaldi
Zamia Speech
Open tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+1600%)
Mutual labels:  speech-recognition, kaldi
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+1686.36%)
Mutual labels:  speech-recognition, kaldi
Kaldiio
A pure python module for reading and writing kaldi ark files
Stars: ✭ 160 (+627.27%)
Mutual labels:  speech-recognition, kaldi
Zeroth
Kaldi-based Korean ASR (한국어 음성인식) open-source project
Stars: ✭ 248 (+1027.27%)
Mutual labels:  speech-recognition, kaldi

srvk-eesen-offline-transcriber

srvk/eesen customized version of Tanel Alumae's kaldi-offline-transcriber

You probably want to use this inside the SRVK's Eesen Transcriber, not on its own.

Files in this folder

  • speech2text.sh - Transcribe audio/video file and produce several output formats at once (plaintext, subtitles, NIST CTM scoring input, Audacity labels)
  • vids2web.sh - Transcribe and create video subtitles and searchable index in a web page
  • run-segmented.sh - If you have your own segmentation file this may improve transcription accuracy
  • run-scored.sh - If you have STM ground truth as well as audio/video, produce NIST SCLITE scoring results in build/trans/<videoname>/eesen/decode/score_*
  • run-scored-8k.sh - Same but for 8khz audio such as Switchboard corpus
  • batch.sh - Queue several files for transcription
  • slurm.sh - for batch processing, edit to change which transcribe script is used (speech2text.sh by default)
  • mkpages.sh - Make/update web pages from video and transcription output
  • watch.sh - Run this to start watching a shared folder for files to be transcribed
  • path.sh - set up the PATH environment variable for the above
  • Makefile - master control for transcriber
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].