All Projects → Tentacule → Pgstosrt

Tentacule / Pgstosrt

PGS to Srt converter

Projects that are alternatives of or similar to Pgstosrt

video-subtitle-extractor
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
Stars: ✭ 1,763 (+8295.24%)
Mutual labels:  ocr, subtitles, srt
ttml2srt
Convert TTML subtitles used by Netflix, HBO, CMore and others to SRT format
Stars: ✭ 51 (+142.86%)
Mutual labels:  subtitles, srt
ttml2srt
convert TTML subtitles to SRT subtitles
Stars: ✭ 46 (+119.05%)
Mutual labels:  subtitles, srt
pysub-parser
Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).
Stars: ✭ 40 (+90.48%)
Mutual labels:  subtitles, srt
Submerger
SRT Subtitles Merger
Stars: ✭ 92 (+338.1%)
Mutual labels:  srt, subtitles
Srt
A simple library for parsing, modifying, and composing SRT files.
Stars: ✭ 210 (+900%)
Mutual labels:  srt, subtitles
pysubs2
A Python library for editing subtitle files
Stars: ✭ 174 (+728.57%)
Mutual labels:  subtitles, srt
meltsub
Convert hardsub to softsub
Stars: ✭ 19 (-9.52%)
Mutual labels:  ocr, subtitles
ccaptioner
An extension to assign a text track to a video element in a web page
Stars: ✭ 45 (+114.29%)
Mutual labels:  subtitles, srt
libgosubs
golang library to read and write various subtitle formats
Stars: ✭ 20 (-4.76%)
Mutual labels:  subtitles, srt
Subed
Subtitle editor for Emacs
Stars: ✭ 77 (+266.67%)
Mutual labels:  srt, subtitles
Netflix To Srt
Rip, extract and convert subtitles to .srt closed captions from .xml/dfxp/ttml and .vtt/WebVTT (e.g. Netflix, YouTube)
Stars: ✭ 387 (+1742.86%)
Mutual labels:  srt, subtitles
Parsing With Haskell Parser Combinators
🔍 A step-by-step guide to parsing using Haskell parser combinators.
Stars: ✭ 72 (+242.86%)
Mutual labels:  srt, subtitles
Subtitle.js
Stream-based library for parsing and manipulating subtitle files
Stars: ✭ 234 (+1014.29%)
Mutual labels:  srt, subtitles
subed
Subtitle editor for Emacs
Stars: ✭ 143 (+580.95%)
Mutual labels:  subtitles, srt
Ccextractor
CCExtractor - Official version maintained by the core team
Stars: ✭ 356 (+1595.24%)
Mutual labels:  ocr, subtitles
Ffsubsync
Automagically synchronize subtitles with video.
Stars: ✭ 5,167 (+24504.76%)
Mutual labels:  srt, subtitles
Fasubrip
FaSubrip autodetects and converts encoding of farsi (persian) subrip files. with additional functionalities.
Stars: ✭ 22 (+4.76%)
Mutual labels:  srt
Cogstack Pipeline
Distributed, fault tolerant batch processing for Natural Language Applications and Search, using remote partitioning
Stars: ✭ 26 (+23.81%)
Mutual labels:  ocr
Tesseract
A PHP wrapper for the Tesseract OCR engine
Stars: ✭ 19 (-9.52%)
Mutual labels:  ocr

PgsToSrt

Convert pgs subtitles to srt using OCR.

Prerequisites

Data files must be placed in a tessdata folder inside PgsToSrt folder, or the path can be specified in the command line with the --tesseractdata parameter.

You only need data files for the language(s) you want to convert.

Usage

dotnet PgsToSrt.dll [parameters]

Parameters              
--input Input filename, can be an mkv file or pgs subtitle extracted to a .sup file with mkvextract.
--output Output Subrip (.srt) filename. Auto generated from input filename if not set.
--track Track number of the subtitle to process in an mkv file (only required when input is a matroska file)
This can be obtained with mkvinfo
--trackLanguage Convert all tracks of the specified language (only works with mkv input)
--tesseractlanguage Tesseract language to use if multiple languages are available in the tesseract data directory.
--tesseractdata Path of tesseract language data files, by default 'tessdata' in the executable directory.

Example (Command Line)

dotnet PgsToSrt.dll --input video1.fr.sup --output video1.fr.srt --tesseractlanguage fra
dotnet PgsToSrt.dll --input video1.mkv --output video1.srt --track 4

Example (Docker)

docker run -it --rm -v /data:/data \
           -e INPUT=/data/myImageSubtitle.sup \
           -e OUTPUT=/data/myTextSubtitle.srt \
           -e LANGUAGE=eng \
           tentacule/pgstosrt

Dependencies

  • Windows : none, tesseract/leptonica libraries are included in the release package.
  • Linux : libtesseract4 (sudo apt install libtesseract4)

Build

To build PgsToSrt.dll execute this commands

dotnet restore
dotnet publish -c Release -o out
#The file is on  PgsToSrt/out/PgsToSrt.dll

To build docker image (for now only for linux)

docker build -t pgstosrt .

Built With

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].