TesseractThis package contains an OCR engine - libtesseract and a command line program - tesseract.
Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused
on line recognition, but also still supports the legacy Tesseract OCR engine of
Tesseract 3 which works by recognizing character patterns. Compatibility with
Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0).
It also needs traineddata files which support the legacy engine, for example
those from the tessdata repository.
Stars: β 43,199 (+113581.58%)
Image2textπ Python wrapper to grab text from images and save as text files using Tesseract Engine
Stars: β 243 (+539.47%)
CcextractorCCExtractor - Official version maintained by the core team
Stars: β 356 (+836.84%)
TextshotPython tool for grabbing text via screenshot
Stars: β 1,163 (+2960.53%)
Aadhaar Card OcrExtract text information from Aadhaar Card using tesseract-ocr π
Stars: β 112 (+194.74%)
Tesseract4androidFork of tess-two rewritten from scratch to support latest version of Tesseract OCR.
Stars: β 148 (+289.47%)
TesseractBindings to Tesseract OCR engine for R
Stars: β 192 (+405.26%)
Nkocrππ This is a module to make specifics OCRs at food products and nutritional tables.
Stars: β 15 (-60.53%)
GosseractGo package for OCR (Optical Character Recognition), by using Tesseract C++ library
Stars: β 1,622 (+4168.42%)
breach-protocol-autosolverSolve breach protocol minigame in second(s). Windows/Linux/GeForce Now/Google Stadia. Every language.
Stars: β 28 (-26.32%)
Pan card ocr projectTo extract details from Indian National Identification Cards such as PAN (completed) & Aadhar, Passport, Driving License (WIP) in a structured format
Stars: β 39 (+2.63%)
IdmatchMatch faces on id cards with OCR capabilities.
Stars: β 52 (+36.84%)
Penteract OcrβοΈ The native node.js bindings to the Tesseract OCR project.
Stars: β 86 (+126.32%)
Ultimatemrz SdkMachine-readable zone/travel document (MRZ / MRTD) detector and recognizer using deep learning
Stars: β 66 (+73.68%)
Image text readerThe module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes. The image is pre-processed for better comprehension by OCR. This module first makes bounding box for text in images and then normalizes it to 300 dpi, suitable for OCR engine to read.
Stars: β 97 (+155.26%)
Links Detectorπ ππ» Links Detector makes printed links clickable via your smartphone camera. No need to type a link in, just scan and click on it.
Stars: β 106 (+178.95%)
TabuloTable Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)
Stars: β 110 (+189.47%)
LaraOCRLaravel Optical Character Reader(OCR) package using ocr engines like Tesseract
Stars: β 88 (+131.58%)
TesserocrA Python wrapper for the tesseract-ocr API
Stars: β 1,567 (+4023.68%)
Tesseract MacosObjective C wrapper for the open source OCR Engine Tesseract (macOS)
Stars: β 154 (+305.26%)
textocryTextocry - Copy text from Images (chrome extension)
Stars: β 29 (-23.68%)
nimtesseractA Tesseract OCR wrapper for Nim
Stars: β 23 (-39.47%)
OcrbotAn OCR (Optical Character Recognition) bot for Mastodon (and compatible) instances
Stars: β 39 (+2.63%)
BlackoutNaNoGenMo 2016 entry #2
Stars: β 36 (-5.26%)
Tesseract PythonExamples to implement OCR(Optical Character Recognition) using tesseract using Python
Stars: β 49 (+28.95%)
Cogstack PipelineDistributed, fault tolerant batch processing for Natural Language Applications and Search, using remote partitioning
Stars: β 26 (-31.58%)
Ocr Electron Vueπ A Simple OCR Application built on Electron, Vue.js & Tesseract.js
Stars: β 67 (+76.32%)
Android OcrExperimental optical character recognition app
Stars: β 2,177 (+5628.95%)
Tesseract4javaJava GUI and Tools for Tesseract OCR
Stars: β 214 (+463.16%)
Tessdata fastFast integer versions of trained LSTM models
Stars: β 221 (+481.58%)
ParsrTransforms PDF, Documents and Images into Enriched Structured Data
Stars: β 2,736 (+7100%)
ocrevalUpdate of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support
Stars: β 48 (+26.32%)
pmOCRA wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR conversion on file activity
Stars: β 53 (+39.47%)
tesseract-ocrNode.js wrapper for Tesseract OCR CLI.
Stars: β 29 (-23.68%)
OcrtableRecognize tables and text from scanned images that contain tables. δ»ε
ε«θ‘¨ζ Όηζ«ζεΎηδΈθ―ε«θ‘¨ζ Όεζε
Stars: β 155 (+307.89%)
OCRmyPDFOCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Stars: β 6,560 (+17163.16%)
PyocrA Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
Stars: β 932 (+2352.63%)
SwiftytesseractA Swift wrapper around Tesseract for use in iOS, macOS, and Linux applications
Stars: β 170 (+347.37%)
Text DetectionText detection with mainly MSER and SWT
Stars: β 167 (+339.47%)
TesstrainTrain Tesseract LSTM with make
Stars: β 251 (+560.53%)
ScribeBotA highly scriptable automation system full of cool features. Automate everything with a little bit of Lua.
Stars: β 72 (+89.47%)
Ocr TableExtract tables from scanned image PDFs using Optical Character Recognition.
Stars: β 165 (+334.21%)
mementoOrganize your meme image cluster in a better format using OCR from the meme to sort them using tesseract along with editing memes by segmenting them using OpenCV within a directory
Stars: β 70 (+84.21%)
saramGet OCR in txt form from an image or pdf extension supporting multiple files from directory using pytesseract with auto rotation for wrong orientation. PYPI:
Stars: β 51 (+34.21%)
ocr2textConvert a PDF via OCR to a TXT file in UTF-8 encoding
Stars: β 90 (+136.84%)
MouseTooltipTranslatorchrome extension - When mouse hover on text, it shows translated tooltip using google translate
Stars: β 93 (+144.74%)
erpnext ocrπ βοΈ Optical Character Recognition using tesseract within Frappe.
Stars: β 58 (+52.63%)
tesseract-unityStandalone OCR plugin for Unity using Tesseract
Stars: β 35 (-7.89%)
TesseractA PHP wrapper for the Tesseract OCR engine
Stars: β 19 (-50%)
Pytesseractidδ½Ώη¨ pytesseract ocr θ―ε« 18 δ½θΊ«δ»½θ―ε·
Stars: β 23 (-39.47%)
Lambda Text ExtractorAWS Lambda functions to extract text from various binary formats.
Stars: β 159 (+318.42%)