ocr-fileformatValidate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
Stars: ✭ 142 (+305.71%)
kitodo-presentationKitodo.Presentation is a feature-rich framework for building a METS- or IIIF-based digital library. It is part of the Kitodo Digital Library Suite.
Stars: ✭ 33 (-5.71%)
PAN-Card-OCRRetrive meaningful information from PAN Card image using tesseract-ocr 😎
Stars: ✭ 115 (+228.57%)
jochreJava Optical CHaracter Recognition
Stars: ✭ 18 (-48.57%)
Tesseract4androidFork of tess-two rewritten from scratch to support latest version of Tesseract OCR.
Stars: ✭ 148 (+322.86%)
vrpdrDeep Learning Applied To Vehicle Registration Plate Detection and Recognition in PyTorch.
Stars: ✭ 36 (+2.86%)
BnLMetsExporterCommand Line Interface (CLI) to export METS/ALTO documents to other formats.
Stars: ✭ 11 (-68.57%)
hOCR-to-ALTOConvert between Tesseract hOCR and ALTO XML using XSL stylesheets
Stars: ✭ 40 (+14.29%)
doctr-tfjs-demoJavascript demo of docTR, powered by TensorFlowJS
Stars: ✭ 21 (-40%)
IdCardRecognitionAndroid id card recognition based on OCR. 安卓基于OCR的身份证识别。
Stars: ✭ 35 (+0%)
Pan card ocr projectTo extract details from Indian National Identification Cards such as PAN (completed) & Aadhar, Passport, Driving License (WIP) in a structured format
Stars: ✭ 39 (+11.43%)
doctrdocTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Stars: ✭ 1,409 (+3925.71%)
EasyocrReady-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Stars: ✭ 13,379 (+38125.71%)
SwiftocrFast and simple OCR library written in Swift
Stars: ✭ 4,459 (+12640%)
SsocrSeven Segment Optical Character Recognition
Stars: ✭ 133 (+280%)
EyevisAndroid based Vocal Vision for Visually Impaired. Object Detection, Voice Assistance, Optical Character Reader, Read Aloud, Face Recognition, Landmark Recognition, Image Labelling etc.
Stars: ✭ 48 (+37.14%)
Penteract Ocr⭐️ The native node.js bindings to the Tesseract OCR project.
Stars: ✭ 86 (+145.71%)
DocumentLabOCR using tesseract, ImageMagick, EmguCV, an advanced query language and a fluent query interface for C#
Stars: ✭ 64 (+82.86%)
Persian-OCROptical character recognition of Farsi and Arabic letters
Stars: ✭ 36 (+2.86%)
blinkid-in-browserBlinkID In-browser SDK for WebAssembly-enabled browsers.
Stars: ✭ 40 (+14.29%)
Ocr TableExtract tables from scanned image PDFs using Optical Character Recognition.
Stars: ✭ 165 (+371.43%)
SwiftytesseractA Swift wrapper around Tesseract for use in iOS, macOS, and Linux applications
Stars: ✭ 170 (+385.71%)
Receipt ScannerReceipt scanner extracts information from your PDF or image receipts - built in NodeJS
Stars: ✭ 190 (+442.86%)
Android OcrExperimental optical character recognition app
Stars: ✭ 2,177 (+6120%)
Image2text📋 Python wrapper to grab text from images and save as text files using Tesseract Engine
Stars: ✭ 243 (+594.29%)
TesserocrA Python wrapper for the tesseract-ocr API
Stars: ✭ 1,567 (+4377.14%)
Signature extractorA super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.
Stars: ✭ 205 (+485.71%)
ImageToTextOCR with Google's AI technology (Cloud Vision API)
Stars: ✭ 30 (-14.29%)
Snipping-OcrA simple Snipping tool for Windows with OCR capabilities
Stars: ✭ 82 (+134.29%)
digdetA realtime digit OCR on the browser using Machine Learning
Stars: ✭ 22 (-37.14%)
DocTrThe official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
Stars: ✭ 202 (+477.14%)
deep-text-recognition-benchmarkProvide the OCR model in ONNX format so that the OpenCV DNN module can use them directly and correctly.
Stars: ✭ 32 (-8.57%)
ocr2textConvert a PDF via OCR to a TXT file in UTF-8 encoding
Stars: ✭ 90 (+157.14%)
video-subtitle-extractor视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
Stars: ✭ 1,763 (+4937.14%)
libcrowds-viewerA Vue component for crowdsourcing Web Annotations.
Stars: ✭ 22 (-37.14%)
webgrepGrep Web pages with extra features like JS deobfuscation and OCR
Stars: ✭ 86 (+145.71%)
tibetan-ocrPython OCR for Handwritten Tibetan Mauscripts
Stars: ✭ 19 (-45.71%)
paperbaseOpen source document organizer with automatic OCR and full text search
Stars: ✭ 21 (-40%)
Multi-Type-TD-TSRExtracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
Stars: ✭ 174 (+397.14%)
saramGet OCR in txt form from an image or pdf extension supporting multiple files from directory using pytesseract with auto rotation for wrong orientation. PYPI:
Stars: ✭ 51 (+45.71%)
Printed-Chinese-Character-OCRThis is a Chinese Character ocr system based on Deep learning (VGG like CNN neural net work),this rep include trainning set generating,image preprocesing,NN model optimizing based on Keras high level NN framwork
Stars: ✭ 21 (-40%)
i-librarian-freeI, Librarian - open-source version of a PDF managing SaaS.
Stars: ✭ 110 (+214.29%)
mementoOrganize your meme image cluster in a better format using OCR from the meme to sort them using tesseract along with editing memes by segmenting them using OpenCV within a directory
Stars: ✭ 70 (+100%)
SynthText ChineseModify from https://github.com/JarveeLee/SynthText_Chinese_version.git with python3 and cv3.
Stars: ✭ 35 (+0%)
Inventory KameraScans Genshin Impact characters, artifacts, and weapons from the game window into a JSON file.
Stars: ✭ 348 (+894.29%)
Shadow计算机基础知识,数据结构,设计模式,Tomcat中间件的实现
Stars: ✭ 19 (-45.71%)
omynote众山小笔记 - 集中管理你的读书笔记
Stars: ✭ 154 (+340%)
shape-context-ocrThe Shape Context is a shape descriptor that captures the relative positions of other points on the shape contours, and is used to recognize characters.
Stars: ✭ 20 (-42.86%)
kuzushiji-recognition5th place solution for the Kaggle Kuzushiji Recognition Challenge
Stars: ✭ 41 (+17.14%)