hOCR-to-ALTO: Convert between Tesseract hOCR and ALTO XML using XSL stylesheets
Stars: ✭ 40 (-71.83%)
ocrd cis: OCR-D Python tools
Stars: ✭ 28 (-80.28%)
ImageToText: OCR with Google's AI technology (Cloud Vision API)
Stars: ✭ 30 (-78.87%)
i-librarian-free: I, Librarian - open-source version of a PDF managing SaaS.
Stars: ✭ 110 (-22.54%)
vrpdr: Deep Learning Applied To Vehicle Registration Plate Detection and Recognition in PyTorch.
Stars: ✭ 36 (-74.65%)
omynote: 众山小笔记 (Zhongshanxiao Notes) - manage all your reading notes in one place
Stars: ✭ 154 (+8.45%)
ddddocr: 带带弟弟, a general-purpose CAPTCHA recognition OCR (PyPI version)
Stars: ✭ 4,093 (+2782.39%)
digdet: A real-time digit OCR in the browser using machine learning
Stars: ✭ 22 (-84.51%)
kuzushiji-recognition: Kuzushiji Recognition Kaggle 2019. Build a DL model to transcribe ancient Kuzushiji into contemporary Japanese characters. Opening the door to a thousand years of Japanese culture.
Stars: ✭ 16 (-88.73%)
tibetan-ocr: Python OCR for Handwritten Tibetan Manuscripts
Stars: ✭ 19 (-86.62%)
butterfly: Application transformation tool
Stars: ✭ 35 (-75.35%)
shape-context-ocr: The Shape Context is a shape descriptor that captures the relative positions of other points on the shape contours, and is used to recognize characters.
Stars: ✭ 20 (-85.92%)
ingest-file: Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.
Stars: ✭ 40 (-71.83%)
spydrnet: A flexible framework for analyzing and transforming FPGA netlists. Official repository.
Stars: ✭ 49 (-65.49%)
EverTranslator: Translate text anytime and everywhere, even while you are gaming!
Stars: ✭ 59 (-58.45%)
CLPR.pytorch: End-to-End Chinese License Plate Recognition
Stars: ✭ 75 (-47.18%)
veryfi-go: Go module for communicating with the Veryfi OCR API
Stars: ✭ 18 (-87.32%)
anovos: Anovos - An open-source library for scalable feature engineering using Apache Spark
Stars: ✭ 77 (-45.77%)
form-segmentation: Let's explore how we can extract text from forms
Stars: ✭ 42 (-70.42%)
kitodo-presentation: Kitodo.Presentation is a feature-rich framework for building a METS- or IIIF-based digital library. It is part of the Kitodo Digital Library Suite.
Stars: ✭ 33 (-76.76%)
gintonic: A declarative transformation language for GraphQL 🍸
Stars: ✭ 27 (-80.99%)
fakemenot: Application to check authenticity of Twitter screenshots. Written in Python 🐍
Stars: ✭ 29 (-79.58%)
deep-text-recognition-benchmark: Provides the OCR models in ONNX format so that the OpenCV DNN module can use them directly and correctly.
Stars: ✭ 32 (-77.46%)
wrangle: A data transformation package for deep learning with Autonomio, Keras and TensorFlow.
Stars: ✭ 15 (-89.44%)
Inventory Kamera: Scans Genshin Impact characters, artifacts, and weapons from the game window into a JSON file.
Stars: ✭ 348 (+145.07%)
Shadow: Computer science fundamentals, data structures, design patterns, and an implementation of the Tomcat middleware
Stars: ✭ 19 (-86.62%)
tesseract-unity: Standalone OCR plugin for Unity using Tesseract
Stars: ✭ 35 (-75.35%)
blinkid-in-browser: BlinkID In-browser SDK for WebAssembly-enabled browsers.
Stars: ✭ 40 (-71.83%)
Transformer-ocr: Handwritten text recognition using transformers.
Stars: ✭ 92 (-35.21%)
erpnext ocr: 🐍 ⚗️ Optical Character Recognition using Tesseract within Frappe.
Stars: ✭ 58 (-59.15%)
ruzzle-solver: A Python script that solves Ruzzle boards
Stars: ✭ 46 (-67.61%)
crnn.mxnet: CRNN in MXNet. Can be trained with Chinese characters.
Stars: ✭ 47 (-66.9%)
extract-information-from-identity-card: Given an identity card image, this repo detects the 4 corners, aligns the card with OpenCV, then detects words in the image and recognizes them with Transformer OCR.
Stars: ✭ 81 (-42.96%)
Snipping-Ocr: A simple Snipping tool for Windows with OCR capabilities
Stars: ✭ 82 (-42.25%)
DocTr: The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
Stars: ✭ 202 (+42.25%)
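video-subtitle-extractor: Extracts hard-coded subtitles from videos and generates srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that includes subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files.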
Stars: ✭ 1,763 (+1141.55%)
webgrep: Grep Web pages with extra features like JS deobfuscation and OCR
Stars: ✭ 86 (-39.44%)
paperbase: Open source document organizer with automatic OCR and full text search
Stars: ✭ 21 (-85.21%)
ocr2text: Convert a PDF via OCR to a TXT file in UTF-8 encoding
Stars: ✭ 90 (-36.62%)
Printed-Chinese-Character-OCR: This is a Chinese character OCR system based on deep learning (a VGG-like CNN). The repo includes training set generation, image preprocessing, and NN model optimization based on the Keras high-level NN framework.
Stars: ✭ 21 (-85.21%)
Table-Extractor-From-Image: This repository contains the code that extracts a table from an image and exports it to an Excel file.
Stars: ✭ 46 (-67.61%)
Multi-Type-TD-TSR: Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition.
Stars: ✭ 174 (+22.54%)
blog: Day-to-day collection of technical material (submissions welcome)
Stars: ✭ 59 (-58.45%)
nimtesseract: A Tesseract OCR wrapper for Nim
Stars: ✭ 23 (-83.8%)
icccig: Generate images of iTunes card content codes (for readable OCR).
Stars: ✭ 14 (-90.14%)