A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.

Stars: ✭ 205 (-96.31%)

Mutual labels: image-processing, ocr

pmOCR

A wrapper for the Tesseract and ABBYY OCR11 (ocr4linux FineReader) CLIs that can perform batch operations or monitor a directory and launch an OCR conversion on file activity

Stars: ✭ 53 (-99.04%)

Mutual labels: ocr, tesseract

Papermerge

Open Source Document Management System for Digital Archives (Scanned Documents)

Stars: ✭ 1,177 (-78.79%)

Mutual labels: pdf, ocr

Koreader Base

Base framework offering a Lua scriptable environment for creating document readers

Stars: ✭ 81 (-98.54%)

Mutual labels: pdf, tesseract

Paperwork

Personal document manager (Linux/Windows) -- Moved to GNOME's GitLab

Stars: ✭ 2,392 (-56.89%)

Mutual labels: pdf, ocr

Open Paperless

Scan, index, and archive all of your paper documents (acquired by Mayan EDMS)

Stars: ✭ 2,538 (-54.26%)

Mutual labels: pdf, ocr

Tessdata

Trained models with support for the legacy and LSTM OCR engines

Stars: ✭ 4,173 (-24.8%)

Mutual labels: ocr, tesseract

Libvips

A fast image processing library with low memory needs.

Stars: ✭ 6,094 (+9.82%)

Mutual labels: image-processing, pdf

Scene Text Recognition

Scene text detection and recognition based on Extremal Regions (ER)

Stars: ✭ 146 (-97.37%)

Mutual labels: image-processing, ocr

Govips

A lightning fast image processing and resizing library for Go

Stars: ✭ 442 (-92.03%)

Mutual labels: image-processing, pdf

erpnext ocr

🐍 ⚗️ Optical Character Recognition using tesseract within Frappe.

Stars: ✭ 58 (-98.95%)

Mutual labels: ocr, tesseract

How-to-use-tesseract-ocr-4.0-with-csharp

How to use Tesseract OCR 4.0 with C#

Stars: ✭ 60 (-98.92%)

Mutual labels: ocr, tesseract

tesseract-unity

Standalone OCR plugin for Unity using Tesseract

Stars: ✭ 35 (-99.37%)

Mutual labels: ocr, tesseract

textocry

Textocry - Copy text from Images (chrome extension)

Stars: ✭ 29 (-99.48%)

Mutual labels: ocr, tesseract

nimtesseract

A Tesseract OCR wrapper for Nim

Stars: ✭ 23 (-99.59%)

Mutual labels: ocr, tesseract

LaraOCR

Laravel Optical Character Reader (OCR) package using OCR engines like Tesseract

Stars: ✭ 88 (-98.41%)

Mutual labels: ocr, tesseract

IdCardRecognition

Android ID card recognition based on OCR.

Stars: ✭ 35 (-99.37%)

Mutual labels: ocr, tesseract

Mybox

Easy tools for documents, images, files, network, location, color, and media.

Stars: ✭ 45 (-99.19%)

Mutual labels: pdf, ocr

Tesstrain

Train Tesseract LSTM with make

Stars: ✭ 251 (-95.48%)

Mutual labels: ocr, tesseract

Remarks

Extract highlights, scribbles, and annotations from PDFs marked with the reMarkable tablet. Export to Markdown, PDF, PNG, and SVG

Stars: ✭ 94 (-98.31%)

Mutual labels: pdf, ocr

Image2text

📋 Python wrapper to grab text from images and save as text files using Tesseract Engine

Stars: ✭ 243 (-95.62%)

Mutual labels: ocr, tesseract

React Native Tesseract Ocr

Tesseract OCR wrapper for React Native

Stars: ✭ 384 (-93.08%)

Mutual labels: ocr, tesseract

Open Semantic Etl

Python-based open source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (entity extraction and named entity recognition), and data enrichment (annotation) pipelines, with an ingestor to a Solr or Elasticsearch index and a linked data graph database

Stars: ✭ 165 (-97.03%)

Mutual labels: pdf, ocr

Tessdata fast

Fast integer versions of trained LSTM models

Stars: ✭ 221 (-96.02%)

Mutual labels: ocr, tesseract

Prlib

Pre-Recognition Library - library with algorithms for improving OCR quality.

Stars: ✭ 18 (-99.68%)

Mutual labels: image-processing, ocr

Dmsmsgrcg

A photo OCR project that aims to output DMS messages contained in sign structure images.

Stars: ✭ 18 (-99.68%)

Mutual labels: image-processing, ocr

ocr

Simple app to extract text from pictures using Tesseract

Stars: ✭ 98 (-98.23%)

Mutual labels: ocr, tesseract

breach-protocol-autosolver

Solve breach protocol minigame in second(s). Windows/Linux/GeForce Now/Google Stadia. Every language.

Stars: ✭ 28 (-99.5%)

Mutual labels: ocr, tesseract

Docspell

Assists in organizing your piles of documents from scanners, e-mails, and other sources with minimal effort.

Stars: ✭ 303 (-94.54%)

Mutual labels: pdf, ocr

Easyocr

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Stars: ✭ 13,379 (+141.11%)

Mutual labels: image-processing, ocr

ScribeBot

A highly scriptable automation system full of cool features. Automate everything with a little bit of Lua.

Stars: ✭ 72 (-98.7%)

Mutual labels: ocr, tesseract

Tesseract

Bindings to Tesseract OCR engine for R

Stars: ✭ 192 (-96.54%)

Mutual labels: ocr, tesseract

ocr2text

Convert a PDF via OCR to a TXT file in UTF-8 encoding

Stars: ✭ 90 (-98.38%)

Mutual labels: ocr, tesseract

saram

Get OCR output in TXT form from image or PDF files, with support for multiple files from a directory, using pytesseract with auto-rotation for wrong orientation. PYPI:

Stars: ✭ 51 (-99.08%)

Mutual labels: ocr, tesseract

ruzzle-solver

A python script that solves ruzzle boards

Stars: ✭ 46 (-99.17%)

Mutual labels: ocr, tesseract

memento

Organize a directory of meme images by sorting them on text extracted from each meme with Tesseract OCR, and edit memes by segmenting them with OpenCV

Stars: ✭ 70 (-98.74%)

Mutual labels: ocr, tesseract

OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Stars: ✭ 6,560 (+18.22%)

Mutual labels: ocr, tesseract

Nkocr

🔎📝 A module for performing specialized OCR on food products and nutritional tables.

Stars: ✭ 15 (-99.73%)

Mutual labels: ocr, tesseract

tesseract-ocr

Node.js wrapper for Tesseract OCR CLI.