All Projects → Pdfocr → Similar Projects or Alternatives

815 Open source projects that are alternatives of or similar to Pdfocr

Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database

Stars: ✭ 165 (-42.51%)

Mutual labels: pdf, ocr

Scanbot Sdk Example Android

Document scanning SDK example apps for the Scanbot SDK for Android.

Stars: ✭ 67 (-76.66%)

Mutual labels: pdf, ocr

Lambda Text Extractor

AWS Lambda functions to extract text from various binary formats.

Stars: ✭ 159 (-44.6%)

Mutual labels: pdf, ocr

Papermerge

Open Source Document Management System for Digital Archives (Scanned Documents)

Stars: ✭ 1,177 (+310.1%)

Mutual labels: pdf, ocr

Mybox

Easy tools of document, image, file, network, location, color, and media.

Stars: ✭ 45 (-84.32%)

Mutual labels: pdf, ocr

Pdftabextract

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

Stars: ✭ 1,969 (+586.06%)

Mutual labels: pdf, ocr

Remarks

Extract highlights, scribbles, and annotations from PDFs marked with the reMarkable tablet. Export to Markdown, PDF, PNG, and SVG

Stars: ✭ 94 (-67.25%)

Mutual labels: pdf, ocr

Ocrmypdf

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Stars: ✭ 5,549 (+1833.45%)

Mutual labels: pdf, ocr

Ambar

🔍 Ambar: Document Search Engine

Stars: ✭ 1,829 (+537.28%)

Mutual labels: pdf, ocr

Docspell

Assist in organizing your piles of documents, resulting from scanners, e-mails and other sources with miminal effort.

Stars: ✭ 303 (+5.57%)

Mutual labels: pdf, ocr

Open Paperless

Scan, index, and archive all of your paper documents (acquired by Mayan EDMS)

Stars: ✭ 2,538 (+784.32%)

Mutual labels: pdf, ocr

Paperwork

Personal document manager (Linux/Windows) -- Moved to Gnome's Gitlab

Stars: ✭ 2,392 (+733.45%)

Mutual labels: pdf, ocr

Mayan Edms

Free Open Source Document Management System (mirror, no pull request or issues)

Stars: ✭ 226 (-21.25%)

Mutual labels: pdf, ocr

Parsr

Transforms PDF, Documents and Images into Enriched Structured Data

Stars: ✭ 2,736 (+853.31%)

Mutual labels: pdf, ocr

ocr

Simple app to extract text from pictures using Tesseract

Stars: ✭ 98 (-65.85%)

Mutual labels: ocr

Pdftilecut

pdftilecut lets you sub-divide a PDF page(s) into smaller pages so you can print them on small form printers.

Stars: ✭ 258 (-10.1%)

Mutual labels: pdf

PRLib

Pre-Recognition Library - library with algorithms for improving OCR quality.

Stars: ✭ 22 (-92.33%)

Mutual labels: ocr

staff identity card ocr project

Staff Identity Card OCR Project

Stars: ✭ 15 (-94.77%)

Mutual labels: ocr

Pdf

Rust library to read, manipulate and write PDF files.

Stars: ✭ 265 (-7.67%)

Mutual labels: pdf

Boxable

Boxable is a library that can be used to easily create tables in pdf documents.

Stars: ✭ 253 (-11.85%)

Mutual labels: pdf

OCR-Reader

An Android app to extract text from camera preview directly.

Stars: ✭ 43 (-85.02%)

Mutual labels: ocr

Android-Text-Scanner

Read text and numbers with android camera OCR

Stars: ✭ 27 (-90.59%)

Mutual labels: ocr

attentionocr

Attention OCR in Tensorflow 2.0

Stars: ✭ 45 (-84.32%)

Mutual labels: ocr

Iron-OCR-Image-to-Text-in-CSharp

Image to Text Tutorial in C# - See https://ironsoftware.com/csharp/ocr/tutorials/how-to-read-text-from-an-image-in-csharp-net/

Stars: ✭ 65 (-77.35%)

Mutual labels: ocr

Seven-Segment-OCR

Computer vision project to automatically recognize digits characters in a seven-segments display

Stars: ✭ 58 (-79.79%)

Mutual labels: ocr

Quickbill

Create unlimited invoices for free.

Stars: ✭ 278 (-3.14%)

Mutual labels: pdf

Deck

Slide Decks

Stars: ✭ 261 (-9.06%)

Mutual labels: pdf

BasicArabicOCR

A very basic Arabic OCR based on tesseract OCR engine written in Java.

Stars: ✭ 19 (-93.38%)

Mutual labels: ocr

tutorials

Git Repo for Articles on Ergo Sum blog and the youtube channel https://www.youtube.com/channel/UCiie9CN--dazA7iT2sry5FA

Stars: ✭ 42 (-85.37%)

Mutual labels: ocr

ScreenAccess

Anti Recoil system with weapon type built-in recognition based on OCR, currently support next games: Apex Legends

Stars: ✭ 41 (-85.71%)

Mutual labels: ocr

Ocr Corrector

利用语言模型，纠正OCR识别错误

Stars: ✭ 259 (-9.76%)

Mutual labels: ocr

pdf2xml-viewer

A simple viewer and inspection tool for text boxes in PDF documents

Stars: ✭ 82 (-71.43%)

Mutual labels: ocr

Thinreports Generator

Report Generator for Ruby

Stars: ✭ 268 (-6.62%)

Mutual labels: pdf

tesseract-server

A small lightweight HTTP server that converts photos, images and scanned documents to text using optical character recognition by utilizing the power of Google Tesseract.

Stars: ✭ 15 (-94.77%)

Mutual labels: ocr

Cloud Reports

Scans your AWS cloud resources and generates reports. Check out free hosted version:

Stars: ✭ 255 (-11.15%)

Mutual labels: pdf

VehicleInfoOCR

Use your camera to read number plates and obtain vehicle details. Simple, ad-free and faster alternative to existing playstore apps

Stars: ✭ 35 (-87.8%)

Mutual labels: ocr

Attention ocr.pytorch

This repository implements the the encoder and decoder model with attention model for OCR

Stars: ✭ 278 (-3.14%)

Mutual labels: ocr

screenshot-actions

Dunst actions for screenshots (OCR, upload to 0x0.st, delete, rename, move to/from clipboard)

Stars: ✭ 49 (-82.93%)

Mutual labels: ocr

idcardocr

离线环境下第二代居民身份证信息识别

Stars: ✭ 358 (+24.74%)

Mutual labels: ocr

easyocr

easy to ocr

Stars: ✭ 49 (-82.93%)

Mutual labels: ocr

Ionic Ocr Example

📷 Simple Ionic app using ocrad.js

Stars: ✭ 263 (-8.36%)

Mutual labels: ocr

scanbot-sdk-example-ionic

Scanbot scanner SDK example app for Ionic with Cordova.

Stars: ✭ 24 (-91.64%)

Mutual labels: ocr

meltsub

Convert hardsub to softsub

Stars: ✭ 19 (-93.38%)

Mutual labels: ocr

PSENet-Tensorflow

TensorFlow implementation of PSENet text detector (Shape Robust Text Detection with Progressive Scale Expansion Networkt)

Stars: ✭ 51 (-82.23%)

Mutual labels: ocr

Reptile

爬取机械工业出版社所有的计算机方面的书

Stars: ✭ 282 (-1.74%)

Mutual labels: pdf

TextBoxGAN

Generate text boxes from input words with a GAN.

Stars: ✭ 50 (-82.58%)

Mutual labels: ocr

smart-docs-parser

An OCR based document parser to extract information from identity document images

Stars: ✭ 14 (-95.12%)

Mutual labels: ocr

deep-text-recognition-benchmark

PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)

Stars: ✭ 123 (-57.14%)

Mutual labels: ocr

granblue-automation-android

Educational application written in Kotlin aimed at automating user-defined workflows for the mobile game, "Granblue Fantasy", using MediaProjection, AccessibilityService, and OpenCV.

Stars: ✭ 26 (-90.94%)

Mutual labels: ocr

Tableexport

tableExport（table导出文件，支持json、csv、txt、xml、word、excel、image、pdf）

Stars: ✭ 261 (-9.06%)

Mutual labels: pdf

breach-protocol-autosolver

Solve breach protocol minigame in second(s). Windows/Linux/GeForce Now/Google Stadia. Every language.

Stars: ✭ 28 (-90.24%)

Mutual labels: ocr

go-ocr

A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.

Stars: ✭ 31 (-89.2%)

Mutual labels: ocr

python-ocr-example

The code for the blogpost A Python Approach to Character Recognition

Stars: ✭ 54 (-81.18%)

Mutual labels: ocr

CTC-OCR

A TensorFlow implementation of hybird CNN-LSTM model with CTC loss for OCR problem

Stars: ✭ 27 (-90.59%)

Mutual labels: ocr

doctr-tfjs-demo

Javascript demo of docTR, powered by TensorFlowJS

Stars: ✭ 21 (-92.68%)

Mutual labels: ocr

ibm-cloud-functions-serverless-ocr-openchecks

Serverless bank check deposit processing with object storage and optical character recognition using Apache OpenWhisk powered by IBM Cloud Functions. See the Tech Talk replay for a demo.

Stars: ✭ 40 (-86.06%)

Mutual labels: ocr

Starter Book

A book starter to kickstart your writing journey 🎉

Stars: ✭ 277 (-3.48%)

Mutual labels: pdf

Uxmpdfkit

An iOS PDF viewer and annotator written in Swift that can be embedded into any application.

Stars: ✭ 260 (-9.41%)

Mutual labels: pdf

namsel

An OCR application focused on machine-print Tibetan text