Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → cseas → Ocr Table

cseas / Ocr Table

Licence: mit

Extract tables from scanned image PDFs using Optical Character Recognition.

Programming Languages

139335 projects - #7 most used programming language

77523 projects

Labels

ocr tesseract optical-character-recognition

Projects that are alternatives of or similar to Ocr Table

Swiftytesseract

A Swift wrapper around Tesseract for use in iOS, macOS, and Linux applications

Stars: ✭ 170 (+3.03%)

Mutual labels: ocr, tesseract, optical-character-recognition

Tesseract4android

Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.

Stars: ✭ 148 (-10.3%)

Mutual labels: ocr, tesseract, optical-character-recognition

⭐️ The native node.js bindings to the Tesseract OCR project.

Stars: ✭ 86 (-47.88%)

Mutual labels: ocr, tesseract, optical-character-recognition

Pan card ocr project

To extract details from Indian National Identification Cards such as PAN (completed) & Aadhar, Passport, Driving License (WIP) in a structured format

Stars: ✭ 39 (-76.36%)

Mutual labels: ocr, tesseract, optical-character-recognition

IdCardRecognition

Android id card recognition based on OCR. 安卓基于OCR的身份证识别。

Stars: ✭ 35 (-78.79%)

Mutual labels: ocr, tesseract, optical-character-recognition

Experimental optical character recognition app

Stars: ✭ 2,177 (+1219.39%)

Mutual labels: ocr, tesseract, optical-character-recognition

A Python wrapper for the tesseract-ocr API

Stars: ✭ 1,567 (+849.7%)

Mutual labels: ocr, tesseract, optical-character-recognition

📋 Python wrapper to grab text from images and save as text files using Tesseract Engine

Stars: ✭ 243 (+47.27%)

Mutual labels: ocr, tesseract, optical-character-recognition

React Native Tesseract Ocr

Tesseract OCR wrapper for React Native

Stars: ✭ 384 (+132.73%)

Mutual labels: ocr, tesseract, optical-character-recognition

Swiftytesseractrte

SwiftyTesseract Real-Time Engine

Stars: ✭ 49 (-70.3%)

Mutual labels: ocr, tesseract, optical-character-recognition

Python tool for grabbing text via screenshot

Stars: ✭ 1,163 (+604.85%)

Mutual labels: ocr, tesseract

Ocr Electron Vue

📇 A Simple OCR Application built on Electron, Vue.js & Tesseract.js

Stars: ✭ 67 (-59.39%)

Mutual labels: ocr, tesseract

Node Tesseract Ocr

A Node.js wrapper for the Tesseract OCR API

Stars: ✭ 92 (-44.24%)

Mutual labels: ocr, tesseract

Go package for OCR (Optical Character Recognition), by using Tesseract C++ library

Stars: ✭ 1,622 (+883.03%)

Mutual labels: ocr, tesseract

Match faces on id cards with OCR capabilities.

Stars: ✭ 52 (-68.48%)

Mutual labels: ocr, tesseract

📖 👆🏻 Links Detector makes printed links clickable via your smartphone camera. No need to type a link in, just scan and click on it.

Stars: ✭ 106 (-35.76%)

Mutual labels: ocr, tesseract

Tesseract Python

Examples to implement OCR(Optical Character Recognition) using tesseract using Python

Stars: ✭ 49 (-70.3%)

Mutual labels: ocr, tesseract

This package contains an OCR engine - libtesseract and a command line program - tesseract. Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused on line recognition, but also still supports the legacy Tesseract OCR engine of Tesseract 3 which works by recognizing character patterns. Compatibility with Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0). It also needs traineddata files which support the legacy engine, for example those from the tessdata repository.

Stars: ✭ 43,199 (+26081.21%)

Mutual labels: ocr, tesseract

Recognize tables and text from scanned images that contain tables. 从包含表格的扫描图片中识别表格和文字

Stars: ✭ 155 (-6.06%)

Mutual labels: ocr, tesseract

Tesseract Ocr for windows

Visual Studio Projects for Tessearct and dependencies

Stars: ✭ 122 (-26.06%)

Mutual labels: ocr, tesseract

View All Similar Projects ➔

ocr-table

This project aims to extract tables from scanned image PDFs using Optical Character Recognition.

Install Requirements

Tesseract OCR
```
sudo apt-get install tesseract-ocr
```
Imagemagick
```
sudo apt-get install imagemagick
```
PDF Utilities
```
sudo apt-get install poppler-utils
```
Python packages
```
sudo pip install -r requirements.txt
```

Usage

Clear the pdf/ folder and copy all your pdf files to be scanned in it.
Run the OCR:
```
python3 shellocr.py
```
The scanned text files shall be available in the txt/ folder once the process completes.

Alternate

If the above doesn't work for you, try the alternate method.
Save your file as input.pdf in the root directory.
Run
```
python3 pdf_miner.py 
```

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 165

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (3) 🔗