cseas / Ocr Table
Licence: MIT
Extract tables from scanned image PDFs using Optical Character Recognition.
Stars: ✭ 165
Projects that are alternatives to or similar to Ocr Table
Swiftytesseract
A Swift wrapper around Tesseract for use in iOS, macOS, and Linux applications
Stars: ✭ 170 (+3.03%)
Mutual labels: ocr, tesseract, optical-character-recognition
Tesseract4android
Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.
Stars: ✭ 148 (-10.3%)
Mutual labels: ocr, tesseract, optical-character-recognition
Penteract Ocr
⭐️ The native node.js bindings to the Tesseract OCR project.
Stars: ✭ 86 (-47.88%)
Mutual labels: ocr, tesseract, optical-character-recognition
Pan card ocr project
To extract details from Indian National Identification Cards such as PAN (completed) & Aadhar, Passport, Driving License (WIP) in a structured format
Stars: ✭ 39 (-76.36%)
Mutual labels: ocr, tesseract, optical-character-recognition
IdCardRecognition
Android ID card recognition based on OCR.
Stars: ✭ 35 (-78.79%)
Mutual labels: ocr, tesseract, optical-character-recognition
Android Ocr
Experimental optical character recognition app
Stars: ✭ 2,177 (+1219.39%)
Mutual labels: ocr, tesseract, optical-character-recognition
Tesserocr
A Python wrapper for the tesseract-ocr API
Stars: ✭ 1,567 (+849.7%)
Mutual labels: ocr, tesseract, optical-character-recognition
Image2text
📋 Python wrapper to grab text from images and save as text files using Tesseract Engine
Stars: ✭ 243 (+47.27%)
Mutual labels: ocr, tesseract, optical-character-recognition
React Native Tesseract Ocr
Tesseract OCR wrapper for React Native
Stars: ✭ 384 (+132.73%)
Mutual labels: ocr, tesseract, optical-character-recognition
Swiftytesseractrte
SwiftyTesseract Real-Time Engine
Stars: ✭ 49 (-70.3%)
Mutual labels: ocr, tesseract, optical-character-recognition
Textshot
Python tool for grabbing text via screenshot
Stars: ✭ 1,163 (+604.85%)
Mutual labels: ocr, tesseract
Ocr Electron Vue
📇 A Simple OCR Application built on Electron, Vue.js & Tesseract.js
Stars: ✭ 67 (-59.39%)
Mutual labels: ocr, tesseract
Node Tesseract Ocr
A Node.js wrapper for the Tesseract OCR API
Stars: ✭ 92 (-44.24%)
Mutual labels: ocr, tesseract
Gosseract
Go package for OCR (Optical Character Recognition), by using Tesseract C++ library
Stars: ✭ 1,622 (+883.03%)
Mutual labels: ocr, tesseract
Idmatch
Match faces on id cards with OCR capabilities.
Stars: ✭ 52 (-68.48%)
Mutual labels: ocr, tesseract
Links Detector
📖 👆🏻 Links Detector makes printed links clickable via your smartphone camera. No need to type a link in, just scan and click on it.
Stars: ✭ 106 (-35.76%)
Mutual labels: ocr, tesseract
Tesseract Python
Examples of implementing OCR (Optical Character Recognition) with Tesseract in Python
Stars: ✭ 49 (-70.3%)
Mutual labels: ocr, tesseract
Tesseract
This package contains an OCR engine - libtesseract and a command line program - tesseract.
Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused
on line recognition, but also still supports the legacy Tesseract OCR engine of
Tesseract 3 which works by recognizing character patterns. Compatibility with
Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0).
It also needs traineddata files which support the legacy engine, for example
those from the tessdata repository.
Stars: ✭ 43,199 (+26081.21%)
Mutual labels: ocr, tesseract
Ocrtable
Recognize tables and text from scanned images that contain tables.
Stars: ✭ 155 (-6.06%)
Mutual labels: ocr, tesseract
Tesseract Ocr for windows
Visual Studio Projects for Tesseract and dependencies
Stars: ✭ 122 (-26.06%)
Mutual labels: ocr, tesseract
ocr-table
This project aims to extract tables from scanned image PDFs using Optical Character Recognition.
Install Requirements
- Tesseract OCR
  sudo apt-get install tesseract-ocr
- ImageMagick
  sudo apt-get install imagemagick
- PDF Utilities
  sudo apt-get install poppler-utils
- Python packages
  sudo pip install -r requirements.txt
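Before running the scripts, it can help to confirm that the command-line tools installed above are actually on the PATH. The following is a minimal sketch, not part of this project: it assumes the packages provide the executables `tesseract`, `convert` (ImageMagick) and `pdftoppm` (poppler-utils), and the helper name `missing_tools` is hypothetical.

```python
import shutil

# Executables assumed to be provided by the apt packages listed above.
REQUIRED_TOOLS = ["tesseract", "convert", "pdftoppm"]


def missing_tools(tools=REQUIRED_TOOLS):
    """Return the names in `tools` that cannot be found on the PATH."""
    return [name for name in tools if shutil.which(name) is None]
```

Calling `missing_tools()` with no arguments returns an empty list when every assumed executable is found; otherwise it lists the ones that still need to be installed.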
Usage
- Clear the pdf/ folder and copy all the PDF files to be scanned into it.
- Run the OCR:
  python3 shellocr.py
- The extracted text files will be available in the txt/ folder once the process completes.
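The steps above can be pictured as a two-stage pipeline: render each PDF page to an image, then run Tesseract on each image. The sketch below is an illustration under stated assumptions and is not the contents of shellocr.py: it uses `pdftoppm` from poppler-utils for rendering, the `tmp/` working folder is invented for the example, and the function names are hypothetical. The `--psm 6` page segmentation mode (treat the page as a single uniform block of text) is also an assumption that may or may not suit a given table.

```python
import subprocess
from pathlib import Path


def rasterize_command(pdf_path, image_prefix, dpi=300):
    """Build the pdftoppm call that renders each PDF page to a PNG."""
    return ["pdftoppm", "-r", str(dpi), "-png", str(pdf_path), str(image_prefix)]


def ocr_command(image_path, output_base):
    """Build the tesseract call; tesseract appends .txt to output_base."""
    # --psm 6: assume a single uniform block of text (an assumption here).
    return ["tesseract", str(image_path), str(output_base), "--psm", "6"]


def ocr_folder(pdf_dir="pdf", txt_dir="txt", work_dir="tmp"):
    """OCR every PDF in pdf_dir, writing one text file per page to txt_dir."""
    Path(txt_dir).mkdir(exist_ok=True)
    for pdf in sorted(Path(pdf_dir).glob("*.pdf")):
        # One working folder per PDF keeps page images from different files apart.
        pages_dir = Path(work_dir) / pdf.stem
        pages_dir.mkdir(parents=True, exist_ok=True)
        subprocess.run(rasterize_command(pdf, pages_dir / "page"), check=True)
        # pdftoppm names its output page-<n>.png inside pages_dir.
        for image in sorted(pages_dir.glob("page-*.png")):
            output_base = Path(txt_dir) / f"{pdf.stem}-{image.stem}"
            subprocess.run(ocr_command(image, output_base), check=True)
```

Running `ocr_folder()` from the project root would then leave files such as `txt/<name>-page-1.txt`. The command builders are kept separate from the loop so that the exact arguments can be inspected or adjusted (for example a different `dpi`) without invoking the tools.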
Alternate
- If the above doesn't work for you, try the alternate method.
- Save your file as input.pdf in the root directory.
- Run:
  python3 pdf_miner.py
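When a PDF already carries an embedded text layer, the text can be read directly without OCR. The sketch below shows one way to try that with `pdftotext` from the poppler-utils package installed earlier. It is an assumption-laden illustration and does not claim to reproduce what pdf_miner.py does; the function names and the `output.txt` file name are hypothetical.

```python
import subprocess


def pdftotext_command(pdf_path="input.pdf", txt_path="output.txt"):
    """Build a pdftotext call that keeps the physical page layout."""
    # -layout preserves column alignment, which helps keep table rows readable.
    return ["pdftotext", "-layout", str(pdf_path), str(txt_path)]


def extract_text_layer(pdf_path="input.pdf", txt_path="output.txt"):
    """Run pdftotext; raises CalledProcessError if the tool fails."""
    subprocess.run(pdftotext_command(pdf_path, txt_path), check=True)
    return txt_path
```

For a purely scanned PDF with no text layer, this approach would be expected to produce little or no text, in which case the OCR route remains the one to use.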