All Projects → rohanpillai20 → Table-Extractor-From-Image

rohanpillai20 / Table-Extractor-From-Image

Licence: Apache-2.0 license
This repository contains the code that extracts a table from an image and exports it to an Excel.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Table-Extractor-From-Image

How-to-use-tesseract-ocr-4.0-with-csharp
How to use Tesseract OCR 4.0 with C#
Stars: ✭ 60 (+30.43%)
Mutual labels:  ocr
ruzzle-solver
A python script that solves ruzzle boards
Stars: ✭ 46 (+0%)
Mutual labels:  ocr
ImageToText
OCR with Google's AI technology (Cloud Vision API)
Stars: ✭ 30 (-34.78%)
Mutual labels:  ocr
blog
技术资料日常积累(欢迎投稿)
Stars: ✭ 59 (+28.26%)
Mutual labels:  ocr
PlotDigitizer
A Python utility to digitize plots.
Stars: ✭ 64 (+39.13%)
Mutual labels:  data-extraction
Php-Google-Vision-Api
Google Vision Api for PHP (https://cloud.google.com/vision/)
Stars: ✭ 61 (+32.61%)
Mutual labels:  ocr
deep-text-recognition-benchmark
Provide the OCR model in ONNX format so that the OpenCV DNN module can use them directly and correctly.
Stars: ✭ 32 (-30.43%)
Mutual labels:  ocr
fakemenot
Application to check authenticity of Twitter screenshots. Written in Python 🐍
Stars: ✭ 29 (-36.96%)
Mutual labels:  ocr
veryfi-go
Go module for communicating with the Veryfi OCR API
Stars: ✭ 18 (-60.87%)
Mutual labels:  ocr
Shadow
计算机基础知识,数据结构,设计模式,Tomcat中间件的实现
Stars: ✭ 19 (-58.7%)
Mutual labels:  ocr
Printed-Chinese-Character-OCR
This is a Chinese Character ocr system based on Deep learning (VGG like CNN neural net work),this rep include trainning set generating,image preprocesing,NN model optimizing based on Keras high level NN framwork
Stars: ✭ 21 (-54.35%)
Mutual labels:  ocr
DocTr
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
Stars: ✭ 202 (+339.13%)
Mutual labels:  ocr
ingest-file
Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.
Stars: ✭ 40 (-13.04%)
Mutual labels:  ocr
Tess4Android
A new fork base on tess-two and Tesseract 4.0.0
Stars: ✭ 31 (-32.61%)
Mutual labels:  ocr
mirador-textoverlay
Text Overlay plugin for Mirador 3
Stars: ✭ 35 (-23.91%)
Mutual labels:  ocr
digdet
A realtime digit OCR on the browser using Machine Learning
Stars: ✭ 22 (-52.17%)
Mutual labels:  ocr
omynote
众山小笔记 - 集中管理你的读书笔记
Stars: ✭ 154 (+234.78%)
Mutual labels:  ocr
alfresco-simple-ocr
Simple OCR action for Alfresco
Stars: ✭ 40 (-13.04%)
Mutual labels:  ocr
kuzushiji-recognition
Kuzushiji Recognition Kaggle 2019. Build a DL model to transcribe ancient Kuzushiji into contemporary Japanese characters. Opening the door to a thousand years of Japanese culture.
Stars: ✭ 16 (-65.22%)
Mutual labels:  ocr
fastverse
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
Stars: ✭ 123 (+167.39%)
Mutual labels:  data-manipulation

Table Extractor From Image

This repository contains the code that extracts a table from an image and exports it to an Excel. To do this, the image is "read" by an OCR which provides a JSON output which is used as the input to the program. The program then arranges the cells row and column-wise as per the JSON input.

NOTE: Only those input cells read by the OCR will be displayed in the Excel.

Modules Required

os
copy
pandas==0.22.0
openpyxl==2.4.9
You can also use requirements.txt to install the packages. How? Follow this link.

Flow

Image -> JSON -> Excel

Steps

  1. First of all, install all the import packages specified in the requirements.txt
  2. For "reading" an image, use an OCR that converts the format to JSON.
  3. In the program, change the input path and output path according to your requirement.
  4. Run the program (JSON-to-Excel.py).

Sample Test Case

Input Image:

Input Image

It's Corresponding JSON:

JSON

Excel Output:

Excel Output

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].