yardstick17 / Image_text_reader

License: MIT
The module extracts text from images using the Tesseract OCR engine. Text in images is often blurred or unevenly sized, so the image is pre-processed to make it easier for the OCR engine to read. The module first draws bounding boxes around the text in the image and then normalizes each region to 300 dpi, a resolution suitable for the OCR engine.
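
The 300 dpi normalization described above comes down to rescaling each cropped text region. The following is a minimal sketch of that arithmetic only, not the project's actual code: the function name `size_at_target_dpi` and the idea that the source resolution is known up front are assumptions made for illustration.

```python
TARGET_DPI = 300  # resolution the description above says suits the OCR engine


def size_at_target_dpi(width_px, height_px, source_dpi, target_dpi=TARGET_DPI):
    """Return the (width, height) in pixels a region needs at target_dpi.

    Hypothetical helper: the physical size of the region stays the same,
    so the pixel dimensions scale by target_dpi / source_dpi.
    """
    if source_dpi <= 0:
        raise ValueError("source_dpi must be positive")
    scale = target_dpi / source_dpi
    return round(width_px * scale), round(height_px * scale)
```

For example, a 200 x 100 pixel crop scanned at 150 dpi would be resized to 400 x 200 pixels before being handed to the OCR engine. The resize itself would be done with an image library of your choice.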

Programming Languages

python

Projects that are alternatives or similar to Image text reader

Nkocr
🔎📝 A module for performing specialized OCR on food products and nutrition tables.
Stars: ✭ 15 (-84.54%)
Mutual labels:  ocr, tesseract-ocr
idcardocr
Recognition of second-generation resident ID card information in an offline environment
Stars: ✭ 358 (+269.07%)
Mutual labels:  ocr, tesseract-ocr
TesseractStudio.Net
A free Windows graphical interface to the Tesseract 4.0 OCR engine.
Stars: ✭ 38 (-60.82%)
Mutual labels:  ocr, tesseract-ocr
NLP-image-to-text
code to extract text from images
Stars: ✭ 28 (-71.13%)
Mutual labels:  ocr, tesseract-ocr
Gimagereader
A Gtk/Qt front-end to tesseract-ocr.
Stars: ✭ 786 (+710.31%)
Mutual labels:  ocr, tesseract-ocr
How-to-use-tesseract-ocr-4.0-with-csharp
How to use Tesseract OCR 4.0 with C#
Stars: ✭ 60 (-38.14%)
Mutual labels:  ocr, tesseract-ocr
BasicArabicOCR
A very basic Arabic OCR based on tesseract OCR engine written in Java.
Stars: ✭ 19 (-80.41%)
Mutual labels:  ocr, tesseract-ocr
Tesseract4java
Java GUI and Tools for Tesseract OCR
Stars: ✭ 214 (+120.62%)
Mutual labels:  ocr, tesseract-ocr
React Native Tesseract Ocr
Tesseract OCR wrapper for React Native
Stars: ✭ 384 (+295.88%)
Mutual labels:  ocr, tesseract-ocr
Ccextractor
CCExtractor - Official version maintained by the core team
Stars: ✭ 356 (+267.01%)
Mutual labels:  ocr, tesseract-ocr
ocreval
Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support
Stars: ✭ 48 (-50.52%)
Mutual labels:  ocr, tesseract-ocr
Blackout
NaNoGenMo 2016 entry #2
Stars: ✭ 36 (-62.89%)
Mutual labels:  ocr, tesseract-ocr
receipt-manager-app
Receipt parser application written in dart.
Stars: ✭ 140 (+44.33%)
Mutual labels:  ocr, tesseract-ocr
Textshot
Python tool for grabbing text via screenshot
Stars: ✭ 1,163 (+1098.97%)
Mutual labels:  ocr, tesseract-ocr
Image2text
📋 Python wrapper to grab text from images and save as text files using Tesseract Engine
Stars: ✭ 243 (+150.52%)
Mutual labels:  ocr, tesseract-ocr
breach-protocol-autosolver
Solve breach protocol minigame in second(s). Windows/Linux/GeForce Now/Google Stadia. Every language.
Stars: ✭ 28 (-71.13%)
Mutual labels:  ocr, tesseract-ocr
Text Detection
Text detection with mainly MSER and SWT
Stars: ✭ 167 (+72.16%)
Mutual labels:  ocr, tesseract-ocr
Tesseract
Bindings to Tesseract OCR engine for R
Stars: ✭ 192 (+97.94%)
Mutual labels:  ocr, tesseract-ocr
Idcardocr
Recognition of second-generation resident ID card information in an offline environment
Stars: ✭ 328 (+238.14%)
Mutual labels:  ocr, tesseract-ocr
Pyocr
A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
Stars: ✭ 932 (+860.82%)
Mutual labels:  ocr, tesseract-ocr

image_text_reader

It's a very basic tool for reading text from images, particularly images formatted like a restaurant menu.

Tesseract-ocr

This tool needs the tesseract-ocr engine. Install it as follows:

Linux

Tesseract is available directly from many Linux distributions. The package is generally called 'tesseract' or 'tesseract-ocr'; search your distribution's repositories to find it. For example, on Ubuntu 18.x (bionic) you can install Tesseract 4.x and its developer tools by running:

sudo apt install tesseract-ocr
sudo apt install libtesseract-dev

Refer to the Tesseract installation documentation for installation on all other systems.

macOS

Homebrew

To install Tesseract run this command:

brew install tesseract

The OCR engine reads the text from the text regions extracted from the full image.

Command to use:

Dockerized image reading

docker run -it yardstick17/image-text-reader bash -c "PYTHONPATH='.' python3 read_image.py read_text_from_local_image -f images/sample_image.jpg"

Read from url

PYTHONPATH='.' python3 read_image.py read_text_from_image_url -u https://marketplace.canva.com/MACHUlPU93Q/1/0/thumbnail_large/canva-peach-green-leaves-garden-vegetarian-pizza-menu-MACHUlPU93Q.jpg

[2017-07-07 16:20:34,119] INFO : Downloading image from url: https://marketplace.canva.com/MACHUlPU93Q/1/0/thumbnail_large/canva-peach-green-leaves-garden-vegeta
[2017-07-07 16:20:35,997] INFO : Saving file: /var/folders/cz/n3vkz7x91qs06nmm9byxxgz00000gr/T/tmpienrxu2c
[2017-07-07 16:20:35,997] INFO : Processing image for text Extraction
[2017-07-07 16:20:36,308] INFO : Removing noise and smoothening image
[2017-07-07 16:20:36,431] INFO : Reading the text inside the contour plotted
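
The log above shows the image being downloaded and saved to a temporary file before processing. A minimal sketch of that step using only the standard library might look like the following. The function name `download_to_temp_file` is hypothetical and this is not necessarily how read_image.py does it.

```python
import shutil
import tempfile
import urllib.request


def download_to_temp_file(url, suffix=""):
    """Download `url` and return the path of the temporary file holding it.

    Hypothetical helper. The file is created with delete=False so it
    survives after being closed; the caller removes it when done.
    """
    with urllib.request.urlopen(url) as response:
        with tempfile.NamedTemporaryFile(delete=False, suffix=suffix) as tmp:
            # Stream the response body to disk without loading it all in memory.
            shutil.copyfileobj(response, tmp)
            return tmp.name
```

The returned path can then be passed to whatever performs the pre-processing and OCR, and deleted afterwards with os.remove.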

Read from local image

PYTHONPATH='.' python3 read_image.py read_text_from_local_image -f images/sample_image.jpg

[2017-07-07 16:32:38,862] INFO : Processing image for text Extraction
[2017-07-07 16:32:39,232] INFO : Removing noise and smoothening image
[2017-07-07 16:32:39,442] INFO : Reading the text inside the contour plotted

Deploy an API for reading text from images

PYTHONPATH='.' python3 api/app.py

[2017-07-07 16:49:57,818] INFO :  * Running on http://0.0.0.0:6600/ (Press CTRL+C to quit)
[2017-07-07 16:49:57,820] INFO :  * Restarting with stat
[2017-07-07 16:49:58,712] WARNING :  * Debugger is active!
[2017-07-07 16:49:58,738] INFO :  * Debugger pin code: 316-405-633

A sample API is deployed on my tiny server. Please be patient with it.

curl -X POST \
  http://54.254.214.96/read_image_from_file/url \
  -F url=https://africatalentbank.com/wp-content/uploads/2014/10/Menu.jpg
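
The same request can be built from Python. The sketch below uses only the standard library and encodes the `url` field as multipart/form-data, which is what curl's -F option sends. The helper names are hypothetical, and the shape of the server's response is not documented here, so the sketch stops at building the request.

```python
import urllib.request
import uuid


def build_multipart_form(fields, boundary=None):
    """Encode plain text fields as multipart/form-data (like curl -F).

    Returns (body_bytes, content_type_header_value).
    """
    boundary = boundary or uuid.uuid4().hex
    lines = []
    for name, value in fields.items():
        lines.append(f"--{boundary}")
        lines.append(f'Content-Disposition: form-data; name="{name}"')
        lines.append("")  # blank line separates part headers from the value
        lines.append(value)
    lines.append(f"--{boundary}--")  # closing boundary
    lines.append("")
    body = "\r\n".join(lines).encode("utf-8")
    return body, f"multipart/form-data; boundary={boundary}"


def build_read_image_request(api_url, image_url):
    """Build (but do not send) the POST request shown in the curl command."""
    body, content_type = build_multipart_form({"url": image_url})
    return urllib.request.Request(
        api_url,
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
```

To send it, pass the request to urllib.request.urlopen and read the response body; with the endpoint above, api_url would be http://54.254.214.96/read_image_from_file/url.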

Digital Menu

[Images: Digital Image, Original Image]
