All Projects → azmiozgen → Text Detection

azmiozgen / Text Detection

Licence: gpl-3.0
Text detection with mainly MSER and SWT

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Text Detection

React Native Tesseract Ocr
Tesseract OCR wrapper for React Native
Stars: ✭ 384 (+129.94%)
Mutual labels:  text-detection, ocr, tesseract-ocr
Differentiablebinarization
DB (Real-time Scene Text Detection with Differentiable Binarization) implementation in Keras and Tensorflow
Stars: ✭ 106 (-36.53%)
Mutual labels:  text-detection, ocr
Craft Pytorch
Official implementation of Character Region Awareness for Text Detection (CRAFT)
Stars: ✭ 2,220 (+1229.34%)
Mutual labels:  text-detection, ocr
Aadhaar Card Ocr
Extract text information from Aadhaar Card using tesseract-ocr 😎
Stars: ✭ 112 (-32.93%)
Mutual labels:  ocr, tesseract-ocr
Ctpn
Detecting Text in Natural Image with Connectionist Text Proposal Network (ECCV'16)
Stars: ✭ 1,220 (+630.54%)
Mutual labels:  text-detection, ocr
Keras Ctpn
keras复现场景文本检测网络CPTN: 《Detecting Text in Natural Image with Connectionist Text Proposal Network》;欢迎试用,关注,并反馈问题...
Stars: ✭ 89 (-46.71%)
Mutual labels:  text-detection, ocr
Tesseract
This package contains an OCR engine - libtesseract and a command line program - tesseract. Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused on line recognition, but also still supports the legacy Tesseract OCR engine of Tesseract 3 which works by recognizing character patterns. Compatibility with Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0). It also needs traineddata files which support the legacy engine, for example those from the tessdata repository.
Stars: ✭ 43,199 (+25767.66%)
Mutual labels:  ocr, tesseract-ocr
Pyocr
A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
Stars: ✭ 932 (+458.08%)
Mutual labels:  ocr, tesseract-ocr
Craft Remade
Implementation of CRAFT Text Detection
Stars: ✭ 127 (-23.95%)
Mutual labels:  text-detection, ocr
Craft keras
Keras implementation of Character Region Awareness for Text Detection (CRAFT)
Stars: ✭ 143 (-14.37%)
Mutual labels:  text-detection, ocr
Tedeval
TedEval: A Fair Evaluation Metric for Scene Text Detectors
Stars: ✭ 143 (-14.37%)
Mutual labels:  text-detection, ocr
Textshot
Python tool for grabbing text via screenshot
Stars: ✭ 1,163 (+596.41%)
Mutual labels:  ocr, tesseract-ocr
Ultimatemrz Sdk
Machine-readable zone/travel document (MRZ / MRTD) detector and recognizer using deep learning
Stars: ✭ 66 (-60.48%)
Mutual labels:  ocr, tesseract-ocr
Image text reader
The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes. The image is pre-processed for better comprehension by OCR. This module first makes bounding box for text in images and then normalizes it to 300 dpi, suitable for OCR engine to read.
Stars: ✭ 97 (-41.92%)
Mutual labels:  ocr, tesseract-ocr
Blackout
NaNoGenMo 2016 entry #2
Stars: ✭ 36 (-78.44%)
Mutual labels:  ocr, tesseract-ocr
Gosseract
Go package for OCR (Optical Character Recognition), by using Tesseract C++ library
Stars: ✭ 1,622 (+871.26%)
Mutual labels:  ocr, tesseract-ocr
Adelaidet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Stars: ✭ 2,565 (+1435.93%)
Mutual labels:  text-detection, ocr
Image Text Localization Recognition
A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集 シーンテキストの位置認識と識別のための論文リソースの要約
Stars: ✭ 788 (+371.86%)
Mutual labels:  text-detection, ocr
Gimagereader
A Gtk/Qt front-end to tesseract-ocr.
Stars: ✭ 786 (+370.66%)
Mutual labels:  ocr, tesseract-ocr
Tesseract Ocr for windows
Visual Studio Projects for Tessearct and dependencies
Stars: ✭ 122 (-26.95%)
Mutual labels:  ocr, tesseract-ocr

text-detection

This project aims to detect text regions in images using only image processing techniques with MSER (Maximally Stable Extremal Regions) and SWT (Stroke Width Transform). And also Tesseract-OCR tool is used optionally, as assistance to the algorithm.

Please cite the paper:

Özgen, A.C., Fasounaki, M. and Ekenel, H.K., 2018, May. Text detection in natural and computer-generated images. In 2018 26th Signal Processing and Communications Applications Conference (SIU) (pp. 1-4). IEEE.

INSTALLING

You can create conda environment with

conda env create -f requirements.txt

For OCR assistance, install Tesseract from package manager

sudo apt install tesseract-ocr

USAGE

Basic usage is

python detect.py -i <input-image>

You can give output file

python detect.py -i images/scenetext01.jpg -o <output-image>

More options available

python detect.py -i images/scenetext01.jpg -o <output-file> -d <light,dark,both,both+> -t

Option -i is image path, -o is output path, -d is SWT direction (default is both+), -t option chooses if Tesseract will be used. Normally Tesseract runs poorly if whole image given as input. But it is used as final decision of bounding boxes.

If you want to give whole image to Tesseract to see the impact of the algorithm, try this.

python detect.py -i images/scenetext01.jpg -f

For more detail (seeing intermediate steps), the usage given below is also available.

python detect.py -i images/scenetext01.jpg -d both+ -t --details

Sample Results

sample1

sample2

sample3

sample4

REFERENCES

B. Epshtein, E. Ofek, and Y. Wexler. Detecting text in natural scenes with stroke width transform. In 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pages 2963–2970, June 2010.

Á. González, L. M. Bergasa, J. J. Yebes, and S. Bronte. Text location in complex images. In Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), pages 617–620, Nov 2012.

Y. Li and H. Lu. Scene text detection via stroke width. In Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), pages 681–684, Nov 2012.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].