Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes. The image is pre-processed for better comprehension by OCR. This module first makes bounding box for text in images and then normalizes it to 300 dpi, suitable for OCR engine to read.

Stars: ✭ 97 (-3.96%)

Mutual labels: ocr

Penteract Ocr

⭐️ The native node.js bindings to the Tesseract OCR project.

Stars: ✭ 86 (-14.85%)

Mutual labels: ocr

Ngx Dynamic Form Builder

FormBuilder + class-transformer + class-validator = dynamic form group builder for Angular10+

Stars: ✭ 93 (-7.92%)

Mutual labels: transformer

Setr Pytorch

Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

Stars: ✭ 96 (-4.95%)

Mutual labels: transformer

Vision Transformer

Tensorflow implementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

Stars: ✭ 90 (-10.89%)

Mutual labels: transformer

Nanonets Ocr Sample Python

NanoNets OCR API Example for Python

Stars: ✭ 92 (-8.91%)

Mutual labels: ocr

View All Similar Projects ➔

2D Attentional Irregular Scene Text Recognizer

Unofficial PyTorch implementation of the paper, which transforms the irregular text with 2D layout to character sequence directly via 2D attentional scheme. They utilize a relation attention module to capture the dependencies of feature maps and a parallel attention module to decode all characters in parallel.

At present, the accuracy of the paper cannot be achieved. And i borrowed code from deep-text-recognition-benchmark

model

result
Test on ICDAR2019 with only 51.15%, will continue to improve.

Feature

Output image string once not like the seqtoseq model

Requirements

Pytorch >= 1.1.0

Test

download the pretrained model Baidu password: kdah.
test on images which in demo_image folder

python demo.py --image_folder demo_image --saved_model <model_path/best_accuracy.pth>

some examples

demo images	Bert_OCR
	available
	shakesshack
	london
	greenstead
	toast
	merry
	underground
	ronaldo
	bally
	university

result on benchmark data sets

IIIT5k_3000	SVT	IC03_860	IC03_867	IC13_857	IC13_1015	IC15_1811	IC15_2077	SVTP	CUTE80
84.367	79.907	91.860	91.465	88.448	86.010	65.654	63.215	68.527	81.185

total_accuracy: 78.423

Train

I prepared a small dataset for train.The image and labels are in ./dataset/BAIDU.

python train.py --root ./dataset/BAIDU/images/ --train_csv ./dataset/BAIDU/small_train.txt --val_csv ./dataset/BAIDU/small_train.txt

Reference

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 101

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (6) 🔗