Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → whai362 → Pan_pp.pytorch

whai362 / Pan_pp.pytorch

Licence: apache-2.0

Official implementations of PSENet, PAN and PAN++.

Programming Languages

139335 projects - #7 most used programming language

Labels

text-recognition text-detection pan

Projects that are alternatives of or similar to Pan pp.pytorch

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Stars: ✭ 1,409 (+899.29%)

Mutual labels: text-recognition, text-detection

Awesome Scene Text Recognition

A curated list of resources dedicated to scene text localization and recognition

Stars: ✭ 1,637 (+1060.99%)

Mutual labels: text-detection, text-recognition

awesome-scene-text

A curated list of papers and resources for scene text detection and recognition

Stars: ✭ 43 (-69.5%)

Mutual labels: text-recognition, text-detection

Awesome Deep Text Detection Recognition

A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.

Stars: ✭ 2,282 (+1518.44%)

Mutual labels: text-detection, text-recognition

Image Text Localization Recognition

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集シーンテキストの位置認識と識別のための論文リソースの要約

Stars: ✭ 788 (+458.87%)

Mutual labels: text-detection, text-recognition

No description or website provided.

Stars: ✭ 68 (-51.77%)

Mutual labels: text-recognition, text-detection

A research project for text detection and recognition using PyTorch 1.2.

Stars: ✭ 332 (+135.46%)

Mutual labels: text-detection, text-recognition

A pure pytorch implemented ocr project including text detection and recognition

Stars: ✭ 196 (+39.01%)

Mutual labels: text-detection, text-recognition

Total Text Dataset

Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.

Stars: ✭ 580 (+311.35%)

Mutual labels: text-detection, text-recognition

React Native Tesseract Ocr

Tesseract OCR wrapper for React Native

Stars: ✭ 384 (+172.34%)

Mutual labels: text-detection, text-recognition

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Stars: ✭ 2,565 (+1719.15%)

Mutual labels: text-detection, text-recognition

Chinese Text Detection And Recognition

Assignment of Image Analysis and Understanding

Stars: ✭ 53 (-62.41%)

Mutual labels: text-detection, text-recognition

Chineseaddress ocr

Photographing Chinese-Address OCR implemented using CTPN+CTC+Address Correction. 拍照文档中文地址文字识别。

Stars: ✭ 309 (+119.15%)

Mutual labels: text-detection, text-recognition

Awesome Ocr Resources

A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).

Stars: ✭ 335 (+137.59%)

Mutual labels: text-detection, text-recognition

Training extensions

Trainable models and NN optimization tools

Stars: ✭ 857 (+507.8%)

Mutual labels: text-detection, text-recognition

CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks

Stars: ✭ 92 (-34.75%)

Mutual labels: text-detection, text-recognition

keras复现场景文本检测网络CPTN: 《Detecting Text in Natural Image with Connectionist Text Proposal Network》；欢迎试用，关注，并反馈问题...

Stars: ✭ 89 (-36.88%)

Mutual labels: text-detection

Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection

Stars: ✭ 116 (-17.73%)

Mutual labels: text-recognition

Text Detection Using Yolo Algorithm In Keras Tensorflow

Implemented the YOLO algorithm for scene text detection in keras-tensorflow (No object detection API used) The code can be tweaked to train for a different object detection task using YOLO.

Stars: ✭ 87 (-38.3%)

Mutual labels: text-detection

Ios11 Visionframework

Vision Framework IOS WWDC 2017

Stars: ✭ 85 (-39.72%)

Mutual labels: text-detection

View All Similar Projects ➔

Introduction

Official Pytorch implementations of PSENet [1], PAN [2] and PAN++ [3].

[1] W. Wang, E. Xie, X. Li, W. Hou, T. Lu, G. Yu, and S. Shao. Shape robust text detection with progressive scale expansion network. In Proc. IEEE Conf. Comp. Vis. Patt. Recogn., pages 9336–9345, 2019.
[2] W. Wang, E. Xie, X. Song, Y. Zang, W. Wang, T. Lu, G. Yu, and C. Shen. Efficient and accurate arbitrary-shaped text detection with pixel aggregation network. In Proc. IEEE Int. Conf. Comp. Vis., pages 8440–8449, 2019.
[3] Paper is in preparation.

This repository only contains PAN now, PSENet and PAN++ are on the way.

Recommended environment

Python 3.6+
Pytorch 1.1.0
torchvision 0.3
mmcv 0.2.12
editdistance
Polygon3
pyclipper
opencv-python 3.4.2.17
Cython

Install

pip install -r requirement.txt
./compile.sh

Training

CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py ${CONFIG_FILE}

For example:

CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py config/pan/pan_r18_ic15.py

Test

python test.py ${CONFIG_FILE} ${CHECKPOINT_FILE}

For example:

python test.py config/pan/pan_r18_ic15.py checkpoints/pan_r18_ic15/checkpoint.pth.tar

Speed

python test.py ${CONFIG_FILE} ${CHECKPOINT_FILE} --report_speed

For example:

python test.py config/pan/pan_r18_ic15.py checkpoints/pan_r18_ic15/checkpoint.pth.tar --report_speed

Evaluation

See eval.

Benchmark and model zoo

PAN

Todo:

PSENet
PAN++

Citation

@inproceedings{wang2019shape,
  title={Shape robust text detection with progressive scale expansion network},
  author={Wang, Wenhai and Xie, Enze and Li, Xiang and Hou, Wenbo and Lu, Tong and Yu, Gang and Shao, Shuai},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={9336--9345},
  year={2019}
}

@inproceedings{wang2019efficient,
  title={Efficient and accurate arbitrary-shaped text detection with pixel aggregation network},
  author={Wang, Wenhai and Xie, Enze and Song, Xiaoge and Zang, Yuhang and Wang, Wenjia and Lu, Tong and Yu, Gang and Shen, Chunhua},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  pages={8440--8449},
  year={2019}
}

License

This project is released under the Apache 2.0 license.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 141

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗