All Projects → whai362 → Pan_pp.pytorch

whai362 / Pan_pp.pytorch

Licence: apache-2.0
Official implementations of PSENet, PAN and PAN++.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Pan pp.pytorch

doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Stars: ✭ 1,409 (+899.29%)
Mutual labels:  text-recognition, text-detection
Awesome Scene Text Recognition
A curated list of resources dedicated to scene text localization and recognition
Stars: ✭ 1,637 (+1060.99%)
Mutual labels:  text-detection, text-recognition
awesome-scene-text
A curated list of papers and resources for scene text detection and recognition
Stars: ✭ 43 (-69.5%)
Mutual labels:  text-recognition, text-detection
Awesome Deep Text Detection Recognition
A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.
Stars: ✭ 2,282 (+1518.44%)
Mutual labels:  text-detection, text-recognition
Image Text Localization Recognition
A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集 シーンテキストの位置認識と識別のための論文リソースの要約
Stars: ✭ 788 (+458.87%)
Mutual labels:  text-detection, text-recognition
AE TextSpotter
No description or website provided.
Stars: ✭ 68 (-51.77%)
Mutual labels:  text-recognition, text-detection
Megreader
A research project for text detection and recognition using PyTorch 1.2.
Stars: ✭ 332 (+135.46%)
Mutual labels:  text-detection, text-recognition
Ocr.pytorch
A pure pytorch implemented ocr project including text detection and recognition
Stars: ✭ 196 (+39.01%)
Mutual labels:  text-detection, text-recognition
Total Text Dataset
Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.
Stars: ✭ 580 (+311.35%)
Mutual labels:  text-detection, text-recognition
React Native Tesseract Ocr
Tesseract OCR wrapper for React Native
Stars: ✭ 384 (+172.34%)
Mutual labels:  text-detection, text-recognition
Adelaidet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Stars: ✭ 2,565 (+1719.15%)
Mutual labels:  text-detection, text-recognition
Chinese Text Detection And Recognition
Assignment of Image Analysis and Understanding
Stars: ✭ 53 (-62.41%)
Mutual labels:  text-detection, text-recognition
Chineseaddress ocr
Photographing Chinese-Address OCR implemented using CTPN+CTC+Address Correction. 拍照文档中文地址文字识别。
Stars: ✭ 309 (+119.15%)
Mutual labels:  text-detection, text-recognition
Awesome Ocr Resources
A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).
Stars: ✭ 335 (+137.59%)
Mutual labels:  text-detection, text-recognition
Training extensions
Trainable models and NN optimization tools
Stars: ✭ 857 (+507.8%)
Mutual labels:  text-detection, text-recognition
Cleval
CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks
Stars: ✭ 92 (-34.75%)
Mutual labels:  text-detection, text-recognition
Keras Ctpn
keras复现场景文本检测网络CPTN: 《Detecting Text in Natural Image with Connectionist Text Proposal Network》;欢迎试用,关注,并反馈问题...
Stars: ✭ 89 (-36.88%)
Mutual labels:  text-detection
Sightseq
Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection
Stars: ✭ 116 (-17.73%)
Mutual labels:  text-recognition
Text Detection Using Yolo Algorithm In Keras Tensorflow
Implemented the YOLO algorithm for scene text detection in keras-tensorflow (No object detection API used) The code can be tweaked to train for a different object detection task using YOLO.
Stars: ✭ 87 (-38.3%)
Mutual labels:  text-detection
Ios11 Visionframework
Vision Framework IOS WWDC 2017
Stars: ✭ 85 (-39.72%)
Mutual labels:  text-detection

Introduction

Official Pytorch implementations of PSENet [1], PAN [2] and PAN++ [3].

[1] W. Wang, E. Xie, X. Li, W. Hou, T. Lu, G. Yu, and S. Shao. Shape robust text detection with progressive scale expansion network. In Proc. IEEE Conf. Comp. Vis. Patt. Recogn., pages 9336–9345, 2019.
[2] W. Wang, E. Xie, X. Song, Y. Zang, W. Wang, T. Lu, G. Yu, and C. Shen. Efficient and accurate arbitrary-shaped text detection with pixel aggregation network. In Proc. IEEE Int. Conf. Comp. Vis., pages 8440–8449, 2019.
[3] Paper is in preparation.

#f03c15 This repository only contains PAN now, PSENet and PAN++ are on the way.

Recommended environment

Python 3.6+
Pytorch 1.1.0
torchvision 0.3
mmcv 0.2.12
editdistance
Polygon3
pyclipper
opencv-python 3.4.2.17
Cython

Install

pip install -r requirement.txt
./compile.sh

Training

CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py ${CONFIG_FILE}

For example:

CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py config/pan/pan_r18_ic15.py

Test

python test.py ${CONFIG_FILE} ${CHECKPOINT_FILE}

For example:

python test.py config/pan/pan_r18_ic15.py checkpoints/pan_r18_ic15/checkpoint.pth.tar

Speed

python test.py ${CONFIG_FILE} ${CHECKPOINT_FILE} --report_speed

For example:

python test.py config/pan/pan_r18_ic15.py checkpoints/pan_r18_ic15/checkpoint.pth.tar --report_speed

Evaluation

See eval.

Benchmark and model zoo

Todo:

  • PSENet
  • PAN++

Citation

@inproceedings{wang2019shape,
  title={Shape robust text detection with progressive scale expansion network},
  author={Wang, Wenhai and Xie, Enze and Li, Xiang and Hou, Wenbo and Lu, Tong and Yu, Gang and Shao, Shuai},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={9336--9345},
  year={2019}
}

@inproceedings{wang2019efficient,
  title={Efficient and accurate arbitrary-shaped text detection with pixel aggregation network},
  author={Wang, Wenhai and Xie, Enze and Song, Xiaoge and Zang, Yuhang and Wang, Wenjia and Lu, Tong and Yu, Gang and Shen, Chunhua},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  pages={8440--8449},
  year={2019}
}

License

This project is released under the Apache 2.0 license.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].