whai362 / AE_TextSpotter

AE TextSpotter

Introduction

This is the official implementation of AE TextSpotter, which introduces linguistic information to eliminate the ambiguity in text detection. This code is based on MMDetection v1.0rc1.

demo image

Recommended environment

Python 3.6+
PyTorch 1.1.0
torchvision 0.2.1
pytorch_transformers 1.1.0
mmcv 0.2.13
Polygon3
opencv-python 4.4.0

Install

Please refer to MMDetection v1.0rc1 for installation.

Preparing data

Step 1: Download the dataset from ICDAR 2019 ReCTS.

Step 2: Arrange the "data/ReCTS" directory as follows:

data/ReCTS/
├── train
│   ├── img
│   └── gt
└── test
    └── img

In the "data/ReCTS/" folder, the files "TDA_ReCTS_train_list.txt" and "TDA_ReCTS_val_list.txt" are downloaded from TDA-ReCTS. The other JSON files can be generated by running "python tools/rects_prepare_data.py".
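Before generating the JSON files, it can help to confirm that the layout described above is in place. The following is a minimal sketch, not part of the repository: the helper name `missing_rects_entries` is hypothetical, and it checks only the directories and list files named in this section, since the names of the generated JSON files are not given here.

```python
from pathlib import Path

# Sub-directories and list files named in the steps above. The JSON files
# produced by tools/rects_prepare_data.py are not checked because their
# names are not stated in this README.
EXPECTED_DIRS = ["train/img", "train/gt", "test/img"]
EXPECTED_FILES = ["TDA_ReCTS_train_list.txt", "TDA_ReCTS_val_list.txt"]


def missing_rects_entries(root="data/ReCTS"):
    """Return the expected relative paths that are absent under `root`.

    Hypothetical helper for illustration; it is not shipped with the code.
    """
    root = Path(root)
    missing = [d for d in EXPECTED_DIRS if not (root / d).is_dir()]
    missing += [f for f in EXPECTED_FILES if not (root / f).is_file()]
    return missing


if __name__ == "__main__":
    missing = missing_rects_entries()
    if missing:
        print("Missing under data/ReCTS:", ", ".join(missing))
    else:
        print("data/ReCTS layout looks complete.")
```

Run it from the repository root; an empty result means the folders and both list files were found.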

Step 3: Download bert-base-chinese.zip and unzip it in the root of this repository.

unzip bert-base-chinese.zip

Training

Step 1:

tools/rects_dist_train.sh local_configs/rects_ae_textspotter_r50_1x.py 8

Step 2:

tools/rects_dist_train.sh local_configs/rects_ae_textspotter_lm_r50_1x.py 8

Test

TDA-ReCTS

tools/rects_dist_test.sh local_configs/rects_ae_textspotter_lm_r50_1x.py work_dirs/rects_ae_textspotter_lm_r50_1x/latest.pth 8 --json_out results.json

ICDAR 2019 ReCTS Task 4: End-to-End Text Spotting

tools/rects_dist_test.sh local_configs/rects_ae_textspotter_lm_r50_1x_test.py work_dirs/rects_ae_textspotter_lm_r50_1x/latest.pth 8 --json_out results_test.json
python tools/rects_trans2submit.py

Visualization

python tools/rects_test.py local_configs/rects_ae_textspotter_lm_r50_1x.py work_dirs/rects_ae_textspotter_lm_r50_1x/latest.pth --show

Evaluation

The training list, validation list, and evaluation script used by this code come from TDA-ReCTS.

python tools/rects_eval.py

The output of the evaluation script should be:

[Best F-Measure] p: 84.94, r: 78.10, f: 81.37, 1-ned: 51.02, best_score_th: 0.569
[Best 1-NED]     p: 86.68, r: 76.09, f: 81.04, 1-ned: 51.51, best_score_th: 0.626
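To compare a run against the numbers above, the two summary lines can be parsed into dictionaries. The sketch below is an illustration only: it assumes the output keeps exactly the "[tag] key: value, ..." shape shown here, and the function names are hypothetical. It also checks that the reported f value agrees with the usual harmonic mean of precision and recall, up to rounding.

```python
import re

# Sample text copied from the expected output shown above.
SAMPLE = """\
[Best F-Measure] p: 84.94, r: 78.10, f: 81.37, 1-ned: 51.02, best_score_th: 0.569
[Best 1-NED]     p: 86.68, r: 76.09, f: 81.04, 1-ned: 51.51, best_score_th: 0.626
"""

# "[tag] key: value, key: value, ..." (assumed format, taken from the sample).
LINE_RE = re.compile(r"^\[(?P<tag>[^\]]+)\]\s*(?P<body>.*)$")


def parse_eval_output(text):
    """Map each bracketed tag to a dict of metric name -> float."""
    results = {}
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # skip anything that is not a summary line
        metrics = {}
        for item in match.group("body").split(","):
            key, value = item.split(":")
            metrics[key.strip()] = float(value)
        results[match.group("tag")] = metrics
    return results


def f_measure(precision, recall):
    """Harmonic mean of precision and recall (both in percent)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


if __name__ == "__main__":
    for tag, m in parse_eval_output(SAMPLE).items():
        print(tag, m, "recomputed f = %.2f" % f_measure(m["p"], m["r"]))
```

The recomputed value can differ from the printed one in the last digit because p and r are already rounded.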

Results and Models

TDA-ReCTS

| Method | Precision (%) | Recall (%) | F-measure (%) | 1-NED (%) | Model |
|---|---|---|---|---|---|
| AE TextSpotter | 84.94 | 78.10 | 81.37 | 51.51 | Google Drive |
| AE TextSpotter (Paper) | 84.78 | 78.28 | 81.39 | 51.32 | - |
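For readers unfamiliar with the 1-NED column, the sketch below shows one common way to compute it: one minus the mean edit distance between predicted and ground-truth strings, each normalized by the longer string's length. This is an assumption about the metric's general definition. The repository's tools/rects_eval.py may match detections to ground truth and treat unmatched boxes differently, so this does not reproduce the script. Both function names are hypothetical.

```python
def edit_distance(a, b):
    """Levenshtein distance (insert, delete, substitute) between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete from a
                            curr[j - 1] + 1,      # insert into a
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def one_minus_ned(pairs):
    """1 - mean normalized edit distance over (prediction, ground truth) pairs.

    Assumed definition for illustration; two empty strings count as distance 0.
    """
    if not pairs:
        return 0.0
    total = 0.0
    for pred, gt in pairs:
        longest = max(len(pred), len(gt))
        total += edit_distance(pred, gt) / longest if longest else 0.0
    return 1.0 - total / len(pairs)
```

A perfect transcription scores 1.0 for its pair, and a completely wrong one scores 0.0, so the metric rewards partially correct strings that an exact-match F-measure would count as failures.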

ICDAR 2019 ReCTS

| Method | Precision (%) | Recall (%) | F-measure (%) | 1-NED (%) | Model |
|---|---|---|---|---|---|
| AE TextSpotter | 93.38 | 89.98 | 91.65 | 71.83 | Same as TDA-ReCTS |
| AE TextSpotter (Paper) | 92.60 | 91.01 | 91.80 | 71.81 | - |

License

This project is released under the Apache 2.0 license.

Citation

If you use this work in your research, please cite us.

@inproceedings{wenhai2020ae,
  title={AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting},
  author={Wang, Wenhai and Liu, Xuebo and Ji, Xiaozhong and Xie, Enze and Liang, Ding and Yang, ZhiBo and Lu, Tong and Shen, Chunhua and Luo, Ping},
  booktitle={European Conference on Computer Vision (ECCV)},
  year={2020}
}

Other Projects:

PAN (ICCV 2019): https://github.com/whai362/pan_pp.pytorch

PSENet (CVPR 2019): https://github.com/whai362/PSENet
