All Projects → ayumiymk → Aster.pytorch

ayumiymk / Aster.pytorch

Licence: mit
ASTER in Pytorch

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Aster.pytorch

NLP-image-to-text
code to extract text from images
Stars: ✭ 28 (-94.08%)
Mutual labels:  ocr, text-recognition
Cnn lstm ctc ocr
Tensorflow-based CNN+LSTM trained with CTC-loss for OCR
Stars: ✭ 464 (-1.9%)
Mutual labels:  ocr, text-recognition
EverTranslator
Translate text anytime and everywhere, even you are gaming!
Stars: ✭ 59 (-87.53%)
Mutual labels:  ocr, text-recognition
Ocr.pytorch
A pure pytorch implemented ocr project including text detection and recognition
Stars: ✭ 196 (-58.56%)
Mutual labels:  ocr, text-recognition
Chineseaddress ocr
Photographing Chinese-Address OCR implemented using CTPN+CTC+Address Correction. 拍照文档中文地址文字识别。
Stars: ✭ 309 (-34.67%)
Mutual labels:  ocr, text-recognition
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Stars: ✭ 1,409 (+197.89%)
Mutual labels:  ocr, text-recognition
insightocr
MXNet OCR implementation. Including text recognition and detection.
Stars: ✭ 100 (-78.86%)
Mutual labels:  ocr, text-recognition
Adelaidet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Stars: ✭ 2,565 (+442.28%)
Mutual labels:  ocr, text-recognition
Vedastr
A scene text recognition toolbox based on PyTorch
Stars: ✭ 290 (-38.69%)
Mutual labels:  ocr, text-recognition
ocr
Simple app to extract text from pictures using Tesseract
Stars: ✭ 98 (-79.28%)
Mutual labels:  ocr, text-recognition
Deep Text Recognition Benchmark
Text recognition (optical character recognition) with deep learning methods.
Stars: ✭ 2,665 (+463.42%)
Mutual labels:  ocr, text-recognition
Awesome Ocr Resources
A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).
Stars: ✭ 335 (-29.18%)
Mutual labels:  ocr, text-recognition
Awesome Deep Text Detection Recognition
A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.
Stars: ✭ 2,282 (+382.45%)
Mutual labels:  ocr, text-recognition
lego-mindstorms-51515-jetson-nano
Combines the LEGO Mindstorms 51515 with the NVIDIA Jetson Nano
Stars: ✭ 31 (-93.45%)
Mutual labels:  ocr, text-recognition
Tesseract Ocr For Php
A wrapper to work with Tesseract OCR inside PHP.
Stars: ✭ 2,247 (+375.05%)
Mutual labels:  ocr, text-recognition
CRNN
Convolutional recurrent neural network for scene text recognition or OCR in Keras
Stars: ✭ 96 (-79.7%)
Mutual labels:  ocr, text-recognition
Crnn.pytorch
crnn实现水平和垂直方向中文文字识别, 提供在3w多个中文字符训练的水平识别和垂直识别的预训练模型; 欢迎关注,试用和反馈问题... ...
Stars: ✭ 145 (-69.34%)
Mutual labels:  ocr, text-recognition
Scene Text Recognition
Scene text detection and recognition based on Extremal Region(ER)
Stars: ✭ 146 (-69.13%)
Mutual labels:  ocr, text-recognition
MLKit
🌝 MLKit是一个强大易用的工具包。通过ML Kit您可以很轻松的实现文字识别、条码识别、图像标记、人脸检测、对象检测等功能。
Stars: ✭ 294 (-37.84%)
Mutual labels:  ocr, text-recognition
Megreader
A research project for text detection and recognition using PyTorch 1.2.
Stars: ✭ 332 (-29.81%)
Mutual labels:  ocr, text-recognition

ASTER: Attentional Scene Text Recognizer with Flexible Rectification

This repository implements the ASTER in pytorch. Origin software could be found in here.

ASTER is an accurate scene text recognizer with flexible rectification mechanism. The research paper can be found here.

ASTER Overview

Installation

conda env create -f environment.yml

Train

[NOTE] Some users say that they can't reproduce the reported performance with minor modification, like 1 and 2. I haven't try other settings, so I can't guarantee the same performance with different settings. The users should just run the following script without any modification to reproduce the results.

bash scripts/stn_att_rec.sh

Test

You can test with .lmdb files by

bash scripts/main_test_all.sh

Or test with single image by

bash scripts/main_test_image.sh

Pretrained model

The pretrained model is available on our release page. Download demo.pth.tar and put it to somewhere. Before running, modify the --resume to the location of this file.

Reproduced results

IIIT5k SVT IC03 IC13 IC15 SVTP CUTE
ASTER (L2R) 92.67 - 93.72 90.74 - 78.76 76.39
ASTER.Pytorch 93.2 89.2 92.2 91 78.0 81.2 81.9

At present, the bidirectional attention decoder proposed in ASTER is not included in my implementation.

You can use the codes to bootstrap for your next text recognition research project.

Data preparation

We give an example to construct your own datasets. Details please refer to tools/create_svtp_lmdb.py.

We also provide datasets for training (password: wi05) and testing.

Citation

If you find this project helpful for your research, please cite the following papers:

@article{bshi2018aster,
  author    = {Baoguang Shi and
               Mingkun Yang and
               Xinggang Wang and
               Pengyuan Lyu and
               Cong Yao and
               Xiang Bai},
  title     = {ASTER: An Attentional Scene Text Recognizer with Flexible Rectification},
  journal   = {IEEE Transactions on Pattern Analysis and Machine Intelligence}, 
  volume    = {41},
  number    = {9},
  pages     = {2035--2048},
  year      = {2019},
}

@inproceedings{ShiWLYB16,
  author    = {Baoguang Shi and
               Xinggang Wang and
               Pengyuan Lyu and
               Cong Yao and
               Xiang Bai},
  title     = {Robust Scene Text Recognition with Automatic Rectification},
  booktitle = {2016 {IEEE} Conference on Computer Vision and Pattern Recognition,
               {CVPR} 2016, Las Vegas, NV, USA, June 27-30, 2016},
  pages     = {4168--4176},
  year      = {2016}
}

IMPORTANT NOTICE: Although this software is licensed under MIT, our intention is to make it free for academic research purposes. If you are going to use it in a product, we suggest you contact us regarding possible patent issues.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].