bgshih / Aster

License: MIT
Recognizing cropped text in natural images.

ASTER: Attentional Scene Text Recognizer with Flexible Rectification

ASTER is an accurate scene text recognizer with a flexible rectification mechanism. The research paper can be found here.

ASTER Overview

The implementation of ASTER reuses code from the TensorFlow Object Detection API.

Update

[07/13/2019] A PyTorch port has been made by @ayumiymk.

Correction (10/22/2018)

We identified a bug in our code that caused only part of the SVT images to be tested, which inflated the reported results. The bug has been fixed in commit a7e8613. Below are the corrected numbers on SVT. The results are still state-of-the-art, so the conclusions are not affected.

  • SVT (50): ASTER: 97.4%; ASTER-A: 96.3%; ASTER-B: 96.1%
  • SVT (None): ASTER: 89.5%; ASTER-A: 80.2%; ASTER-B: 81.6%

Prerequisites

ASTER was developed and tested with TensorFlow r1.4. Higher versions may not work.

ASTER requires Protocol Buffers (version>=2.6). In addition, on Ubuntu 16.04, install the following:

sudo apt install cmake libcupti-dev
pip3 install --user protobuf tqdm numpy editdistance
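
One way to confirm the Protocol Buffers requirement is to parse the compiler's version string. The sketch below is not part of ASTER: the helper names parse_protoc_version, meets_minimum and installed_protoc_version are hypothetical, and it assumes that `protoc --version` prints a line such as "libprotoc 3.0.0".

```python
import re
import subprocess


def parse_protoc_version(text):
    """Extract the numeric version tuple from `protoc --version` output."""
    match = re.search(r"(\d+(?:\.\d+)*)", text)
    if match is None:
        raise ValueError("no version number found in: %r" % text)
    return tuple(int(part) for part in match.group(1).split("."))


def meets_minimum(version, minimum=(2, 6)):
    """Return True if `version` is at least `minimum` (2.6 per the prerequisites)."""
    width = max(len(version), len(minimum))

    def pad(parts):
        # Pad with zeros so (3,) compares like (3, 0).
        return tuple(parts) + (0,) * (width - len(parts))

    return pad(version) >= pad(minimum)


def installed_protoc_version():
    """Run `protoc --version`; assumes protoc is on PATH."""
    output = subprocess.check_output(["protoc", "--version"])
    return parse_protoc_version(output.decode("utf-8"))
```

Calling meets_minimum(installed_protoc_version()) before the build steps would flag a compiler that is too old. It does not check the TensorFlow version.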

Installation

  1. Go to c_ops/ and run build.sh to build the custom operators
  2. Execute protoc aster/protos/*.proto --python_out=. to build the protobuf files
  3. Add /path/to/aster to PYTHONPATH, or set this variable for every run
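
For step 3, an alternative to exporting PYTHONPATH in the shell is to extend sys.path at the top of a script. This is a minimal sketch: ensure_on_path is a hypothetical helper that ASTER does not provide, and /path/to/aster is still the placeholder from the step above.

```python
import os
import sys


def ensure_on_path(directory):
    """Prepend `directory` to sys.path once, mirroring a PYTHONPATH entry."""
    directory = os.path.abspath(directory)
    if directory not in sys.path:
        sys.path.insert(0, directory)
    return directory


if __name__ == "__main__":
    # Placeholder path taken from step 3; replace it with the real location.
    ensure_on_path("/path/to/aster")
```

Unlike PYTHONPATH, this only affects the current interpreter process, so each entry script would need the call.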

Demo

A demo program is located at aster/demo.py, accompanied by pretrained model files available on our release page. Download model-demo.zip and extract it under aster/experiments/demo/ before running the demo.
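
The extraction step can also be scripted. The sketch below assumes model-demo.zip has already been downloaded. extract_demo_model is a hypothetical helper, not part of the repository, and the internal layout of the archive is not assumed.

```python
import zipfile
from pathlib import Path


def extract_demo_model(zip_path, target_dir="aster/experiments/demo"):
    """Extract the downloaded archive under the demo experiment directory."""
    target = Path(target_dir)
    # Create the target directory if it does not exist yet.
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(str(zip_path)) as archive:
        archive.extractall(str(target))
        # Return the archive members so the caller can inspect what was unpacked.
        return sorted(archive.namelist())


if __name__ == "__main__":
    for name in extract_demo_model("model-demo.zip"):
        print(name)
```

Printing the member names is a quick way to check whether the files landed directly in aster/experiments/demo/ or in a nested folder.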

To run the demo, simply execute:

python3 aster/demo.py

This will output the recognition result of the demo image and the rectified image.

Training and on-the-fly evaluation

Data preparation scripts for several popular scene text datasets are located under aster/tools. See their source code for usage.

To run the example training, execute:

python3 aster/train.py \
  --exp_dir experiments/demo \
  --num_clones 2

Change the configuration in experiments/aster/trainval.prototxt to configure your own training process.

During training, you can run a separate program that repeatedly evaluates the produced checkpoints:

python3 aster/eval.py \
   --exp_dir experiments/demo

Evaluation configuration is also in trainval.prototxt.
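
The two commands above can be started together from one Python script. This is only a sketch. build_command and launch_training_and_evaluation are hypothetical helpers, the only flags used are --exp_dir and --num_clones as shown above, and the script is assumed to run from the same working directory as those commands. How the two processes share GPUs is not handled here.

```python
import subprocess
import sys


def build_command(script, exp_dir, num_clones=None):
    """Assemble the argument list for aster/train.py or aster/eval.py."""
    command = [sys.executable, script, "--exp_dir", exp_dir]
    if num_clones is not None:
        command += ["--num_clones", str(num_clones)]
    return command


def launch_training_and_evaluation(exp_dir="experiments/demo", num_clones=2):
    """Start training and the evaluation loop as two separate processes."""
    trainer = subprocess.Popen(build_command("aster/train.py", exp_dir, num_clones))
    evaluator = subprocess.Popen(build_command("aster/eval.py", exp_dir))
    return trainer, evaluator


if __name__ == "__main__":
    trainer, evaluator = launch_training_and_evaluation()
    # Wait for training to finish, then stop the evaluation loop.
    trainer.wait()
    evaluator.terminate()
```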

Citation

If you find this project helpful for your research, please cite the following papers:

@article{bshi2018aster,
  author  = {Baoguang Shi and
               Mingkun Yang and
               Xinggang Wang and
               Pengyuan Lyu and
               Cong Yao and
               Xiang Bai},
  title   = {ASTER: An Attentional Scene Text Recognizer with Flexible Rectification},
  journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence}, 
  volume  = {}, 
  number  = {}, 
  pages   = {1-1},
  year    = {2018}, 
}

@inproceedings{ShiWLYB16,
  author    = {Baoguang Shi and
               Xinggang Wang and
               Pengyuan Lyu and
               Cong Yao and
               Xiang Bai},
  title     = {Robust Scene Text Recognition with Automatic Rectification},
  booktitle = {2016 {IEEE} Conference on Computer Vision and Pattern Recognition,
               {CVPR} 2016, Las Vegas, NV, USA, June 27-30, 2016},
  pages     = {4168--4176},
  year      = {2016}
}

IMPORTANT NOTICE: Although this software is licensed under MIT, our intention is to make it free for academic research purposes. If you are going to use it in a product, we suggest you contact us regarding possible patent issues.
