
opconty / Transformer_str

PyTorch implementation of my new method for Scene Text Recognition (STR) based on Transformer. Equipped with a Transformer, this method outperforms the best model of deep-text-recognition-benchmark by about 7.6 percentage points in accuracy on CUTE80.


Projects that are alternatives to or similar to Transformer_str

React Native Tesseract Ocr
Tesseract OCR wrapper for React Native
Stars: ✭ 384 (+193.13%)
Mutual labels:  ocr, text-recognition
Image Text Localization Recognition
A general list of resources for image text localization and recognition: a collection of papers and implementations for scene text localization and recognition
Stars: ✭ 788 (+501.53%)
Mutual labels:  ocr, text-recognition
Cnn lstm ctc ocr
Tensorflow-based CNN+LSTM trained with CTC-loss for OCR
Stars: ✭ 464 (+254.2%)
Mutual labels:  ocr, text-recognition
Chineseaddress ocr
Chinese address OCR for photographed documents, implemented using CTPN+CTC+Address Correction.
Stars: ✭ 309 (+135.88%)
Mutual labels:  ocr, text-recognition
Php Apache Tika
Apache Tika bindings for PHP: extract text and metadata from documents, images and other formats
Stars: ✭ 76 (-41.98%)
Mutual labels:  ocr, text-recognition
Megreader
A research project for text detection and recognition using PyTorch 1.2.
Stars: ✭ 332 (+153.44%)
Mutual labels:  ocr, text-recognition
Tr
Free Offline OCR: an offline Chinese text detection + recognition SDK
Stars: ✭ 598 (+356.49%)
Mutual labels:  ocr, text-recognition
insightocr
MXNet OCR implementation. Including text recognition and detection.
Stars: ✭ 100 (-23.66%)
Mutual labels:  ocr, text-recognition
Text recognition toolbox
text_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way.
Stars: ✭ 114 (-12.98%)
Mutual labels:  ocr, text-recognition
Sar tf
This is an implementation of Show, Attend and Read with tensorflow
Stars: ✭ 70 (-46.56%)
Mutual labels:  ocr, text-recognition
Vedastr
A scene text recognition toolbox based on PyTorch
Stars: ✭ 290 (+121.37%)
Mutual labels:  ocr, text-recognition
Node Tesseract Ocr
A Node.js wrapper for the Tesseract OCR API
Stars: ✭ 92 (-29.77%)
Mutual labels:  ocr, text-recognition
ocr
Simple app to extract text from pictures using Tesseract
Stars: ✭ 98 (-25.19%)
Mutual labels:  ocr, text-recognition
Awesome Ocr Resources
A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).
Stars: ✭ 335 (+155.73%)
Mutual labels:  ocr, text-recognition
MLKit
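🌝 MLKit is a powerful, easy-to-use toolkit. With ML Kit you can easily implement text recognition, barcode recognition, image labeling, face detection, object detection, and more.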
Stars: ✭ 294 (+124.43%)
Mutual labels:  ocr, text-recognition
Aster.pytorch
ASTER in Pytorch
Stars: ✭ 473 (+261.07%)
Mutual labels:  ocr, text-recognition
EverTranslator
Translate text anytime and anywhere, even while you are gaming!
Stars: ✭ 59 (-54.96%)
Mutual labels:  ocr, text-recognition
CRNN
Convolutional recurrent neural network for scene text recognition or OCR in Keras
Stars: ✭ 96 (-26.72%)
Mutual labels:  ocr, text-recognition
Crnn
Convolutional recurrent neural network for scene text recognition or OCR in Keras
Stars: ✭ 68 (-48.09%)
Mutual labels:  ocr, text-recognition
Crnn With Stn
implement CRNN in Keras with Spatial Transformer Network
Stars: ✭ 83 (-36.64%)
Mutual labels:  ocr, text-recognition

Transformer-based Scene Text Recognition (Transformer-STR)

  • PyTorch implementation of my new method for Scene Text Recognition (STR) based on Transformer.

I adapted the four-stage STR framework devised by deep-text-recognition-benchmark and replaced its prediction (Pred.) stage with a Transformer.
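To make the idea concrete, here is a minimal sketch of what a Transformer-based prediction stage could look like. It is not the code in this repo: the class name `TransformerPredictor`, the hyperparameters, the learned positional embedding, and the tensor shapes are all assumptions chosen for illustration. It only shows the general pattern of decoding characters with a causal Transformer decoder that attends to the visual feature sequence produced by the earlier stages.

```python
import math

import torch
import torch.nn as nn


class TransformerPredictor(nn.Module):
    """Hypothetical prediction head: decodes characters from visual features."""

    def __init__(self, num_classes, d_model=256, nhead=8, num_layers=2, max_len=26):
        super().__init__()
        self.d_model = d_model
        self.char_embed = nn.Embedding(num_classes, d_model)
        # Learned positional embedding (an assumption; sinusoidal would also work).
        self.pos_embed = nn.Embedding(max_len, d_model)
        layer = nn.TransformerDecoderLayer(
            d_model, nhead, dim_feedforward=4 * d_model, batch_first=True
        )
        self.decoder = nn.TransformerDecoder(layer, num_layers)
        self.classifier = nn.Linear(d_model, num_classes)

    def forward(self, visual_feats, prev_tokens):
        # visual_feats: [batch, width, d_model], output of the earlier stages.
        # prev_tokens:  [batch, length], previously decoded character indices.
        length = prev_tokens.size(1)
        device = prev_tokens.device
        positions = torch.arange(length, device=device)
        tgt = self.char_embed(prev_tokens) * math.sqrt(self.d_model)
        tgt = tgt + self.pos_embed(positions)
        # Causal mask: position i may only attend to positions <= i.
        causal_mask = torch.triu(
            torch.full((length, length), float("-inf"), device=device), diagonal=1
        )
        hidden = self.decoder(tgt, visual_feats, tgt_mask=causal_mask)
        return self.classifier(hidden)  # [batch, length, num_classes]


# Illustrative shapes only: 2 images, 25 feature columns, 10 decoding steps.
model = TransformerPredictor(num_classes=38)
feats = torch.randn(2, 25, 256)
tokens = torch.zeros(2, 10, dtype=torch.long)
logits = model(feats, tokens)
```

The sketch uses `batch_first=True`, which requires a reasonably recent PyTorch release.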

Equipped with a Transformer, this method outperforms the best model of deep-text-recognition-benchmark by about 7.6 percentage points in accuracy on CUTE80.

Download pretrained weights from here

These pretrained weights were trained on the synthetic datasets for about 700K iterations.

Clone this repo, download the weights file, and move it into the checkpoints directory.

Download the LMDB dataset for training and evaluation from here (provided by deep-text-recognition-benchmark).

data_lmdb_release.zip contains the following:
training datasets : MJSynth (MJ)[1] and SynthText (ST)[2]
validation datasets : the union of the training sets of IC13[3], IC15[4], IIIT[5], and SVT[6]
evaluation datasets : the benchmark evaluation datasets, consisting of IIIT[5], SVT[6], IC03[7], IC13[3], IC15[4], SVTP[8], and CUTE[9]
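If you want to inspect the data yourself, the sketch below shows one way to walk the samples. It assumes the archive follows the key layout written by the dataset creation tool in deep-text-recognition-benchmark (a `num-samples` entry plus 1-indexed `image-%09d` and `label-%09d` keys); please check this against your copy. The helper `iter_samples` is hypothetical, and a plain dict stands in for the database so the example runs without any extra package. With the `lmdb` package you would pass the `get` method of a read transaction instead.

```python
def iter_samples(get):
    """Yield (image_bytes, label) pairs.

    `get` is any callable mapping a bytes key to a bytes value, for example
    the `get` method of an open LMDB read transaction.
    """
    num_samples = int(get(b"num-samples"))
    for index in range(1, num_samples + 1):  # keys are assumed to be 1-indexed
        image_bytes = get(b"image-%09d" % index)
        label = get(b"label-%09d" % index).decode("utf-8")
        yield image_bytes, label


# Tiny in-memory stand-in for an LMDB transaction, for illustration only.
fake_db = {
    b"num-samples": b"2",
    b"image-000000001": b"<jpeg bytes>",
    b"label-000000001": b"hello",
    b"image-000000002": b"<jpeg bytes>",
    b"label-000000002": b"world",
}
samples = list(iter_samples(fake_db.get))
```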

Training

Please configure your data_dir in config.py file, then run:

python tools/train.py

Evaluation on CUTE80

The Transformer-based STR model achieves 0.815972 accuracy on CUTE80, outperforming the best model of deep-text-recognition-benchmark, which reaches 0.74.


If you want to reproduce the evaluation result, please run:

python evaluation.py

Make sure your cute80_dir and saved_model paths are correct. You should get the result 0.815972.
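As a sanity check on that number, here is a small sketch of word-level accuracy. It assumes accuracy means the fraction of case-insensitive exact matches between prediction and label; evaluation.py may normalize text differently, so treat `word_accuracy` as a hypothetical helper rather than the repo's metric. CUTE80 contains 288 cropped word images, so 0.815972 is consistent with 235 correct predictions out of 288.

```python
def word_accuracy(predictions, labels):
    """Fraction of predictions that match their label (case-insensitive)."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    correct = sum(p.lower() == g.lower() for p, g in zip(predictions, labels))
    return correct / len(labels)


# Toy example: two of three predictions match their labels.
demo = word_accuracy(["coffee", "STOP", "exit"], ["coffee", "stop", "exit1"])

# 235 correct out of 288 images reproduces the reported figure.
reported = round(235 / 288, 6)
```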

Contact

Feel free to contact me ([email protected]).

License

This project is released under the Apache 2.0 license.

References

deep-text-recognition-benchmark
