Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → Megvii-CSG → Megreader

Megvii-CSG / Megreader

A research project for text detection and recognition using PyTorch 1.2.

Programming Languages

python

139335 projects - #7 most used programming language

Labels

deep-learning pytorch ocr text-recognition text-detection ctc

Projects that are alternatives of or similar to Megreader

doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Stars: ✭ 1,409 (+324.4%)

Mutual labels: ocr, text-recognition, text-detection

Sightseq

Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection

Stars: ✭ 116 (-65.06%)

Mutual labels: ctc, ocr, text-recognition

Cnn lstm ctc ocr

Tensorflow-based CNN+LSTM trained with CTC-loss for OCR

Stars: ✭ 464 (+39.76%)

Mutual labels: ctc, ocr, text-recognition

Awesome Ocr Resources

A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).

Stars: ✭ 335 (+0.9%)

Mutual labels: text-detection, ocr, text-recognition

Chineseaddress ocr

Photographing Chinese-Address OCR implemented using CTPN+CTC+Address Correction. 拍照文档中文地址文字识别。

Stars: ✭ 309 (-6.93%)

Mutual labels: text-detection, ocr, text-recognition

Awesome Deep Text Detection Recognition

A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.

Stars: ✭ 2,282 (+587.35%)

Mutual labels: text-detection, ocr, text-recognition

Text recognition toolbox

text_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way.

Stars: ✭ 114 (-65.66%)

Mutual labels: ctc, ocr, text-recognition

React Native Tesseract Ocr

Tesseract OCR wrapper for React Native

Stars: ✭ 384 (+15.66%)

Mutual labels: text-detection, ocr, text-recognition

Image Text Localization Recognition

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集シーンテキストの位置認識と識別のための論文リソースの要約

Stars: ✭ 788 (+137.35%)

Mutual labels: text-detection, ocr, text-recognition

Adelaidet

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Stars: ✭ 2,565 (+672.59%)

Mutual labels: text-detection, ocr, text-recognition

Ocr.pytorch

A pure pytorch implemented ocr project including text detection and recognition

Stars: ✭ 196 (-40.96%)

Mutual labels: text-detection, ocr, text-recognition

Text Detection Ctpn

text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network

Stars: ✭ 3,242 (+876.51%)

Mutual labels: text-detection, ocr

pytorch.ctpn

pytorch, ctpn ,text detection ,ocr,文本检测

Stars: ✭ 123 (-62.95%)

Mutual labels: ocr, text-detection

AE TextSpotter

No description or website provided.

Stars: ✭ 68 (-79.52%)

Mutual labels: text-recognition, text-detection

NLP-image-to-text

code to extract text from images

Stars: ✭ 28 (-91.57%)

Mutual labels: ocr, text-recognition

lego-mindstorms-51515-jetson-nano

Combines the LEGO Mindstorms 51515 with the NVIDIA Jetson Nano

Stars: ✭ 31 (-90.66%)

Mutual labels: ocr, text-recognition

EverTranslator

Translate text anytime and everywhere, even you are gaming!

Stars: ✭ 59 (-82.23%)

Mutual labels: ocr, text-recognition

CRNN

Convolutional recurrent neural network for scene text recognition or OCR in Keras

Stars: ✭ 96 (-71.08%)

Mutual labels: ocr, text-recognition

insightocr

MXNet OCR implementation. Including text recognition and detection.

Stars: ✭ 100 (-69.88%)

Mutual labels: ocr, text-recognition

East

A tensorflow implementation of EAST text detector

Stars: ✭ 2,804 (+744.58%)

Mutual labels: text-detection, ocr

View All Similar Projects ➔

MegReader

A project for research in text detection and recognition using PyTorch 1.2.

This project is originated from the research repo, which heavily relies on closed-source libraries, of CSG-Algorithm team of Megvii(https://megvii.com). We are in ongoing progress to transfer models into this repo gradually, released implementations are listed in Progress.

Highlights

Implementations of representative text detection and recognition methods.
An effective framework for conducting experiments: We use yaml files to configure experiments, making it convenient to take experiments.
Thorough logging features which make it easy to follow and analyze experimental results.
CPU/GPU compatible for training and inference.
Distributed training support.

Install

Requirements

pip install -r requirements.txt

Python3.7
PyTorch 1.2 and CUDA 10.0.
gcc 5.5(Important for compiling)

Compile cuda ops (If needed)

cd PATH_TO_OPS

python setup.py build_ext --inplace

ops may be used:

DeformableConvV2 assets/ops/dcn
CTC2DLoss ops/ctc_2d

Configuration(optional)

Edit configurations in config.py.

Training

See detailed options: python3 train.py --help

Datasets

We provide data loading implementation with annotation packed with json for quick start. Also, lmdb format data are now available too. You can refer the usage in demo. Datasets used in our recognition experiments can be downloaded from onedrive. The transform script are provide to convert json format data to lmdb.

Non-distributed

python3 train.py PATH_TO_EXPERIMENT.yaml --validate --visualize --name NAME_OF_EXPERIMENT

Following we provide some of configurations of the released recognition models:

CRNN: experiments/recognition/crnn.yaml
2D CTC: experiments/recognition/res50-ppm-2d-ctc.yaml
Attention Decoder: experiments/recognition/fpn50-attention-decoder.yaml

Distributed(recommended for multi-gpu training)

python3 -m torch.distributed.launch --nproc_per_node=NUM_GPUS train.py PATH_TO_EXPERIMENT.yaml -d --validate

Evaluating

See detailed options: python3 eval.py --help.

Keeping ratio tesing is recommended: python3 eval.py PATH_TO_EXPERIMENT.yaml --resize_mode keep_ratio

Model zoo

Trained models are comming soon.

Progress

Recognition Methods

[x] 2D CTC
[x] CRNN
[x] Attention Decoder
[ ] Rectification

Detection Methods

[x] Text Snake
[x] EAST

End-to-end

[ ] Mask Text Spotter

Contributing

Contributing.md

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 332

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (11) 🔗