Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → Canjie-Luo → Moran_v2

Canjie-Luo / Moran_v2

Licence: mit

MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition

Programming Languages

139335 projects - #7 most used programming language

Labels

attention-mechanism

Projects that are alternatives of or similar to Moran v2

To eventually become an unofficial Pytorch implementation / replication of Alphafold2, as details of the architecture get released

Stars: ✭ 298 (-44.4%)

Mutual labels: attention-mechanism

A PyTorch implementation of "SimGNN: A Neural Network Approach to Fast Graph Similarity Computation" (WSDM 2019).

Stars: ✭ 351 (-34.51%)

Mutual labels: attention-mechanism

Awesome Graph Classification

A collection of important graph embedding, classification and representation learning papers with implementations.

Stars: ✭ 4,309 (+703.92%)

Mutual labels: attention-mechanism

Tensorflow end2end speech recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Stars: ✭ 305 (-43.1%)

Mutual labels: attention-mechanism

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Stars: ✭ 3,646 (+580.22%)

Mutual labels: attention-mechanism

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (-23.88%)

Mutual labels: attention-mechanism

Attention is all you need

Transformer of "Attention Is All You Need" (Vaswani et al. 2017) by Chainer.

Stars: ✭ 303 (-43.47%)

Mutual labels: attention-mechanism

Keras Self Attention

Attention mechanism for processing sequential data that considers the context for each timestamp.

Stars: ✭ 489 (-8.77%)

Mutual labels: attention-mechanism

Action Recognition Visual Attention

Action recognition using soft attention based deep recurrent neural networks

Stars: ✭ 350 (-34.7%)

Mutual labels: attention-mechanism

Transformer Tts

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

Stars: ✭ 418 (-22.01%)

Mutual labels: attention-mechanism

Seq2seq chatbot

基于seq2seq模型的简单对话系统的tf实现，具有embedding、attention、beam_search等功能，数据集是Cornell Movie Dialogs

Stars: ✭ 308 (-42.54%)

Mutual labels: attention-mechanism

Keras implementation of the graph attention networks (GAT) by Veličković et al. (2017; https://arxiv.org/abs/1710.10903)

Stars: ✭ 334 (-37.69%)

Mutual labels: attention-mechanism

Bottleneck Transformer Pytorch

Implementation of Bottleneck Transformer in Pytorch

Stars: ✭ 408 (-23.88%)

Mutual labels: attention-mechanism

Seq2seq Summarizer

Pointer-generator reinforced seq2seq summarization in PyTorch

Stars: ✭ 306 (-42.91%)

Mutual labels: attention-mechanism

Structured Self Attention

A Structured Self-attentive Sentence Embedding

Stars: ✭ 459 (-14.37%)

Mutual labels: attention-mechanism

Adaptiveattention

Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"

Stars: ✭ 303 (-43.47%)

Mutual labels: attention-mechanism

Code for PaperRobot: Incremental Draft Generation of Scientific Ideas

Stars: ✭ 372 (-30.6%)

Mutual labels: attention-mechanism

Neural Machine Translation with Keras

Stars: ✭ 501 (-6.53%)

Mutual labels: attention-mechanism

Deeplearning.ai Natural Language Processing Specialization

This repository contains my full work and notes on Coursera's NLP Specialization (Natural Language Processing) taught by the instructor Younes Bensouda Mourri and Łukasz Kaiser offered by deeplearning.ai

Stars: ✭ 473 (-11.75%)

Mutual labels: attention-mechanism

Pytorch Original Transformer

My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.

Stars: ✭ 411 (-23.32%)

Mutual labels: attention-mechanism

View All Similar Projects ➔

MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition

Python 2.7	Python 3.6

MORAN is a network with rectification mechanism for general scene text recognition. The paper (accepted to appear in Pattern Recognition, 2019) in arXiv, final version is available now.

Here is a brief introduction in Chinese.

Recent Update

2019.03.21 Fix a bug about Fractional Pickup.
Support Python 3.

Improvements of MORAN v2:

More stable rectification network for one-stage training
Replace VGG backbone by ResNet
Use bidirectional decoder (a trick borrowed from ASTER)

Version	IIIT5K	SVT	IC03	IC13	SVT-P	CUTE80	IC15 (1811)	IC15 (2077)
MORAN v1 (curriculum training)*	91.2	88.3	95.0	92.4	76.1	77.4	74.7	68.8
MORAN v2 (one-stage training)	93.4	88.3	94.2	93.2	79.7	81.9	77.8	73.9

*The results of v1 were reported in our paper. If this project is helpful for your research, please cite our Pattern Recognition paper.

Requirements

(Welcome to develop MORAN together.)

We recommend you to use Anaconda to manage your libraries.

Python 2.7 or Python 3.6 (Python 3 is faster than Python 2)
PyTorch 0.3.* (Higher version causes slow training, please ref to issue#8)
TorchVision
OpenCV
PIL (Pillow)
Colour
LMDB
matplotlib

Or use pip to install the libraries. (Maybe the torch is different from the anaconda version. Please check carefully and fix the warnings in training stage if necessary.)

    pip install -r requirements.txt

Data Preparation

Please convert your own dataset to LMDB format by using the tool (run in Python 2.7) provided by @Baoguang Shi.

You can also download the training (NIPS 2014, CVPR 2016) and testing datasets prepared by us.

The raw pictures of testing datasets can be found here.

Training and Testing

Modify the path to dataset folder in train_MORAN.sh:

	--train_nips path_to_dataset \
	--train_cvpr path_to_dataset \
	--valroot path_to_dataset \

And start training: (manually decrease the learning rate for your task)

	sh train_MORAN.sh

The training process should take less than 20s for 100 iterations on a 1080Ti.

Demo

Download the model parameter file demo.pth.

BaiduCloud (password: l8em)
Google Drive
OneDrive

Put it into root folder. Then, execute the demo.py for more visualizations.

	python demo.py

Citation

@article{cluo2019moran,
  author    = {Canjie Luo and Lianwen Jin and Zenghui Sun},
  title     = {MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition},
  journal   = {Pattern Recognition}, 
  volume    = {90}, 
  pages     = {109--118},
  year      = {2019},
  publisher = {Elsevier}
}

Acknowledgment

The repo is developed based on @Jieru Mei's crnn.pytorch and @marvis' ocr_attention. Thanks for your contribution.

Attention

The project is only free for academic research purposes.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 536

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (18) 🔗