All Projects → zfxxfeng → Cnn_lstm_ctc_ocr_for_icpr

zfxxfeng / Cnn_lstm_ctc_ocr_for_icpr

Licence: gpl-3.0
Forked from weinman/cnn_lstm_ctc_ocr for the ICPR MTWI 2018 challenge 1

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Cnn lstm ctc ocr for icpr

Korean-OCR-Model-Design-based-on-Keras-CNN
Korean OCR Model Design(한글 OCR 모델 설계)
Stars: ✭ 34 (-45.16%)
Mutual labels:  ocr, recognition
Crnn chinese characters rec
(CRNN) Chinese Characters Recognition.
Stars: ✭ 1,259 (+1930.65%)
Mutual labels:  recognition, ocr
Handwriting Ocr
OCR software for recognition of handwritten text
Stars: ✭ 411 (+562.9%)
Mutual labels:  recognition, ocr
Deep Text Recognition Benchmark
Text recognition (optical character recognition) with deep learning methods.
Stars: ✭ 2,665 (+4198.39%)
Mutual labels:  recognition, ocr
Aster
Recognizing cropped text in natural images.
Stars: ✭ 626 (+909.68%)
Mutual labels:  recognition, ocr
Tika Python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Stars: ✭ 997 (+1508.06%)
Mutual labels:  recognition
Coremlvisionscanner
CoreML Vision Text Data & Animal Detector iOS App
Stars: ✭ 49 (-20.97%)
Mutual labels:  ocr
Pan card ocr project
To extract details from Indian National Identification Cards such as PAN (completed) & Aadhar, Passport, Driving License (WIP) in a structured format
Stars: ✭ 39 (-37.1%)
Mutual labels:  ocr
Gesture recognition
a gesture recognition verification lock
Stars: ✭ 37 (-40.32%)
Mutual labels:  recognition
Sane Scan Pdf
Sane command-line scan-to-pdf script on Linux with OCR and deskew support
Stars: ✭ 58 (-6.45%)
Mutual labels:  ocr
Seven Segment Ocr
Digitizes numbers from videos & images of seven segment displays
Stars: ✭ 53 (-14.52%)
Mutual labels:  ocr
Openimager
Image processing Toolkit in R
Stars: ✭ 45 (-27.42%)
Mutual labels:  recognition
Dialectid e2e
End to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (-35.48%)
Mutual labels:  recognition
Swiftytesseractrte
SwiftyTesseract Real-Time Engine
Stars: ✭ 49 (-20.97%)
Mutual labels:  ocr
Pottan Ocr
A stupid OCR for malayalam language
Stars: ✭ 39 (-37.1%)
Mutual labels:  ocr
Simplehtr
Handwritten Text Recognition (HTR) system implemented with TensorFlow.
Stars: ✭ 1,072 (+1629.03%)
Mutual labels:  ocr
Ocrbot
An OCR (Optical Character Recognition) bot for Mastodon (and compatible) instances
Stars: ✭ 39 (-37.1%)
Mutual labels:  ocr
Mybox
Easy tools of document, image, file, network, location, color, and media.
Stars: ✭ 45 (-27.42%)
Mutual labels:  ocr
Idmatch
Match faces on id cards with OCR capabilities.
Stars: ✭ 52 (-16.13%)
Mutual labels:  ocr
Vue.js With Asp.net Core Sample
This provides a sample code using vue.js running on ASP.NET Core
Stars: ✭ 44 (-29.03%)
Mutual labels:  ocr

Forked from weinman/cnn_lstm_ctc_ocr for the ICPR MTWI 2018 challenge 1

Introduction

This is a repository forked from weinman/cnn_lstm_ctc_ocr for the ICPR MTWI 2018 challenge1.
Origin Repository: weinman/cnn_lstm_ctc_ocr - A tensorflow implementation of end to end text recognition
Origin Author: weinman

Author: Feng zhang
Email: [email protected]

Contents

  1. Transform
  2. Models
  3. Demo
  4. Train
  5. Test
  6. Results

Transform

You should cut the data by yourself.Use the target_cut.py, you only need to change the data path to your own origin data.I use the dataset like ICPR MTWI 2018. Note,I use Affine changes the change the Oblique picture to rectangle.So I need the clockwise direction of target label. Use the getTxt.py to change the label's direction.

Next, use mjsynth-tfrecord.py to change your data into tfrecord.You can Find the way in [weinman/cnn_lstm_ctc_ocr/Makefile] You only need to change some paths.

Models

I use the new word_dictionary which consists of English, Chinese and number.I only upload a old pretrain model,it works badly. If you train it for one day with your data,it will work well.And,I add some data augmentation for the model. You also can change the model to denseNet,it will work better.Do as follows in the train.py :

Import denseNet 
%features,sequence_length = model.convnet_layers( image, width, mode)
features,sequence_length = zf_mod_denseNet2.Dense_net( image, width, mode)

I only share the origin Model which is trained on ICPR MTWI 2018 (train): [model_download].The password is 2h1z. Some English data can find in [weinman]

Demo

Download models and copy it to data/model. Then,Run:

python validate.py  your picture's path
eg.python validate.py E:/1.jpg

Train

I already upload MTWI 2018 data£¬You can download and copy to data/model/.After that ,train the model for a day,you can get your own model.[ICPR MTWI 2018 data+extra Eng-data+Eng-data].The password is diyj. When we make your data to tfrecord,You can train. I suggest you had better use your own data.

cd src
python train.py

Test

use also can test the tfrecord's accuracy.[usage] put your data in src/data/val and do as follows:

cd src;
python test.py

Results

Here are some results on ICPR MTWI 2018:
image

Enjoy yourself

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].