Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → yinchangchang → Ocr_densenet

yinchangchang / Ocr_densenet

第一届西安交通大学人工智能实践大赛（2018AI实践大赛--图片文字识别）第一名；仅采用densenet识别图中文字

Programming Languages

python

139335 projects - #7 most used programming language

Labels

pytorch ocr densenet ocr-recognition

Projects that are alternatives of or similar to Ocr densenet

ID-Card-Passport-Recognition-SDK-Android

On-Device ID Card & Passport & Driver License Recognition SDK for Android

Stars: ✭ 223 (-47.53%)

Mutual labels: ocr, ocr-recognition

nimtesseract

A Tesseract OCR wrapper for Nim

Stars: ✭ 23 (-94.59%)

Mutual labels: ocr, ocr-recognition

Multi-Type-TD-TSR

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:

Stars: ✭ 174 (-59.06%)

Mutual labels: ocr, ocr-recognition

Sightseq

Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection

Stars: ✭ 116 (-72.71%)

Mutual labels: densenet, ocr

Android-Text-Scanner

Read text and numbers with android camera OCR

Stars: ✭ 27 (-93.65%)

Mutual labels: ocr, ocr-recognition

receipt-manager-app

Receipt parser application written in dart.

Stars: ✭ 140 (-67.06%)

Mutual labels: ocr, ocr-recognition

Transformer-ocr

Handwritten text recognition using transformers.

Stars: ✭ 92 (-78.35%)

Mutual labels: ocr, ocr-recognition

Deep Text Recognition Benchmark

Text recognition (optical character recognition) with deep learning methods.

Stars: ✭ 2,665 (+527.06%)

Mutual labels: ocr-recognition, ocr

python-ocr-example

The code for the blogpost A Python Approach to Character Recognition

Stars: ✭ 54 (-87.29%)

Mutual labels: ocr, ocr-recognition

IdCardRecognition

Android id card recognition based on OCR. 安卓基于OCR的身份证识别。

Stars: ✭ 35 (-91.76%)

Mutual labels: ocr, ocr-recognition

Caffe ocr

主流ocr算法研究实验性的项目，目前实现了CNN+BLSTM+CTC架构

Stars: ✭ 1,156 (+172%)

Mutual labels: densenet, ocr

VehicleInfoOCR

Use your camera to read number plates and obtain vehicle details. Simple, ad-free and faster alternative to existing playstore apps

Stars: ✭ 35 (-91.76%)

Mutual labels: ocr, ocr-recognition

Opencv

📷 Computer-Vision Demos

Stars: ✭ 244 (-42.59%)

Mutual labels: ocr-recognition, ocr

deep-learning-for-document-dewarping

An application of high resolution GANs to dewarp images of perturbed documents

Stars: ✭ 100 (-76.47%)

Mutual labels: ocr, ocr-recognition

Awesome Ocr

Stars: ✭ 198 (-53.41%)

Mutual labels: ocr-recognition, ocr

EverTranslator

Translate text anytime and everywhere, even you are gaming!

Stars: ✭ 59 (-86.12%)

Mutual labels: ocr, ocr-recognition

Textshot

Python tool for grabbing text via screenshot

Stars: ✭ 1,163 (+173.65%)

Mutual labels: ocr-recognition, ocr

Awesome Deep Text Detection Recognition

A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.

Stars: ✭ 2,282 (+436.94%)

Mutual labels: ocr-recognition, ocr

LoL-TFT-Champion-Masking

League Of Legends - Teamfight Tactics Champion Masking

Stars: ✭ 23 (-94.59%)

Mutual labels: ocr, ocr-recognition

OCR-Reader

An Android app to extract text from camera preview directly.

Stars: ✭ 43 (-89.88%)

Mutual labels: ocr, ocr-recognition

View All Similar Projects ➔

OCR

第一届西安交通大学人工智能实践大赛（2018AI实践大赛--图片文字识别）冠军

模型结果

该比赛计算每一个条目的f1score，取所有条目的平均，具体计算方式在这里。这里的计算方式不对一句话里的相同文字重复计算，故f1score比提交的最终结果低：

-	train	val
f1score	0.9911	0.9582
recall	0.9943	0.9574
precision	0.9894	0.9637

模型说明

模型

采用densenet结构，模型输入为(64×512)的图片，输出为(8×64×2159)的概率。

将图片划分为多个(8×8)的方格，在每个方格预测2159个字符的概率。

Loss

将(8×64×2159)的概率沿着长宽方向取最大值，得到(2159)的概率，表示这张图片里有对应字符的概率。

balance: 对正例和负例分别计算loss，使得正例loss权重之和与负例loss权重之和相等，解决数据不平衡的问题。

hard-mining

文字检测将(8×64×2159)的概率沿着宽方向取最大值，得到(64×2159)的概率。沿着长方向一个个方格预测文字，然后连起来可得到一句完整的语句。

存在问题：两个连续的文字无法重复检测

下图是一个文字识别正确的示例：的长为半径作圆

下图是一个文字识别错误的示例：为10元；经粗加工后销售，每

文件目录

ocr
|
|--code
|
|--files
|	|
|	|--train.csv
|
|--data
	|
	|--dataset
	|	|
	|	|--train
	|	|
	|	|--test
	|
	|--result
	|	|
	|	|--test_result.csv
	|
	|--images		此文件夹放置任何图片均可，我放的celebA数据集用作pretrain

运行环境

Ubuntu16.04, python2.7, CUDA9.0

安装pytorch, 推荐版本: 0.2.0_3

pip install -r requirement.txt

下载数据

从这里下载初赛、复赛数据、模型，合并训练集、测试集。

预处理

如果不更换数据集，不需要执行这一步。

如果更换其他数据集，一并更换 files/train.csv

cd code/preprocessing
python map_word_to_index.py
python analysis_dataset.py

训练

cd code/ocr
python main.py

测试

f1score在0.9以下，lr=0.001，不使用hard-mining；

f1score在0.9以上，lr=0.0001，使用hard-mining；

生成的model保存在不同的文件夹里。

cd code/ocr
python main.py --phase test --resume  ../../data/models-small/densenet/eval-16-1/best_f1score.ckpt

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 425

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (15) 🔗