All Projects → alleveenstra → attentionocr

alleveenstra / attentionocr

Licence: MIT License
Attention OCR in Tensorflow 2.0

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects

Projects that are alternatives of or similar to attentionocr

CRNN.tf2
Convolutional Recurrent Neural Network(CRNN) for End-to-End Text Recognition - TensorFlow 2
Stars: ✭ 131 (+191.11%)
Mutual labels:  ocr, tensorflow2
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Stars: ✭ 1,409 (+3031.11%)
Mutual labels:  ocr, tensorflow2
OCR-Reader
An Android app to extract text from camera preview directly.
Stars: ✭ 43 (-4.44%)
Mutual labels:  ocr
Knowledge distillation via TF2.0
The codes for recent knowledge distillation algorithms and benchmark results via TF2.0 low-level API
Stars: ✭ 87 (+93.33%)
Mutual labels:  tensorflow2
ocromore
Process, enhance and evaluate multiple OCR output.
Stars: ✭ 16 (-64.44%)
Mutual labels:  ocr
staff identity card ocr project
Staff Identity Card OCR Project
Stars: ✭ 15 (-66.67%)
Mutual labels:  ocr
solr-ocrpayload-plugin
Efficient indexing and retrieval of OCR bounding boxes in Solr
Stars: ✭ 22 (-51.11%)
Mutual labels:  ocr
Android-Text-Scanner
Read text and numbers with android camera OCR
Stars: ✭ 27 (-40%)
Mutual labels:  ocr
BasicArabicOCR
A very basic Arabic OCR based on tesseract OCR engine written in Java.
Stars: ✭ 19 (-57.78%)
Mutual labels:  ocr
ScreenAccess
Anti Recoil system with weapon type built-in recognition based on OCR, currently support next games: Apex Legends
Stars: ✭ 41 (-8.89%)
Mutual labels:  ocr
CTC-OCR
A TensorFlow implementation of hybird CNN-LSTM model with CTC loss for OCR problem
Stars: ✭ 27 (-40%)
Mutual labels:  ocr
ocr
Simple app to extract text from pictures using Tesseract
Stars: ✭ 98 (+117.78%)
Mutual labels:  ocr
tesseract-server
A small lightweight HTTP server that converts photos, images and scanned documents to text using optical character recognition by utilizing the power of Google Tesseract.
Stars: ✭ 15 (-66.67%)
Mutual labels:  ocr
MillionHeros
Android直播答题助手,支持全部答题APP,百万英雄/百万赢家/冲顶大会/芝士超人
Stars: ✭ 23 (-48.89%)
Mutual labels:  ocr
VehicleInfoOCR
Use your camera to read number plates and obtain vehicle details. Simple, ad-free and faster alternative to existing playstore apps
Stars: ✭ 35 (-22.22%)
Mutual labels:  ocr
breach-protocol-autosolver
Solve breach protocol minigame in second(s). Windows/Linux/GeForce Now/Google Stadia. Every language.
Stars: ✭ 28 (-37.78%)
Mutual labels:  ocr
screenshot-actions
Dunst actions for screenshots (OCR, upload to 0x0.st, delete, rename, move to/from clipboard)
Stars: ✭ 49 (+8.89%)
Mutual labels:  ocr
pdf2xml-viewer
A simple viewer and inspection tool for text boxes in PDF documents
Stars: ✭ 82 (+82.22%)
Mutual labels:  ocr
car-OCR
基于机器学习和OCR的车牌识别系统 @fujunhao
Stars: ✭ 39 (-13.33%)
Mutual labels:  ocr
meltsub
Convert hardsub to softsub
Stars: ✭ 19 (-57.78%)
Mutual labels:  ocr

Attention OCR

A clear and maintainable implementation of Attention OCR in Tensorflow 2.0.

This sequence to sequence OCR model aims to provide a clear and maintainable implementation of attention based OCR.

Please note that this is currently a work in progress. Documentation is missing, but will be added when the code is stable.

This repository depends upon the following:

  • Tensorflow 2.0
  • Python 3.6+

Training a model

To train a model, first download the sources for generating synthetic data:

cd synthetic
./download_data_sources.sh

Next, in this project's root folder, run the training script:

python3 run.py

This will run a test training run. If everything went well, you'll find a file named "trained.h5" in your directory. To train a real model you should change the training parameters. See run.py its arguments to find out what is configurable.

python3 run.py --help

References

This work is based on the following work:

To do

  • Make image height variable
  • Name all input and output tensors
  • Write unit tests with full coverage
  • Show a test case on google colab
  • Perform a grid search on best parameters for a toy dataset
  • Document the whole API

Codacy Badge

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].