All Projects → SachaIZADI → Seven-Segment-OCR

SachaIZADI / Seven-Segment-OCR

Licence: other
Computer vision project to automatically recognize digits characters in a seven-segments display

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Seven-Segment-OCR

caltech birds
A set of notebooks as a guide to the process of fine-grained image classification of birds species, using PyTorch based deep neural networks.
Stars: ✭ 29 (-50%)
Mutual labels:  classification-model
KTP-OCR
An Open Source OCR tool for Indonesian ID card (KTP).
Stars: ✭ 48 (-17.24%)
Mutual labels:  ocr
granblue-automation-android
Educational application written in Kotlin aimed at automating user-defined workflows for the mobile game, "Granblue Fantasy", using MediaProjection, AccessibilityService, and OpenCV.
Stars: ✭ 26 (-55.17%)
Mutual labels:  ocr
polling-station-app
Voting station app to redeem the suffrage on the blockchain using a machine readable travel document.
Stars: ✭ 39 (-32.76%)
Mutual labels:  ocr
craft-text-detector
Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector
Stars: ✭ 151 (+160.34%)
Mutual labels:  ocr
doctr-tfjs-demo
Javascript demo of docTR, powered by TensorFlowJS
Stars: ✭ 21 (-63.79%)
Mutual labels:  ocr
gazou
Japanese OCR for Linux & Windows
Stars: ✭ 32 (-44.83%)
Mutual labels:  ocr
tutorials
Git Repo for Articles on Ergo Sum blog and the youtube channel https://www.youtube.com/channel/UCiie9CN--dazA7iT2sry5FA
Stars: ✭ 42 (-27.59%)
Mutual labels:  ocr
proxy-scrape
scrapin' proxies with ocr
Stars: ✭ 20 (-65.52%)
Mutual labels:  ocr
go-ocr
A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.
Stars: ✭ 31 (-46.55%)
Mutual labels:  ocr
ScreencapToTextBot
Reddit bot that takes the screencap of a conversation and converts it in reddit formatted text
Stars: ✭ 12 (-79.31%)
Mutual labels:  ocr
lookup
🔍 Pure Go implementation of fast image search and simple OCR, focused on reading info from screenshots
Stars: ✭ 35 (-39.66%)
Mutual labels:  ocr
CPG
Steven C. Y. Hung, Cheng-Hao Tu, Cheng-En Wu, Chien-Hung Chen, Yi-Ming Chan, and Chu-Song Chen, "Compacting, Picking and Growing for Unforgetting Continual Learning," Thirty-third Conference on Neural Information Processing Systems, NeurIPS 2019
Stars: ✭ 91 (+56.9%)
Mutual labels:  classification-model
pdf-scripts
📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs
Stars: ✭ 33 (-43.1%)
Mutual labels:  ocr
deep-text-recognition-benchmark
PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)
Stars: ✭ 123 (+112.07%)
Mutual labels:  ocr
R2CNN
caffe re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection
Stars: ✭ 80 (+37.93%)
Mutual labels:  ocr
ibm-cloud-functions-serverless-ocr-openchecks
Serverless bank check deposit processing with object storage and optical character recognition using Apache OpenWhisk powered by IBM Cloud Functions. See the Tech Talk replay for a demo.
Stars: ✭ 40 (-31.03%)
Mutual labels:  ocr
PSENet-Tensorflow
TensorFlow implementation of PSENet text detector (Shape Robust Text Detection with Progressive Scale Expansion Networkt)
Stars: ✭ 51 (-12.07%)
Mutual labels:  ocr
TextBoxGAN
Generate text boxes from input words with a GAN.
Stars: ✭ 50 (-13.79%)
Mutual labels:  ocr
python-ocr-example
The code for the blogpost A Python Approach to Character Recognition
Stars: ✭ 54 (-6.9%)
Mutual labels:  ocr

Optimizer

A seven-segment digits OCR

Optimizer is a seven-segment digits OCR class project carried out by Alex, Priscille, Charlotte and Sacha.

Objective

The aim of the project is to digitize the monitoring of mines activities. We focused on the gas and lubricant consumption of vehicles within the mines. The idea is to build computer vision model that would enable operators to take a picture of the gas pump with their smartphones, and automatically log the value of the gas transaction. We were given ~850 pictures (of varying quality) of the gas pump with their associated values.

Approaches

We tried 2 different approaches:

  1. The "digit-per-digit" approach

    1. Image processing: identify the screen, crop the picture, grayscale, thresholding, localize digits and crop them.
    2. Learning phase: learn a "MNIST" model that predicts each digit individually.
    3. Inference phase: pass each cropped digit to the "MNIST" model, and append the results.
  2. The "end-to-end" approach

    1. Image processing: identify the screen, crop the picture, grayscale and thresholding.
    2. Learning phase: learn a model that predicts all digits at once. We based our model on the "Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks" paper by Goodfellow & al. The idea is to build a ConvNet that simultaneously learns (i) the digits and (ii) where to look for them.

Results

How to reproduce our work and run our models

TBC...

#1 Preprocessing python frame_extractor.py

#2 Preprocessing python digits_cut.py

#3 Model python main.py

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].