SachaIZADI / Seven-Segment-OCR

Licence: other

Computer vision project to automatically recognize digits characters in a seven-segments display

Programming Languages

Jupyter Notebook

11667 projects

python

139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Seven-Segment-OCR

caltech birds

A set of notebooks as a guide to the process of fine-grained image classification of birds species, using PyTorch based deep neural networks.

Stars: ✭ 29 (-50%)

Mutual labels: classification-model

KTP-OCR

An Open Source OCR tool for Indonesian ID card (KTP).

Stars: ✭ 48 (-17.24%)

Mutual labels: ocr

granblue-automation-android

Educational application written in Kotlin aimed at automating user-defined workflows for the mobile game, "Granblue Fantasy", using MediaProjection, AccessibilityService, and OpenCV.

Stars: ✭ 26 (-55.17%)

Mutual labels: ocr

polling-station-app

Voting station app to redeem the suffrage on the blockchain using a machine readable travel document.

Stars: ✭ 39 (-32.76%)

Mutual labels: ocr

craft-text-detector

Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector

Stars: ✭ 151 (+160.34%)

Mutual labels: ocr

doctr-tfjs-demo

Javascript demo of docTR, powered by TensorFlowJS

Stars: ✭ 21 (-63.79%)

Mutual labels: ocr

gazou

Japanese OCR for Linux & Windows

Stars: ✭ 32 (-44.83%)

Mutual labels: ocr

tutorials

Git Repo for Articles on Ergo Sum blog and the youtube channel https://www.youtube.com/channel/UCiie9CN--dazA7iT2sry5FA

Stars: ✭ 42 (-27.59%)

Mutual labels: ocr

proxy-scrape

scrapin' proxies with ocr

Stars: ✭ 20 (-65.52%)

Mutual labels: ocr

go-ocr

A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.

Stars: ✭ 31 (-46.55%)

Mutual labels: ocr

ScreencapToTextBot

Reddit bot that takes the screencap of a conversation and converts it in reddit formatted text

Stars: ✭ 12 (-79.31%)

Mutual labels: ocr

lookup

🔍 Pure Go implementation of fast image search and simple OCR, focused on reading info from screenshots

Stars: ✭ 35 (-39.66%)

Mutual labels: ocr

CPG

Steven C. Y. Hung, Cheng-Hao Tu, Cheng-En Wu, Chien-Hung Chen, Yi-Ming Chan, and Chu-Song Chen, "Compacting, Picking and Growing for Unforgetting Continual Learning," Thirty-third Conference on Neural Information Processing Systems, NeurIPS 2019

Stars: ✭ 91 (+56.9%)

Mutual labels: classification-model

pdf-scripts

📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs

Stars: ✭ 33 (-43.1%)

Mutual labels: ocr

deep-text-recognition-benchmark

PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)

Stars: ✭ 123 (+112.07%)

Mutual labels: ocr

R2CNN

caffe re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

Stars: ✭ 80 (+37.93%)

Mutual labels: ocr

ibm-cloud-functions-serverless-ocr-openchecks

Serverless bank check deposit processing with object storage and optical character recognition using Apache OpenWhisk powered by IBM Cloud Functions. See the Tech Talk replay for a demo.

Stars: ✭ 40 (-31.03%)

Mutual labels: ocr

PSENet-Tensorflow

TensorFlow implementation of PSENet text detector (Shape Robust Text Detection with Progressive Scale Expansion Networkt)

Stars: ✭ 51 (-12.07%)

Mutual labels: ocr

TextBoxGAN

Generate text boxes from input words with a GAN.

Stars: ✭ 50 (-13.79%)

Mutual labels: ocr

python-ocr-example

The code for the blogpost A Python Approach to Character Recognition

Stars: ✭ 54 (-6.9%)

Mutual labels: ocr

View All Similar Projects ➔

Optimizer

A seven-segment digits OCR

Optimizer is a seven-segment digits OCR class project carried out by Alex, Priscille, Charlotte and Sacha.

Objective

The aim of the project is to digitize the monitoring of mines activities. We focused on the gas and lubricant consumption of vehicles within the mines. The idea is to build computer vision model that would enable operators to take a picture of the gas pump with their smartphones, and automatically log the value of the gas transaction. We were given ~850 pictures (of varying quality) of the gas pump with their associated values.

Approaches

We tried 2 different approaches:

The "digit-per-digit" approach
1. Image processing: identify the screen, crop the picture, grayscale, thresholding, localize digits and crop them.
2. Learning phase: learn a "MNIST" model that predicts each digit individually.
3. Inference phase: pass each cropped digit to the "MNIST" model, and append the results.
The "end-to-end" approach
1. Image processing: identify the screen, crop the picture, grayscale and thresholding.
2. Learning phase: learn a model that predicts all digits at once. We based our model on the "Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks" paper by Goodfellow & al. The idea is to build a ConvNet that simultaneously learns (i) the digits and (ii) where to look for them.

Results

How to reproduce our work and run our models

TBC...

#1 Preprocessing python frame_extractor.py

#2 Preprocessing python digits_cut.py

#3 Model python main.py

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

SachaIZADI / Seven-Segment-OCR

Programming Languages

Labels

Projects that are alternatives of or similar to Seven-Segment-OCR

Optimizer

A seven-segment digits OCR

Objective

Approaches

Results

How to reproduce our work and run our models