All Projects → zacharywhitley → Awesome Ocr

zacharywhitley / Awesome Ocr

Projects that are alternatives of or similar to Awesome Ocr

Android-Text-Scanner
Read text and numbers with android camera OCR
Stars: ✭ 27 (-86.36%)
Mutual labels:  ocr, ocr-recognition
Ocr densenet
第一届西安交通大学人工智能实践大赛(2018AI实践大赛--图片文字识别)第一名;仅采用densenet识别图中文字
Stars: ✭ 425 (+114.65%)
Mutual labels:  ocr-recognition, ocr
OCR-Reader
An Android app to extract text from camera preview directly.
Stars: ✭ 43 (-78.28%)
Mutual labels:  ocr, ocr-recognition
LoL-TFT-Champion-Masking
League Of Legends - Teamfight Tactics Champion Masking
Stars: ✭ 23 (-88.38%)
Mutual labels:  ocr, ocr-recognition
Zxingcamera
Camera for Android,身份证号码识别 (本地,实时)
Stars: ✭ 34 (-82.83%)
Mutual labels:  ocr-recognition, ocr
IdCardRecognition
Android id card recognition based on OCR. 安卓基于OCR的身份证识别。
Stars: ✭ 35 (-82.32%)
Mutual labels:  ocr, ocr-recognition
Vedastr
A scene text recognition toolbox based on PyTorch
Stars: ✭ 290 (+46.46%)
Mutual labels:  ocr-recognition, ocr
Multi-Type-TD-TSR
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
Stars: ✭ 174 (-12.12%)
Mutual labels:  ocr, ocr-recognition
Attention Ocr
A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.
Stars: ✭ 844 (+326.26%)
Mutual labels:  ocr-recognition, ocr
Trwebocr
开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~
Stars: ✭ 618 (+212.12%)
Mutual labels:  ocr-recognition, ocr
nimtesseract
A Tesseract OCR wrapper for Nim
Stars: ✭ 23 (-88.38%)
Mutual labels:  ocr, ocr-recognition
Awesome Deep Text Detection Recognition
A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.
Stars: ✭ 2,282 (+1052.53%)
Mutual labels:  ocr-recognition, ocr
Transformer-ocr
Handwritten text recognition using transformers.
Stars: ✭ 92 (-53.54%)
Mutual labels:  ocr, ocr-recognition
python-ocr-example
The code for the blogpost A Python Approach to Character Recognition
Stars: ✭ 54 (-72.73%)
Mutual labels:  ocr, ocr-recognition
EverTranslator
Translate text anytime and everywhere, even you are gaming!
Stars: ✭ 59 (-70.2%)
Mutual labels:  ocr, ocr-recognition
VehicleInfoOCR
Use your camera to read number plates and obtain vehicle details. Simple, ad-free and faster alternative to existing playstore apps
Stars: ✭ 35 (-82.32%)
Mutual labels:  ocr, ocr-recognition
deep-learning-for-document-dewarping
An application of high resolution GANs to dewarp images of perturbed documents
Stars: ✭ 100 (-49.49%)
Mutual labels:  ocr, ocr-recognition
ID-Card-Passport-Recognition-SDK-Android
On-Device ID Card & Passport & Driver License Recognition SDK for Android
Stars: ✭ 223 (+12.63%)
Mutual labels:  ocr, ocr-recognition
Easyocr
Java OCR 识别组件(基于Tesseract OCR 引擎)。能自动完成图片清理、识别 CAPTCHA 验证码图片内容的一体化工作。Java Image cleanup, OCR recognition component (based Tesseract OCR engine, automatically cleanup image and identification CAPTCHA verification code picture content).
Stars: ✭ 466 (+135.35%)
Mutual labels:  ocr-recognition, ocr
Textshot
Python tool for grabbing text via screenshot
Stars: ✭ 1,163 (+487.37%)
Mutual labels:  ocr-recognition, ocr

Awesome OCR

Deskewing and Dewarping

Segmentation

Line Segmentation

Character Segmentation

Word Segmentation

Document Segmentation

Form Segmentation

Handwritten

Table detection

Language detection

  • lingua - The most accurate natural language detection library for Java and other JVM languages, suitable for long and short text alike
  • langdetect
  • whatthelang - Lightning Fast Language Prediction rocket
  • wiki-lang-detect

Text detection and localization

  • CUTIE
  • AttentionOCR
  • crpn
  • PAN-PSEnet
  • Scene-Text-Detection-with-SPECNET - Repository for Scene Text Detection with Supervised Pyramid Context Network with tensorflow.
  • Character-Region-Awareness-for-Text-Detection
  • Real-time-Scene-Text-Detection-and-Recognition-System - End-to-end pipeline for real-time scene text detection and recognition.
  • ocr_attention - Robust Scene Text Recognition with Automatic Rectification.
  • masktextspotter.caffee2 - The code of "Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes".
  • InceptText-Tensorflow - An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection.
  • textspotter - An End-to-End TextSpotter with Explicit Alignment and Attention
  • RRD - RRD: Rotation-Sensitive Regression for Oriented Scene Text Detection.
  • crpn - Corner-based Region Proposal Network.
  • SSTDNet - Implement 'Single Shot Text Detector with Regional Attention, ICCV 2017 Spotlight'.
  • R2CNN - caffe re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection.
  • RRPN - Source code of RRPN ---- Arbitrary-Oriented Scene Text Detection via Rotation Proposals
  • Tensorflow_SceneText_Oriented_Box_Predictor - This project modify tensorflow object detection api code to predict oriented bounding boxes. It can be used for scene text detection.
  • DeepSceneTextReader - This is a c++ project deploying a deep scene text reading pipeline with tensorflow. It reads text from natural scene images. It uses frozen tensorflow graphs. The detector detect scene text locations. The recognizer reads word from each detected bounding box.
  • DeRPN - A novel region proposal network for more general object detection ( including scene text detection ).
  • Bartzi/see - SEE: Towards Semi-Supervised End-to-End Scene Text Recognition
  • Bartzi/stn-ocr - Code for the paper STN-OCR: A single Neural Network for Text Detection and Text Recognition
  • beacandler/R2CNN - caffe re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection
  • HsiehYiChia/Scene-text-recognition - Scene text detection and recognition based on Extremal Region(ER)
  • tensorflow_PSENet - This is a tensorflow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network.
  • R2CNN_Faster-RCNN_Tensorflow - Rotational region detection based on Faster-RCNN.
  • corner - Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation
  • Corner_Segmentation_TextDetection - Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation.
  • TextSnake.pytorch - A PyTorch implementation of ECCV2018 Paper: TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes
  • AON - Implementation for CVPR 2018 text recognition Paper by Tensorflow: "AON: Towards Arbitrarily-Oriented Text Recognition"
  • pixel_link - Implementation of our paper 'PixelLink: Detecting Scene Text via Instance Segmentation' in AAAI2018
  • seglink - An Implementation of the seglink alogrithm in paper Detecting Oriented Text in Natural Images by Linking Segments (=> pixe_link)
  • SSTD - Single Shot Text Detector with Regional Attention
  • MORAN_v2 - MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition
  • Curve-Text-Detector - This repository provides train&test code, dataset, det.&rec. annotation, evaluation script, annotation tool, and ranking table.
  • tensorflow_PSENet - This is a tensorflow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network
  • HCIILAB/DeRPN - A novel region proposal network for more general object detection ( including scene text detection ).
  • TextField - TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)
  • tensorflow-TextMountain - TextMountain: Accurate Scene Text Detection via Instance Segmentation
  • Bartzi/see - Code for the AAAI 2018 publication "SEE: Towards Semi-Supervised End-to-End Scene Text Recognition"
  • bgshih/aster - Recognizing cropped text in natural images.

FOTS

  • FOTS - An Implementation of the FOTS: Fast Oriented Text Spotting with a Unified Network.
  • FOTS_OCR

CTPN

EAST

CRAFT

TextBoxes

PSENet

  • tensorflow_PSENet - PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network
  • PSENet - Shape Robust Text Detection with Progressive Scale Expansion Network.

Font detection

  • typefont - The first open-source library that detects the font of a text in a image.

Optical Character Recognition Engines and Frameworks

Awesome lists

Proprietary OCR Engines

File formats and tools

  • nw-page-editor - Simple app for visual editing of Page XML files
  • hocr
  • alto
  • PageXML
  • ocr-fileformat - Validate and transform various OCR file formats
  • hocr-tools - Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

Datasets

Data augmentation and Synthetic data generation

Pre OCR Processing

Post OCR Correction

Benchmarks

misc

  • ocrodeg - a small Python library implementing document image degradation for data augmentation for handwriting recognition and OCR applications.
  • scantailor - Scan Tailor is an interactive post-processing tool for scanned pages.
  • jlsutherland/doc2text - help researchers fix these errors and extract the highest quality text from their pdfs as possible.
  • mauvilsa/nw-page-editor - Simple app for visual editing of Page XML files.
  • Transkribus - Transkribus is a comprehensive platform for the digitisation, AI-powered recognition, transcription and searching of historical documents.
  • http://projectnaptha.com/
  • https://github.com/4lex4/scantailor-advanced
  • open-semantic-search - Open Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
  • ocrserver - A simple OCR API server, seriously easy to be deployed by Docker, on Heroku as well
  • cosc428-structor - ~1000 book pages + OpenCV + python = page regions identified as paragraphs, lines, images, captions, etc.
  • nidaba - An expandable and scalable OCR pipeline
  • https://github.com/MaybeShewill-CV/CRNN_Tensorflow
  • OCRmyPDF
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].