hikopensource / DAVAR-Lab-OCR

Licence: Apache-2.0 license
OCR toolbox from Davar-Lab

Programming Languages: Python, C++, Shell

DAVAR-OCR

This is the open-source OCR repository of DAVAR Lab, Hikvision Research Institute, China.

We maintain this repository to release implementations of our recent academic publications, along with re-implementations of popular earlier OCR algorithms and modules.

We also provide some ablation experiment comparisons to aid reproduction.

Note: Due to company policy, all of the code was re-implemented on top of the open-source frameworks mmdetection-2.11.0 and mmcv-1.3.4 from open-mmlab. The code architecture also follows mmocr, so davarocr and mmocr are well compatible with each other.

Implementations

To date, davarocr contains the following algorithms:

Basic OCR Tasks

Text Detection

Text Recognition

Text Spotting

Video Text Spotting

  • YORO (ACM MM 2019)

  • FREE (to be released) (TIP 2021)

Document Understanding Tasks

Information Extraction

Table Recognition

Layout Recognition

  • VSR (ICDAR 2021)

Reading Order Detection

Named Entity Recognition

Development Environment

The recommended environment requirements can be found in the mmdetection documentation. The minimum compatible versions are listed below.

Basic Env      Version
Python         3.6+
CUDA           10.0+
cuDNN          7.6.3+
PyTorch        1.3.0+
torchvision    0.4.1+
OpenCV         3.0.0+

Some of the algorithms (EAST, Text Perceptron) require the C++ version of OpenCV. If you do not need these algorithms, you can ignore the error about 'opencv.hpp' for now or temporarily remove the related code.
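As a rough aid, the Python-side minimums listed above can be checked with a short script. This is only a sketch and not part of davarocr: the helper names (`parse_version`, `meets_minimum`, `check_environment`) are made up for illustration, the import names `torch`, `torchvision` and `cv2` are assumed to be the usual ones for PyTorch, torchvision and OpenCV, and CUDA and cuDNN are not checked here.

```python
import importlib
import re
import sys

# Minimum versions copied from the list above. The keys are the usual
# import names for each package (an assumption, not stated in the README).
MINIMUMS = {
    "torch": "1.3.0",
    "torchvision": "0.4.1",
    "cv2": "3.0.0",
}


def parse_version(text):
    """Turn '1.8.1+cu102' into (1, 8, 1); non-numeric suffixes are ignored."""
    parts = []
    for piece in text.split(".")[:3]:
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Compare numerically, so that '1.10.0' counts as newer than '1.3.0'."""
    return parse_version(installed) >= parse_version(minimum)


def check_environment(minimums=MINIMUMS):
    """Return {name: status} for the interpreter and each listed module."""
    report = {"python": "ok" if sys.version_info >= (3, 6) else "too old"}
    for name, minimum in minimums.items():
        try:
            module = importlib.import_module(name)
        except (ImportError, OSError):
            report[name] = "not importable"
            continue
        installed = getattr(module, "__version__", "0")
        report[name] = "ok" if meets_minimum(installed, minimum) else "too old"
    return report


if __name__ == "__main__":
    for name, status in check_environment().items():
        print(f"{name}: {status}")
```

Versions are compared as integer tuples rather than strings because a plain string comparison would rank "1.10.0" below "1.3.0".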

Installation and Development Instruction

To download the repository and install davarocr, run the following commands:

git clone https://github.com/hikopensource/DAVAR-Lab-OCR.git
cd DAVAR-Lab-OCR/
bash setup.sh

This script automatically downloads and installs "mmdetection" and "mmcv-full". You can also install them manually by following the official instructions.

See each algorithm's directory for more details.
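After the script finishes, one quick way to see whether the packages landed in the active Python environment is to ask the import system to locate them. The sketch below is not part of the repository: the function name `find_missing` is hypothetical, "davarocr" is the package name used in this README, and "mmcv" and "mmdet" are assumed to be the import names for mmcv-full and mmdetection.

```python
import importlib.util

# Import names assumed for the packages that setup.sh installs.
EXPECTED_MODULES = ("mmcv", "mmdet", "davarocr")


def find_missing(modules=EXPECTED_MODULES):
    """Return the modules Python cannot locate, without importing them."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing()
    if missing:
        print("Not found:", ", ".join(missing))
    else:
        print("All expected packages were found.")
```

Locating a module does not prove it imports cleanly (a compiled extension can still fail to load), so this only catches the case where installation did not happen at all.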

Problem solution and collection

We collect problems encountered during installation and research and provide corresponding solutions. Please refer to FAQ.md for details.

Changelog

DavarOCR v0.5.1 was released on 12/05/2022. Please refer to Changelog.md for details and release history.

License

This project is released under the Apache 2.0 license.

Copyright

Copyright in our own implementations belongs to Davar-Lab, Hikvision Research Institute, China. Code taken from other open-source repositories follows its original license.

Welcome to DAVAR-LAB!

See the latest news at DAVAR-Lab. If you have any questions or suggestions, please feel free to contact us. Contact email: [email protected], [email protected], [email protected].