All Projects β†’ Monogramm β†’ erpnext_ocr

Monogramm / erpnext_ocr

Licence: MIT, Unknown licenses found Licenses found MIT LICENSE Unknown license.txt
🐍 βš—οΈ Optical Character Recognition using tesseract within Frappe.

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects
javascript
184084 projects - #8 most used programming language
Dockerfile
14818 projects

Projects that are alternatives of or similar to erpnext ocr

Tesseract
Bindings to Tesseract OCR engine for R
Stars: ✭ 192 (+231.03%)
Mutual labels:  ocr, tesseract
saram
Get OCR in txt form from an image or pdf extension supporting multiple files from directory using pytesseract with auto rotation for wrong orientation. PYPI:
Stars: ✭ 51 (-12.07%)
Mutual labels:  ocr, tesseract
Tessdata fast
Fast integer versions of trained LSTM models
Stars: ✭ 221 (+281.03%)
Mutual labels:  ocr, tesseract
Swiftytesseract
A Swift wrapper around Tesseract for use in iOS, macOS, and Linux applications
Stars: ✭ 170 (+193.1%)
Mutual labels:  ocr, tesseract
memento
Organize your meme image cluster in a better format using OCR from the meme to sort them using tesseract along with editing memes by segmenting them using OpenCV within a directory
Stars: ✭ 70 (+20.69%)
Mutual labels:  ocr, tesseract
Tesseract Ocr For Php
A wrapper to work with Tesseract OCR inside PHP.
Stars: ✭ 2,247 (+3774.14%)
Mutual labels:  ocr, tesseract
Tesstrain
Train Tesseract LSTM with make
Stars: ✭ 251 (+332.76%)
Mutual labels:  ocr, tesseract
Tesseract Macos
Objective C wrapper for the open source OCR Engine Tesseract (macOS)
Stars: ✭ 154 (+165.52%)
Mutual labels:  ocr, tesseract
ocr2text
Convert a PDF via OCR to a TXT file in UTF-8 encoding
Stars: ✭ 90 (+55.17%)
Mutual labels:  ocr, tesseract
ScribeBot
A highly scriptable automation system full of cool features. Automate everything with a little bit of Lua.
Stars: ✭ 72 (+24.14%)
Mutual labels:  ocr, tesseract
Ocr Table
Extract tables from scanned image PDFs using Optical Character Recognition.
Stars: ✭ 165 (+184.48%)
Mutual labels:  ocr, tesseract
ReadToMe
No description or website provided.
Stars: ✭ 51 (-12.07%)
Mutual labels:  ocr, tesseract
Lambda Text Extractor
AWS Lambda functions to extract text from various binary formats.
Stars: ✭ 159 (+174.14%)
Mutual labels:  ocr, tesseract
Android Ocr
Experimental optical character recognition app
Stars: ✭ 2,177 (+3653.45%)
Mutual labels:  ocr, tesseract
Ocrtable
Recognize tables and text from scanned images that contain tables. δ»ŽεŒ…ε«θ‘¨ζ Όηš„ζ‰«ζε›Ύη‰‡δΈ­θ―†εˆ«θ‘¨ζ Όε’Œζ–‡ε­—
Stars: ✭ 155 (+167.24%)
Mutual labels:  ocr, tesseract
Image2text
πŸ“‹ Python wrapper to grab text from images and save as text files using Tesseract Engine
Stars: ✭ 243 (+318.97%)
Mutual labels:  ocr, tesseract
Tesseract Ocr for windows
Visual Studio Projects for Tessearct and dependencies
Stars: ✭ 122 (+110.34%)
Mutual labels:  ocr, tesseract
Tesseract4android
Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.
Stars: ✭ 148 (+155.17%)
Mutual labels:  ocr, tesseract
renovation-core
The Frappe Javascript Front End Missing SDK
Stars: ✭ 23 (-60.34%)
Mutual labels:  frappe, erpnext
pmOCR
A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR conversion on file activity
Stars: ✭ 53 (-8.62%)
Mutual labels:  ocr, tesseract

License: MIT Managed with Taiga.io Build Status Codacy Badge Coverage Status

ERPNext OCR

βš—οΈ Experimental Frappe OCR application with tesseract.

This project is a fork of ERPNext-OCR by John Vincent Fiel. Its aim is to fix and cleanup the original source code and add some new features.

Check out more on ERPNext Discuss.

πŸ“ˆ Changes

See CHANGELOG

πŸ”– Roadmap

See Taiga.io

🚧 Install

Pre-requisites: tesseract-python and imagemagick

Install tesseract-ocr, plus imagemagick and ghostscript (to work with pdf files) using this command on Debian:

sudo apt-get install tesseract-ocr imagemagick libmagickwand-dev ghostscript

Install Frappe application

bench get-app --branch develop erpnext_ocr https://github.com/Monogramm/erpnext_ocr
bench install-app erpnext_ocr

When installing Frappe app, the following python requirements will be installed:

  • python binding for tesseract, tesserocr

  • image processing library in python, pillow

  • HTTP library in python, requests

  • python binding for imagemagick, wand

πŸš€ Usage

File Being Read:

File Being Read

Sample Screenshot:

Sample Screenshot

Tesseract trained data

In order to use OCR with different languages, you need to install the appropriate trained data files. Check tesseract Wiki for details: https://github.com/tesseract-ocr/tesseract/wiki/Data-Files

Development

If you wish to develop or just test locally this application, you can use docker-compose up -d at the root of the this repository. You can then access your ERPNext OCR dev env at http://localhost:8080.

Known issues

βœ… Run tests

bench run-tests --app erpnext_ocr

πŸ‘€ Authors

Monogramm

John Vincent Fiel

🀝 Contributing

Contributions, issues and feature requests are welcome!
Feel free to check issues page. Check the contributing guide.

πŸ‘ Show your support

Give a ⭐ if this project helped you!

πŸ“„ License

Copyright Β© 2019 Monogramm.
This project is MIT licensed.


This README was generated with ❀️ by readme-md-generator

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].