All Projects → MohamedWael → BasicArabicOCR

MohamedWael / BasicArabicOCR

Licence: other
A very basic Arabic OCR based on tesseract OCR engine written in Java.

Programming Languages

java
68154 projects - #9 most used programming language

Projects that are alternatives of or similar to BasicArabicOCR

nmatheg
A simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple design that makes use of the different tools in our NLP pipeline.
Stars: ✭ 19 (+0%)
Mutual labels:  arabic, arabic-nlp
breach-protocol-autosolver
Solve breach protocol minigame in second(s). Windows/Linux/GeForce Now/Google Stadia. Every language.
Stars: ✭ 28 (+47.37%)
Mutual labels:  ocr, tesseract-ocr
ocreval
Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support
Stars: ✭ 48 (+152.63%)
Mutual labels:  ocr, tesseract-ocr
Tesseract4java
Java GUI and Tools for Tesseract OCR
Stars: ✭ 214 (+1026.32%)
Mutual labels:  ocr, tesseract-ocr
tajmeeaton
تجميعة من المشاريع، وخصوصا مفتوحة المصدر، للنهوض باللغة العربية والأمة. 👨‍💻 👨‍🔬👨‍🏫🧕
Stars: ✭ 115 (+505.26%)
Mutual labels:  arabic, arabic-nlp
Image2text
📋 Python wrapper to grab text from images and save as text files using Tesseract Engine
Stars: ✭ 243 (+1178.95%)
Mutual labels:  ocr, tesseract-ocr
NLP-image-to-text
code to extract text from images
Stars: ✭ 28 (+47.37%)
Mutual labels:  ocr, tesseract-ocr
Tesseract Ocr for windows
Visual Studio Projects for Tessearct and dependencies
Stars: ✭ 122 (+542.11%)
Mutual labels:  ocr, tesseract-ocr
farasapy
A Python implementation of Farasa toolkit
Stars: ✭ 69 (+263.16%)
Mutual labels:  arabic, arabic-nlp
arabic-tagger
AQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training
Stars: ✭ 38 (+100%)
Mutual labels:  arabic, arabic-nlp
Tesseract
Bindings to Tesseract OCR engine for R
Stars: ✭ 192 (+910.53%)
Mutual labels:  ocr, tesseract-ocr
TesseractStudio.Net
A free Windows graphical interface to the Tesseract 4.0 OCR engine.
Stars: ✭ 38 (+100%)
Mutual labels:  ocr, tesseract-ocr
Text Detection
Text detection with mainly MSER and SWT
Stars: ✭ 167 (+778.95%)
Mutual labels:  ocr, tesseract-ocr
receipt-manager-app
Receipt parser application written in dart.
Stars: ✭ 140 (+636.84%)
Mutual labels:  ocr, tesseract-ocr
Tesseract4android
Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.
Stars: ✭ 148 (+678.95%)
Mutual labels:  ocr, tesseract-ocr
ar-embeddings
Sentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic) using word2vec
Stars: ✭ 83 (+336.84%)
Mutual labels:  arabic, arabic-nlp
Tesseract
This package contains an OCR engine - libtesseract and a command line program - tesseract. Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused on line recognition, but also still supports the legacy Tesseract OCR engine of Tesseract 3 which works by recognizing character patterns. Compatibility with Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0). It also needs traineddata files which support the legacy engine, for example those from the tessdata repository.
Stars: ✭ 43,199 (+227263.16%)
Mutual labels:  ocr, tesseract-ocr
Aadhaar Card Ocr
Extract text information from Aadhaar Card using tesseract-ocr 😎
Stars: ✭ 112 (+489.47%)
Mutual labels:  ocr, tesseract-ocr
How-to-use-tesseract-ocr-4.0-with-csharp
How to use Tesseract OCR 4.0 with C#
Stars: ✭ 60 (+215.79%)
Mutual labels:  ocr, tesseract-ocr
Nkocr
🔎📝 This is a module to make specifics OCRs at food products and nutritional tables.
Stars: ✭ 15 (-21.05%)
Mutual labels:  ocr, tesseract-ocr

BasicArabicOCR

A very basic Arabic OCR based on tesseract OCR engine written in Java.

How to run

Firstly, downlaad the following files and extract them.

Arabic OCR

Tess4J-2.0-src_2.zip

tesseract-ocr-3.02.ara.tar.gz

Secondly, open the project "Arabic OCR" using the NetBeans IDE and then right click on the Libraries directory --> add jar/folder browse to the lib directory in the tess4j project and add the fowllowing jar files

  • ghost4j-0.5.1.jar
  • jai_imageio.jar
  • jna.jar
  • win32-x86-64

repeat the previous process to add the "tess4j.jar" file located in the Tess4J\dist directory

finally, open the class "ProcessImage.java" and find the "instance.setDatapath" using ctrl+f and paste the path of the tessdata directory located in the tesseract-ocr\tessdata

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].