All Projects → tesseract-ocr → Tessdata

tesseract-ocr / Tessdata

Licence: apache-2.0
Trained models with support for legacy and LSTM OCR engine

Projects that are alternatives of or similar to Tessdata

Card Ocr
身份证识别OCR
Stars: ✭ 345 (-91.73%)
Mutual labels:  ocr, tesseract
IdCardRecognition
Android id card recognition based on OCR. 安卓基于OCR的身份证识别。
Stars: ✭ 35 (-99.16%)
Mutual labels:  ocr, tesseract
textocry
Textocry - Copy text from Images (chrome extension)
Stars: ✭ 29 (-99.31%)
Mutual labels:  ocr, tesseract
nimtesseract
A Tesseract OCR wrapper for Nim
Stars: ✭ 23 (-99.45%)
Mutual labels:  ocr, tesseract
ocr
Simple app to extract text from pictures using Tesseract
Stars: ✭ 98 (-97.65%)
Mutual labels:  ocr, tesseract
Nkocr
🔎📝 This is a module to make specifics OCRs at food products and nutritional tables.
Stars: ✭ 15 (-99.64%)
Mutual labels:  ocr, tesseract
tesseract-ocr
Node.js wrapper for Tesseract OCR CLI.
Stars: ✭ 29 (-99.31%)
Mutual labels:  ocr, tesseract
erpnext ocr
🐍 ⚗️ Optical Character Recognition using tesseract within Frappe.
Stars: ✭ 58 (-98.61%)
Mutual labels:  ocr, tesseract
staff identity card ocr project
Staff Identity Card OCR Project
Stars: ✭ 15 (-99.64%)
Mutual labels:  ocr, tesseract
cordova-plugin-tesseract
Cordova Plugin for OCR process using Tesseract
Stars: ✭ 70 (-98.32%)
Mutual labels:  ocr, tesseract
tesseract-unity
Standalone OCR plugin for Unity using Tesseract
Stars: ✭ 35 (-99.16%)
Mutual labels:  ocr, tesseract
Qanswer
【Deprecated】🥇🥇🥇 冲顶大会等游戏答题助手,提供答题辅助决策 ,帮助顺利吃鸡
Stars: ✭ 326 (-92.19%)
Mutual labels:  ocr, tesseract
ruzzle-solver
A python script that solves ruzzle boards
Stars: ✭ 46 (-98.9%)
Mutual labels:  ocr, tesseract
OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Stars: ✭ 6,560 (+57.2%)
Mutual labels:  ocr, tesseract
How-to-use-tesseract-ocr-4.0-with-csharp
How to use Tesseract OCR 4.0 with C#
Stars: ✭ 60 (-98.56%)
Mutual labels:  ocr, tesseract
LaraOCR
Laravel Optical Character Reader(OCR) package using ocr engines like Tesseract
Stars: ✭ 88 (-97.89%)
Mutual labels:  ocr, tesseract
saram
Get OCR in txt form from an image or pdf extension supporting multiple files from directory using pytesseract with auto rotation for wrong orientation. PYPI:
Stars: ✭ 51 (-98.78%)
Mutual labels:  ocr, tesseract
ocr2text
Convert a PDF via OCR to a TXT file in UTF-8 encoding
Stars: ✭ 90 (-97.84%)
Mutual labels:  ocr, tesseract
TesseractStudio.Net
A free Windows graphical interface to the Tesseract 4.0 OCR engine.
Stars: ✭ 38 (-99.09%)
Mutual labels:  ocr, tesseract
breach-protocol-autosolver
Solve breach protocol minigame in second(s). Windows/Linux/GeForce Now/Google Stadia. Every language.
Stars: ✭ 28 (-99.33%)
Mutual labels:  ocr, tesseract

tessdata

These language data files only work with Tesseract 4.0.0 and newer versions. They are based on the sources in tesseract-ocr/langdata on GitHub. (still to be updated for 4.0.0 - 20180322)

These have models for legacy tesseract engine (--oem 0) as well as the new LSTM neural net based engine (--oem 1).

The LSTM models (--oem 1) in these files have been updated to the integerized versions of tessdata_best on GitHub. So, they should be faster but probably a little less accurate than tessdata_best.

tessdata_fast on GitHub provides an alternate set of integerized LSTM models which have been built with a smaller network. tessdata_fast files are the ones packaged for Debian and Ubuntu.

The legacy tesseract models (--oem 0) have been removed for Indic and Arabic script language files.

tessdata for 3.04 or 3.05

Get language data files for Tesseract 3.04 or 3.05 from the 3.04 tree.

More information and a complete list of all languages is available in the Tesseract wiki.

All data in the repository are licensed under the Apache-2.0 License, see file LICENSE.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].