All Projects → zsmartercn → Tess4Android

zsmartercn / Tess4Android

Licence: other
A new fork base on tess-two and Tesseract 4.0.0

Programming Languages

c
50402 projects - #5 most used programming language
C++
36643 projects - #6 most used programming language
java
68154 projects - #9 most used programming language
shell
77523 projects
Roff
2310 projects
HTML
75241 projects

Projects that are alternatives of or similar to Tess4Android

Basicocr
BasicOCR是一个致力于解决自然场景文字识别算法研究的项目。该项目由长城数字大数据应用技术研究院佟派AI团队发起和维护。
Stars: ✭ 336 (+983.87%)
Mutual labels:  ocr, lstm
Cnn lstm ctc ocr
Tensorflow-based CNN+LSTM trained with CTC-loss for OCR
Stars: ✭ 464 (+1396.77%)
Mutual labels:  ocr, lstm
Crnn attention ocr chinese
CRNN with attention to do OCR,add Chinese recognition
Stars: ✭ 315 (+916.13%)
Mutual labels:  ocr, lstm
Kraken
OCR engine for all the languages
Stars: ✭ 304 (+880.65%)
Mutual labels:  ocr, lstm
Tesseract
This package contains an OCR engine - libtesseract and a command line program - tesseract. Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused on line recognition, but also still supports the legacy Tesseract OCR engine of Tesseract 3 which works by recognizing character patterns. Compatibility with Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0). It also needs traineddata files which support the legacy engine, for example those from the tessdata repository.
Stars: ✭ 43,199 (+139251.61%)
Mutual labels:  ocr, lstm
Caffe ocr
主流ocr算法研究实验性的项目,目前实现了CNN+BLSTM+CTC架构
Stars: ✭ 1,156 (+3629.03%)
Mutual labels:  ocr, lstm
Cnn lstm ctc tensorflow
CNN+LSTM+CTC based OCR implemented using tensorflow.
Stars: ✭ 343 (+1006.45%)
Mutual labels:  ocr, lstm
Icdar 2019 Sroie
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
Stars: ✭ 202 (+551.61%)
Mutual labels:  ocr, lstm
Ai Reading Materials
Some of the ML and DL related reading materials, research papers that I've read
Stars: ✭ 79 (+154.84%)
Mutual labels:  ocr, lstm
Lstm Ctc Ocr
using rnn (lstm or gru) and ctc to convert line image into text, based on torch7 and warp-ctc
Stars: ✭ 70 (+125.81%)
Mutual labels:  ocr, lstm
Easyocr
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Stars: ✭ 13,379 (+43058.06%)
Mutual labels:  ocr, lstm
Rnn ctc
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (+609.68%)
Mutual labels:  ocr, lstm
BiLSTM-CRF-NER-PyTorch
This repo contains a PyTorch implementation of a BiLSTM-CRF model for named entity recognition task.
Stars: ✭ 109 (+251.61%)
Mutual labels:  lstm
tibetan-ocr
Python OCR for Handwritten Tibetan Mauscripts
Stars: ✭ 19 (-38.71%)
Mutual labels:  ocr
R Unet
Video prediction using lstm and unet
Stars: ✭ 25 (-19.35%)
Mutual labels:  lstm
5baa61e4c9b93f3f0682250b6cf8331b7ee68fd8
RNN-LSTM that learns passwords from a starting list
Stars: ✭ 35 (+12.9%)
Mutual labels:  lstm
learningspoons
nlp lecture-notes and source code
Stars: ✭ 29 (-6.45%)
Mutual labels:  lstm
EBIM-NLI
Enhanced BiLSTM Inference Model for Natural Language Inference
Stars: ✭ 24 (-22.58%)
Mutual labels:  lstm
Inventory Kamera
Scans Genshin Impact characters, artifacts, and weapons from the game window into a JSON file.
Stars: ✭ 348 (+1022.58%)
Mutual labels:  ocr
shape-context-ocr
The Shape Context is a shape descriptor that captures the relative positions of other points on the shape contours, and is used to recognize characters.
Stars: ✭ 20 (-35.48%)
Mutual labels:  ocr

Tess4Android

A fork is based on tess-two and Tesseract OCR Engine. We porting Tesseract 4.0(final) to Android base on tess-two and rewrite dot product function with ARM NEON. And we use openCV to pre-process the photos we need to identify. The project include a full OCR demo App.

This project works with:

  • Tesseract 4.0.0
  • tess-two 9.0.0
  • Leptonica 1.74.3
  • libjpeg 9b
  • libpng 1.6.25
  • openCV 4.0.1

Pre-requisites

  • Android 5.0 or higher

Usage

  1. Referring to the app module in this project, you can import ocr_zs module into your app project as a library, or add the following dependencies directly, then you can use the full functions of tess4Android. The text types supported by tess4Android are simplified Chinese, English and digital.

     dependencies {
     	implementation 'com.zsmarter:Tess4Android:1.0.1'
     }
    
  2. If you only want to use the tess-two interface that supports Tesseract 4.0, you can refer to the ocr_zs module to import the tess-two module into your app project as a library, or add the following dependencies directly. in addition, you should prepare a v3.05 or higher trained data file for a language. Data files must be copied to the Android device in a subdirectory named tessdata.

     dependencies {
         	implementation 'com.googlecode:tess-two-api:1.0.1'
     }
    

Building

If you want to modify the Tess4Android code, you can build the project locally. See BUILDING.md.

Versions

Release points are tagged with version numbers. A change to the major version number indicates an API change making that version incompatible with previous versions.

The change log shows what's new in each version.

License

(C) Copyright 2018, ZSmarter Technology Co, Ltd.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].