All Projects → Php Apache Tika → Similar Projects or Alternatives

674 Open source projects that are alternatives of or similar to Php Apache Tika

Image Text Localization Recognition
A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集 シーンテキストの位置認識と識別のための論文リソースの要約
Stars: ✭ 788 (+936.84%)
Mutual labels:  ocr, text-recognition, text-extraction
ocr
Simple app to extract text from pictures using Tesseract
Stars: ✭ 98 (+28.95%)
Mutual labels:  ocr, text-extraction, text-recognition
Scene Text Recognition
Scene text detection and recognition based on Extremal Region(ER)
Stars: ✭ 146 (+92.11%)
Mutual labels:  ocr, text-recognition
Text recognition toolbox
text_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way.
Stars: ✭ 114 (+50%)
Mutual labels:  ocr, text-recognition
Transformer str
PyTorch implementation of my new method for Scene Text Recognition (STR) based on Transformer,Equipped with Transformer, this method outperforms the best model of the aforementioned deep-text-recognition-benchmark by 7.6% on CUTE80.
Stars: ✭ 131 (+72.37%)
Mutual labels:  ocr, text-recognition
Unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Stars: ✭ 4,082 (+5271.05%)
Mutual labels:  ocr, text-recognition
Lambda Text Extractor
AWS Lambda functions to extract text from various binary formats.
Stars: ✭ 159 (+109.21%)
Mutual labels:  ocr, text-extraction
Sar tf
This is an implementation of Show, Attend and Read with tensorflow
Stars: ✭ 70 (-7.89%)
Mutual labels:  ocr, text-recognition
Sightseq
Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection
Stars: ✭ 116 (+52.63%)
Mutual labels:  ocr, text-recognition
Ocr.pytorch
A pure pytorch implemented ocr project including text detection and recognition
Stars: ✭ 196 (+157.89%)
Mutual labels:  ocr, text-recognition
EverTranslator
Translate text anytime and everywhere, even you are gaming!
Stars: ✭ 59 (-22.37%)
Mutual labels:  ocr, text-recognition
insightocr
MXNet OCR implementation. Including text recognition and detection.
Stars: ✭ 100 (+31.58%)
Mutual labels:  ocr, text-recognition
React Native Tesseract Ocr
Tesseract OCR wrapper for React Native
Stars: ✭ 384 (+405.26%)
Mutual labels:  ocr, text-recognition
lego-mindstorms-51515-jetson-nano
Combines the LEGO Mindstorms 51515 with the NVIDIA Jetson Nano
Stars: ✭ 31 (-59.21%)
Mutual labels:  ocr, text-recognition
Crnn
Convolutional recurrent neural network for scene text recognition or OCR in Keras
Stars: ✭ 68 (-10.53%)
Mutual labels:  ocr, text-recognition
Deep Text Recognition Benchmark
Text recognition (optical character recognition) with deep learning methods.
Stars: ✭ 2,665 (+3406.58%)
Mutual labels:  ocr, text-recognition
Chineseaddress ocr
Photographing Chinese-Address OCR implemented using CTPN+CTC+Address Correction. 拍照文档中文地址文字识别。
Stars: ✭ 309 (+306.58%)
Mutual labels:  ocr, text-recognition
Vedastr
A scene text recognition toolbox based on PyTorch
Stars: ✭ 290 (+281.58%)
Mutual labels:  ocr, text-recognition
Node Tesseract Ocr
A Node.js wrapper for the Tesseract OCR API
Stars: ✭ 92 (+21.05%)
Mutual labels:  ocr, text-recognition
Tesseract Ocr For Php
A wrapper to work with Tesseract OCR inside PHP.
Stars: ✭ 2,247 (+2856.58%)
Mutual labels:  ocr, text-recognition
Awesome Deep Text Detection Recognition
A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.
Stars: ✭ 2,282 (+2902.63%)
Mutual labels:  ocr, text-recognition
Tika Python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Stars: ✭ 997 (+1211.84%)
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Stars: ✭ 1,409 (+1753.95%)
Mutual labels:  ocr, text-recognition
Awesome Ocr Resources
A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).
Stars: ✭ 335 (+340.79%)
Mutual labels:  ocr, text-recognition
Megreader
A research project for text detection and recognition using PyTorch 1.2.
Stars: ✭ 332 (+336.84%)
Mutual labels:  ocr, text-recognition
Cnn lstm ctc ocr
Tensorflow-based CNN+LSTM trained with CTC-loss for OCR
Stars: ✭ 464 (+510.53%)
Mutual labels:  ocr, text-recognition
CRNN
Convolutional recurrent neural network for scene text recognition or OCR in Keras
Stars: ✭ 96 (+26.32%)
Mutual labels:  ocr, text-recognition
Crnn.pytorch
crnn实现水平和垂直方向中文文字识别, 提供在3w多个中文字符训练的水平识别和垂直识别的预训练模型; 欢迎关注,试用和反馈问题... ...
Stars: ✭ 145 (+90.79%)
Mutual labels:  ocr, text-recognition
Crnn With Stn
implement CRNN in Keras with Spatial Transformer Network
Stars: ✭ 83 (+9.21%)
Mutual labels:  ocr, text-recognition
NLP-image-to-text
code to extract text from images
Stars: ✭ 28 (-63.16%)
Mutual labels:  ocr, text-recognition
Adelaidet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Stars: ✭ 2,565 (+3275%)
Mutual labels:  ocr, text-recognition
Textrecognitiondatagenerator
A synthetic data generator for text recognition
Stars: ✭ 2,075 (+2630.26%)
Mutual labels:  ocr, text-recognition
MLKit
🌝 MLKit是一个强大易用的工具包。通过ML Kit您可以很轻松的实现文字识别、条码识别、图像标记、人脸检测、对象检测等功能。
Stars: ✭ 294 (+286.84%)
Mutual labels:  ocr, text-recognition
Aster.pytorch
ASTER in Pytorch
Stars: ✭ 473 (+522.37%)
Mutual labels:  ocr, text-recognition
Tr
Free Offline OCR 离线的中文文本检测+识别SDK
Stars: ✭ 598 (+686.84%)
Mutual labels:  ocr, text-recognition
Pdfio.jl
PDF Reader Library for Native Julia.
Stars: ✭ 56 (-26.32%)
Mutual labels:  text-extraction
Caffe ocr
主流ocr算法研究实验性的项目,目前实现了CNN+BLSTM+CTC架构
Stars: ✭ 1,156 (+1421.05%)
Mutual labels:  ocr
Open Semantic Search Apps
Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations and named entities) and data import (ETL like text extraction, OCR and crawling filesystems or websites)
Stars: ✭ 55 (-27.63%)
Mutual labels:  ocr
Maven Site
Apache Maven site
Stars: ✭ 54 (-28.95%)
Mutual labels:  apache
Snipit
Snipit allows you to capture and save interesting sections from any source of information. Be it textbooks, journals, computer screens, photographs, flyers, writings on a whiteboard, etc.
Stars: ✭ 70 (-7.89%)
Mutual labels:  ocr
Dbnet Lite.pytorch
A pytorch re-implementation of Real-time Scene Text Detection with Differentiable Binarization
Stars: ✭ 69 (-9.21%)
Mutual labels:  ocr
Druid Exporter
A Golang based exporter captures druid API related metrics and receives druid-emitting HTTP JSON data.
Stars: ✭ 54 (-28.95%)
Mutual labels:  apache
Simplehtr
Handwritten Text Recognition (HTR) system implemented with TensorFlow.
Stars: ✭ 1,072 (+1310.53%)
Mutual labels:  ocr
Seven Segment Ocr
Digitizes numbers from videos & images of seven segment displays
Stars: ✭ 53 (-30.26%)
Mutual labels:  ocr
Chinese Text Detection And Recognition
Assignment of Image Analysis and Understanding
Stars: ✭ 53 (-30.26%)
Mutual labels:  text-recognition
Papermerge
Open Source Document Management System for Digital Archives (Scanned Documents)
Stars: ✭ 1,177 (+1448.68%)
Mutual labels:  ocr
Docker Debian Base
More complete Debian environment for Docker
Stars: ✭ 70 (-7.89%)
Mutual labels:  apache
Scanbot Sdk Example Android
Document scanning SDK example apps for the Scanbot SDK for Android.
Stars: ✭ 67 (-11.84%)
Mutual labels:  ocr
Neutron
🌠 Purely functional Apache Pulsar client for Scala built on top of Fs2
Stars: ✭ 53 (-30.26%)
Mutual labels:  apache
Phpwpinfo
Provides an equivalent to the `phpinfo()` but with more WordPress requirements details.
Stars: ✭ 52 (-31.58%)
Mutual labels:  apache
Ocr Electron Vue
📇 A Simple OCR Application built on Electron, Vue.js & Tesseract.js
Stars: ✭ 67 (-11.84%)
Mutual labels:  ocr
Idmatch
Match faces on id cards with OCR capabilities.
Stars: ✭ 52 (-31.58%)
Mutual labels:  ocr
Slowloris
Asynchronous Python implementation of SlowLoris DoS attack
Stars: ✭ 51 (-32.89%)
Mutual labels:  apache
Fail2ban.webexploits
This custom Fail2Ban filter and jail will deal with all scans for common Wordpress, Joomla and other Web Exploits being scanned for by automated bots and those seeking to find exploitable web sites.
Stars: ✭ 67 (-11.84%)
Mutual labels:  apache
Tesseract Python
Examples to implement OCR(Optical Character Recognition) using tesseract using Python
Stars: ✭ 49 (-35.53%)
Mutual labels:  ocr
Swiftytesseractrte
SwiftyTesseract Real-Time Engine
Stars: ✭ 49 (-35.53%)
Mutual labels:  ocr
Ultimatemrz Sdk
Machine-readable zone/travel document (MRZ / MRTD) detector and recognizer using deep learning
Stars: ✭ 66 (-13.16%)
Mutual labels:  ocr
Coremlvisionscanner
CoreML Vision Text Data & Animal Detector iOS App
Stars: ✭ 49 (-35.53%)
Mutual labels:  ocr
Ansible Config encoder filters
Ansible role used to deliver the Config Encoder Filters.
Stars: ✭ 48 (-36.84%)
Mutual labels:  apache
Rare
Fast, realtime regex-extraction, and aggregation into common formats such as histograms, numerical summaries, tables, and more!
Stars: ✭ 76 (+0%)
Mutual labels:  apache
1-60 of 674 similar projects