All Projects → Php Apache Tika → Similar Projects or Alternatives

674 Open source projects that are alternatives of or similar to Php Apache Tika

Aster
Recognizing cropped text in natural images.
Stars: ✭ 626 (+723.68%)
Mutual labels:  ocr
Anti Webspider
Web 端反爬技术方案
Stars: ✭ 486 (+539.47%)
Mutual labels:  ocr
Mlkit
A collection of sample apps to demonstrate how to use Google's ML Kit APIs on Android and iOS
Stars: ✭ 949 (+1148.68%)
Mutual labels:  text-recognition
Seglink
An Implementation of the seglink alogrithm in paper Detecting Oriented Text in Natural Images by Linking Segments
Stars: ✭ 479 (+530.26%)
Mutual labels:  ocr
Pdfio.jl
PDF Reader Library for Native Julia.
Stars: ✭ 56 (-26.32%)
Mutual labels:  text-extraction
Swiftocr
Fast and simple OCR library written in Swift
Stars: ✭ 4,459 (+5767.11%)
Mutual labels:  ocr
Mspaintide
Programming in MS Paint
Stars: ✭ 909 (+1096.05%)
Mutual labels:  ocr
Caffe ocr
主流ocr算法研究实验性的项目,目前实现了CNN+BLSTM+CTC架构
Stars: ✭ 1,156 (+1421.05%)
Mutual labels:  ocr
Easyocr
Java OCR 识别组件(基于Tesseract OCR 引擎)。能自动完成图片清理、识别 CAPTCHA 验证码图片内容的一体化工作。Java Image cleanup, OCR recognition component (based Tesseract OCR engine, automatically cleanup image and identification CAPTCHA verification code picture content).
Stars: ✭ 466 (+513.16%)
Mutual labels:  ocr
Textboxes plusplus
TextBoxes++: A Single-Shot Oriented Scene Text Detector
Stars: ✭ 883 (+1061.84%)
Mutual labels:  ocr
Trwebocr
开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~
Stars: ✭ 618 (+713.16%)
Mutual labels:  ocr
Maven Site
Apache Maven site
Stars: ✭ 54 (-28.95%)
Mutual labels:  apache
Couchdb Nano
Nano: The official Apache CouchDB library for Node.js
Stars: ✭ 456 (+500%)
Mutual labels:  apache
Img2latex Mathpix
An image to LaTeX tool by MathpixOCR API and JavaFX
Stars: ✭ 872 (+1047.37%)
Mutual labels:  ocr
Airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Stars: ✭ 24,101 (+31611.84%)
Mutual labels:  apache
Snipit
Snipit allows you to capture and save interesting sections from any source of information. Be it textbooks, journals, computer screens, photographs, flyers, writings on a whiteboard, etc.
Stars: ✭ 70 (-7.89%)
Mutual labels:  ocr
Js Ocr Demo
JavaScript optical character recognition demo
Stars: ✭ 447 (+488.16%)
Mutual labels:  ocr
Neural Network Digit Ocr
Trains a Neural Network to read handwritten digits (OCR). Uses synaptic for Node.js, socket.io and MongoDB
Stars: ✭ 12 (-84.21%)
Mutual labels:  ocr
Ezhttp
The bash shell script stack for installation of Nginx OpenResty Tengine lua_nginx_module nginx_concat_module nginx_upload_module ngx_substitutions_filter_module Apache-2.2 Apache-2.4 MySQL-5.1 MySQL-5.5 MySQL-5.6 MySQL-5.7 PHP-5.2 PHP-5.3 PHP-5.4 PHP-5.5 PHP-5.6 ZendOptimizer ZendGuardLoader Xcache Eaccelerator Imagemagick IonCube Memcache Memcached Redis Mongo Xdebug Mssql Memcached PureFtpd PhpMyAdmin Redis Mongodb PhpRedisAdmin MemAdmin RockMongo Jdk7 Jdk8 Tomcat7 Tomcat8
Stars: ✭ 443 (+482.89%)
Mutual labels:  apache
Simplehtr
Handwritten Text Recognition (HTR) system implemented with TensorFlow.
Stars: ✭ 1,072 (+1310.53%)
Mutual labels:  ocr
Dbnet.pytorch
A pytorch re-implementation of Real-time Scene Text Detection with Differentiable Binarization
Stars: ✭ 435 (+472.37%)
Mutual labels:  ocr
Training extensions
Trainable models and NN optimization tools
Stars: ✭ 857 (+1027.63%)
Mutual labels:  text-recognition
Lodestone
Personal Document Archiving (DMS, EDMS for Personal/Home Office use)
Stars: ✭ 426 (+460.53%)
Mutual labels:  ocr
Patter
speech-to-text in pytorch
Stars: ✭ 71 (-6.58%)
Mutual labels:  ocr
Awesome Solr
A curated list of Awesome Apache Solr links and resources.
Stars: ✭ 69 (-9.21%)
Mutual labels:  apache
Doccreator
DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation
Stars: ✭ 60 (-21.05%)
Mutual labels:  ocr
Pan card ocr project
To extract details from Indian National Identification Cards such as PAN (completed) & Aadhar, Passport, Driving License (WIP) in a structured format
Stars: ✭ 39 (-48.68%)
Mutual labels:  ocr
Receipt Parser Legacy
A supermarket receipt parser written in Python using tesseract OCR
Stars: ✭ 614 (+707.89%)
Mutual labels:  ocr
Attention Ocr Chinese Version
Attention OCR Based On Tensorflow
Stars: ✭ 421 (+453.95%)
Mutual labels:  text-recognition
Akarata
Indonesian stemmer - Pustaka JavaScript untuk mengambil kata dasar dari kata berimbuhan pada bahasa Indonesia.
Stars: ✭ 26 (-65.79%)
Mutual labels:  apache
Chineseocr
yolo3+ocr
Stars: ✭ 4,558 (+5897.37%)
Mutual labels:  ocr
Chinese Text Detection And Recognition
Assignment of Image Analysis and Understanding
Stars: ✭ 53 (-30.26%)
Mutual labels:  text-recognition
Psenet.pytorch
A pytorch re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network
Stars: ✭ 416 (+447.37%)
Mutual labels:  ocr
Openwhisk Runtime Php
Apache OpenWhisk Runtime PHP supports Apache OpenWhisk functions written in PHP
Stars: ✭ 26 (-65.79%)
Mutual labels:  apache
Opensearchserver
Open-source Enterprise Grade Search Engine Software
Stars: ✭ 408 (+436.84%)
Mutual labels:  ocr
Papermerge
Open Source Document Management System for Digital Archives (Scanned Documents)
Stars: ✭ 1,177 (+1448.68%)
Mutual labels:  ocr
Ctcwordbeamsearch
Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (+423.68%)
Mutual labels:  text-recognition
Subnode.org
SubNode: Social Media App
Stars: ✭ 25 (-67.11%)
Mutual labels:  apache
Struts Pwn
An exploit for Apache Struts CVE-2017-5638
Stars: ✭ 391 (+414.47%)
Mutual labels:  apache
Phpwpinfo
Provides an equivalent to the `phpinfo()` but with more WordPress requirements details.
Stars: ✭ 52 (-31.58%)
Mutual labels:  apache
Text renderer
Generate text images for training deep learning ocr model
Stars: ✭ 931 (+1125%)
Mutual labels:  ocr
Tessdata
Trained models with support for legacy and LSTM OCR engine
Stars: ✭ 4,173 (+5390.79%)
Mutual labels:  ocr
Ocr Electron Vue
📇 A Simple OCR Application built on Electron, Vue.js & Tesseract.js
Stars: ✭ 67 (-11.84%)
Mutual labels:  ocr
Nlp
[UNMANTEINED] Extract values from strings and fill your structs with nlp.
Stars: ✭ 367 (+382.89%)
Mutual labels:  text-extraction
Pytesseractid
使用 pytesseract ocr 识别 18 位身份证号
Stars: ✭ 23 (-69.74%)
Mutual labels:  ocr
Ocrserver
A simple OCR API server, seriously easy to be deployed by Docker, on Heroku as well
Stars: ✭ 359 (+372.37%)
Mutual labels:  ocr
Slowloris
Asynchronous Python implementation of SlowLoris DoS attack
Stars: ✭ 51 (-32.89%)
Mutual labels:  apache
Pdftools
Text Extraction, Rendering and Converting of PDF Documents
Stars: ✭ 349 (+359.21%)
Mutual labels:  text-extraction
Fakemenot
Application to check authenticity of Twitter screenshots. Written in Python 🐍
Stars: ✭ 22 (-71.05%)
Mutual labels:  ocr
Cnn lstm ctc tensorflow
CNN+LSTM+CTC based OCR implemented using tensorflow.
Stars: ✭ 343 (+351.32%)
Mutual labels:  ocr
Server Error Pages
Easy to use, professional error pages to replace the plaintext error pages that come with any server software like Nginx or Apache
Stars: ✭ 338 (+344.74%)
Mutual labels:  apache
Prlib
Pre-Recognition Library - library with algorithms for improving OCR quality.
Stars: ✭ 18 (-76.32%)
Mutual labels:  ocr
Ocrbot
An OCR (Optical Character Recognition) bot for Mastodon (and compatible) instances
Stars: ✭ 39 (-48.68%)
Mutual labels:  ocr
Openwhisk
Apache OpenWhisk is an open source serverless cloud platform
Stars: ✭ 5,499 (+7135.53%)
Mutual labels:  apache
Engintron
Engintron for cPanel/WHM is the easiest way to integrate Nginx on your cPanel/WHM server. Engintron will improve the performance & web serving capacity of your server, while reducing CPU/RAM load at the same time, by installing & configuring the popular Nginx webserver to act as a reverse caching proxy in front of Apache.
Stars: ✭ 587 (+672.37%)
Mutual labels:  apache
Sane Scan Pdf
Sane command-line scan-to-pdf script on Linux with OCR and deskew support
Stars: ✭ 58 (-23.68%)
Mutual labels:  ocr
Blackout
NaNoGenMo 2016 entry #2
Stars: ✭ 36 (-52.63%)
Mutual labels:  ocr
Total Text Dataset
Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.
Stars: ✭ 580 (+663.16%)
Mutual labels:  text-recognition
Docker Alpine
Docker containers running Alpine Linux and s6 for process management. Solid, reliable containers.
Stars: ✭ 574 (+655.26%)
Mutual labels:  apache
Paperless
Scan, index, and archive all of your paper documents
Stars: ✭ 7,662 (+9981.58%)
Mutual labels:  ocr
121-180 of 674 similar projects