PaperworkPersonal document manager (Linux/Windows) -- Moved to Gnome's Gitlab
Stars: ✭ 2,392 (+689.44%)
Mayan EdmsFree Open Source Document Management System (mirror, no pull request or issues)
Stars: ✭ 226 (-25.41%)
PapermergeOpen Source Document Management System for Digital Archives (Scanned Documents)
Stars: ✭ 1,177 (+288.45%)
ParsrTransforms PDF, Documents and Images into Enriched Structured Data
Stars: ✭ 2,736 (+802.97%)
MyboxEasy tools of document, image, file, network, location, color, and media.
Stars: ✭ 45 (-85.15%)
Open Semantic EtlPython based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
Stars: ✭ 165 (-45.54%)
OcrmypdfOCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Stars: ✭ 5,549 (+1731.35%)
Lambda Text ExtractorAWS Lambda functions to extract text from various binary formats.
Stars: ✭ 159 (-47.52%)
Open PaperlessScan, index, and archive all of your paper documents (acquired by Mayan EDMS)
Stars: ✭ 2,538 (+737.62%)
Ambar🔍 Ambar: Document Search Engine
Stars: ✭ 1,829 (+503.63%)
PdfocrAdds text to PDF files using the cuneiform OCR software
Stars: ✭ 287 (-5.28%)
Mayan EdmsRepository mirror of GtLab: https://gitlab.com/mayan-edms/mayan-edms Please use the upstream repository for issues and pull requests.
Stars: ✭ 398 (+31.35%)
RemarksExtract highlights, scribbles, and annotations from PDFs marked with the reMarkable tablet. Export to Markdown, PDF, PNG, and SVG
Stars: ✭ 94 (-68.98%)
LodestonePersonal Document Archiving (DMS, EDMS for Personal/Home Office use)
Stars: ✭ 426 (+40.59%)
PdftabextractA set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Stars: ✭ 1,969 (+549.83%)
i-librarian-freeI, Librarian - open-source version of a PDF managing SaaS.
Stars: ✭ 110 (-63.7%)
FileBasedMiniDMSThis php script sorts your documents (by using hardlinks) into subfolders based on the hashtags it finds in your documents filenames.
Stars: ✭ 35 (-88.45%)
idcardocr离线环境下第二代居民身份证信息识别
Stars: ✭ 358 (+18.15%)
Attention ocr.pytorchThis repository implements the the encoder and decoder model with attention model for OCR
Stars: ✭ 278 (-8.25%)
meltsubConvert hardsub to softsub
Stars: ✭ 19 (-93.73%)
smart-docs-parserAn OCR based document parser to extract information from identity document images
Stars: ✭ 14 (-95.38%)
Redux Offline DocsRedux documentation in PDF, ePub and MOBI formats for offline reading.
Stars: ✭ 292 (-3.63%)
Starter BookA book starter to kickstart your writing journey 🎉
Stars: ✭ 277 (-8.58%)
CTC-OCRA TensorFlow implementation of hybird CNN-LSTM model with CTC loss for OCR problem
Stars: ✭ 27 (-91.09%)
MillionHerosAndroid直播答题助手,支持全部答题APP,百万英雄/百万赢家/冲顶大会/芝士超人
Stars: ✭ 23 (-92.41%)
BoxableBoxable is a library that can be used to easily create tables in pdf documents.
Stars: ✭ 253 (-16.5%)
LibmergepdfPHP library for merging multiple PDFs
Stars: ✭ 282 (-6.93%)
attentionocrAttention OCR in Tensorflow 2.0
Stars: ✭ 45 (-85.15%)
Pdf数据科学方向 课件&资料
Stars: ✭ 293 (-3.3%)
BasicArabicOCRA very basic Arabic OCR based on tesseract OCR engine written in Java.
Stars: ✭ 19 (-93.73%)
QuickbillCreate unlimited invoices for free.
Stars: ✭ 278 (-8.25%)
breach-protocol-autosolverSolve breach protocol minigame in second(s). Windows/Linux/GeForce Now/Google Stadia. Every language.
Stars: ✭ 28 (-90.76%)
namselAn OCR application focused on machine-print Tibetan text
Stars: ✭ 22 (-92.74%)
HummusrecipeA powerful PDF tool for NodeJS based on HummusJS.
Stars: ✭ 274 (-9.57%)
car-OCR基于机器学习和OCR的车牌识别系统 @fujunhao
Stars: ✭ 39 (-87.13%)
ocromoreProcess, enhance and evaluate multiple OCR output.
Stars: ✭ 16 (-94.72%)
PdfRust library to read, manipulate and write PDF files.
Stars: ✭ 265 (-12.54%)
ScreenAccessAnti Recoil system with weapon type built-in recognition based on OCR, currently support next games: Apex Legends
Stars: ✭ 41 (-86.47%)
ocrSimple app to extract text from pictures using Tesseract
Stars: ✭ 98 (-67.66%)
TuclThe first-ever paper on the Unix shell written by Ken Thompson in 1976 scanned, transcribed, and redistributed with permission
Stars: ✭ 303 (+0%)
InvoicesGenerate PDF invoices for your customers in laravel
Stars: ✭ 298 (-1.65%)
CamelotCamelot: PDF Table Extraction for Humans
Stars: ✭ 3,150 (+939.6%)
pdf2xml-viewerA simple viewer and inspection tool for text boxes in PDF documents
Stars: ✭ 82 (-72.94%)
PRLibPre-Recognition Library - library with algorithms for improving OCR quality.
Stars: ✭ 22 (-92.74%)
DeckSlide Decks
Stars: ✭ 261 (-13.86%)
tesseract-serverA small lightweight HTTP server that converts photos, images and scanned documents to text using optical character recognition by utilizing the power of Google Tesseract.
Stars: ✭ 15 (-95.05%)
RplosR client for the PLoS Journals API
Stars: ✭ 289 (-4.62%)
TableexporttableExport(table导出文件,支持json、csv、txt、xml、word、excel、image、pdf)
Stars: ✭ 261 (-13.86%)
VehicleInfoOCRUse your camera to read number plates and obtain vehicle details. Simple, ad-free and faster alternative to existing playstore apps
Stars: ✭ 35 (-88.45%)
OCR-ReaderAn Android app to extract text from camera preview directly.
Stars: ✭ 43 (-85.81%)
UxmpdfkitAn iOS PDF viewer and annotator written in Swift that can be embedded into any application.
Stars: ✭ 260 (-14.19%)
screenshot-actionsDunst actions for screenshots (OCR, upload to 0x0.st, delete, rename, move to/from clipboard)
Stars: ✭ 49 (-83.83%)
MpdfPHP library generating PDF files from UTF-8 encoded HTML
Stars: ✭ 3,375 (+1013.86%)