Ocrmypdf: OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Stars: ✭ 5,549 (+181.82%)
Easyocr: Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Stars: ✭ 13,379 (+579.48%)
Lambda Text Extractor: AWS Lambda functions to extract text from various binary formats.
Stars: ✭ 159 (-91.92%)
Exifcleaner: Cross-platform desktop GUI app to clean image metadata
Stars: ✭ 305 (-84.51%)
Signature extractor: A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.
Stars: ✭ 205 (-89.59%)
Open Semantic Etl: Python-based open source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (entity extraction and named entity recognition), and data enrichment (annotation) pipelines, with an ingestor to a Solr or Elasticsearch index and a linked data graph database
Stars: ✭ 165 (-91.62%)
Open Paperless: Scan, index, and archive all of your paper documents (acquired by Mayan EDMS)
Stars: ✭ 2,538 (+28.9%)
Pan card ocr project: Extracts details from Indian national identification cards such as PAN (completed) and Aadhar, passport, and driving license (WIP) in a structured format
Stars: ✭ 39 (-98.02%)
Paperwork: Personal document manager (Linux/Windows). Moved to GNOME's GitLab
Stars: ✭ 2,392 (+21.48%)
Ccextractor: CCExtractor - Official version maintained by the core team
Stars: ✭ 356 (-81.92%)
Govips: A lightning fast image processing and resizing library for Go
Stars: ✭ 442 (-77.55%)
Dmsmsgrcg: A photo OCR project that aims to output DMS messages contained in sign structure images.
Stars: ✭ 18 (-99.09%)
Awesome Ai Books: Some awesome AI-related books and PDFs for learning and downloading, along with some playground models for learning
Stars: ✭ 855 (-56.58%)
Mybox: Easy tools for documents, images, files, network, location, color, and media.
Stars: ✭ 45 (-97.71%)
Parsr: Transforms PDF, Documents and Images into Enriched Structured Data
Stars: ✭ 2,736 (+38.95%)
Tabula: Tabula is a tool for liberating data tables trapped inside PDF files
Stars: ✭ 5,420 (+175.27%)
Typefont: The first open-source library that detects the font of a text in an image.
Stars: ✭ 1,575 (-20.01%)
Gwu data mining: Materials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (-88.98%)
Pdfocr: Adds text to PDF files using the Cuneiform OCR software
Stars: ✭ 287 (-85.42%)
Docspell: Assists in organizing your piles of documents, resulting from scanners, e-mails and other sources, with minimal effort.
Stars: ✭ 303 (-84.61%)
Libvips: A fast image processing library with low memory needs.
Stars: ✭ 6,094 (+209.5%)
Prlib: Pre-Recognition Library - library with algorithms for improving OCR quality.
Stars: ✭ 18 (-99.09%)
Remarks: Extract highlights, scribbles, and annotations from PDFs marked with the reMarkable tablet. Export to Markdown, PDF, PNG, and SVG
Stars: ✭ 94 (-95.23%)
Papermerge: Open Source Document Management System for Digital Archives (Scanned Documents)
Stars: ✭ 1,177 (-40.22%)
Ssocr: Seven Segment Optical Character Recognition
Stars: ✭ 133 (-93.25%)
Mayan Edms: Free Open Source Document Management System (mirror, no pull requests or issues)
Stars: ✭ 226 (-88.52%)
En Data mining: Data Mining Historical Newspaper Metadata (METS/ALTO formats)
Stars: ✭ 14 (-99.29%)
Ambar: 🔍 Ambar: Document Search Engine
Stars: ✭ 1,829 (-7.11%)
Scene Text Recognition: Scene text detection and recognition based on Extremal Regions (ER)
Stars: ✭ 146 (-92.59%)
Gpuimage X: A Cross-platform (for both Android & iOS) Framework for GPU-based Filters, Video and Image Processing.
Stars: ✭ 154 (-92.18%)
Gensim: Topic Modelling for Humans
Stars: ✭ 12,763 (+548.2%)
Fall Detection: Human Fall Detection from CCTV camera feed
Stars: ✭ 154 (-92.18%)
Color recognition: 🎨 Color recognition, classification, and detection on a webcam stream, a video, or a single image, using a K-Nearest Neighbors (KNN) classifier trained on color histogram features with OpenCV.
Stars: ✭ 154 (-92.18%)
Openbook: Open source LilyPond real book for jazz musicians
Stars: ✭ 159 (-91.92%)
Smartcrop.js: Content aware image cropping
Stars: ✭ 12,345 (+526.97%)
It books: Sharing good books. Give roses to others and the fragrance lingers on your hand.
Stars: ✭ 154 (-92.18%)
Tesseract Macos: Objective-C wrapper for the open source OCR engine Tesseract (macOS)
Stars: ✭ 154 (-92.18%)
Gasyori100knock: Image processing code for understanding algorithms
Stars: ✭ 1,988 (+0.96%)
East icpr: Forked from argman/EAST for the ICPR MTWI 2018 CHALLENGE
Stars: ✭ 154 (-92.18%)
Computer Vision Video Lectures: A curated list of free, high-quality, university-level courses with video lectures related to the field of Computer Vision.
Stars: ✭ 154 (-92.18%)
Pngtastic: A pure Java PNG image optimization and manipulation library
Stars: ✭ 159 (-91.92%)
Tools Ocr: 树洞 OCR text recognition (a small cross-platform OCR tool)
Stars: ✭ 2,303 (+16.96%)
Yii2 Export: A library to export server/db data in various formats (e.g. Excel, HTML, PDF, CSV, etc.)
Stars: ✭ 153 (-92.23%)
Sourced Ce: source{d} Community Edition (CE)
Stars: ✭ 153 (-92.23%)
Go Audio: An offline solution to convert PDFs into audiobooks
Stars: ✭ 153 (-92.23%)
Crnn: Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.
Stars: ✭ 1,901 (-3.45%)
Allitebooks: Scrapes book download links from the AllITeBook website
Stars: ✭ 157 (-92.03%)
Captcha trainer: [CAPTCHA recognition: training] This project uses CNN/ResNet/DenseNet + GRU/LSTM + CTC/CrossEntropy to recognize CAPTCHAs. It covers model training only.
Stars: ✭ 2,228 (+13.15%)
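Hltool: A library of common utilities for Go development: Google two-step verification client, AES encryption/decryption, RSA encryption/decryption, DingTalk bot, email sending, JWT generation and parsing, logging, BoltDB operations, image operations, JSON operations, struct serialization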
Stars: ✭ 151 (-92.33%)
Cyclegan Keras: Keras implementation of CycleGAN using a TensorFlow backend.
Stars: ✭ 152 (-92.28%)
Ipyplot: IPyPlot is a small Python package offering fast and efficient plotting of images inside Python notebooks. It uses IPython with HTML for a faster, richer, and more interactive way of displaying large numbers of images.
Stars: ✭ 152 (-92.28%)
Reportbro Designer: JavaScript plugin to visually design report layouts (for PDF and Excel) which can be created with reportbro-lib (a Python package) on the server.
Stars: ✭ 160 (-91.87%)
Gan Mri: Code repository for Frontiers article 'Generative Adversarial Networks for Image-to-Image Translation on Multi-Contrast MR Images - A Comparison of CycleGAN and UNIT'
Stars: ✭ 159 (-91.92%)
Degate: Open source software for chip reverse engineering.
Stars: ✭ 156 (-92.08%)
Mlv App: All-in-one MLV processing app that is pretty great.
Stars: ✭ 150 (-92.38%)