OcrmypdfOCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Stars: ✭ 5,549 (+3389.94%)
ocrSimple app to extract text from pictures using Tesseract
Stars: ✭ 98 (-38.36%)
Tesseract MacosObjective C wrapper for the open source OCR Engine Tesseract (macOS)
Stars: ✭ 154 (-3.14%)
OcrbotAn OCR (Optical Character Recognition) bot for Mastodon (and compatible) instances
Stars: ✭ 39 (-75.47%)
Lambda PacksPrecompiled packages for AWS Lambda
Stars: ✭ 997 (+527.04%)
Tesseract4androidFork of tess-two rewritten from scratch to support latest version of Tesseract OCR.
Stars: ✭ 148 (-6.92%)
Serverless LibreofficeRun LibreOffice in AWS Lambda to create PDFs & convert documents
Stars: ✭ 410 (+157.86%)
TesseractA PHP wrapper for the Tesseract OCR engine
Stars: ✭ 19 (-88.05%)
Pytesseractid使用 pytesseract ocr 识别 18 位身份证号
Stars: ✭ 23 (-85.53%)
UnipdfGolang PDF library for creating and processing PDF files (pure go)
Stars: ✭ 1,171 (+636.48%)
Aws Serverless Airline BookingAirline Booking is a sample web application that provides Flight Search, Flight Payment, Flight Booking and Loyalty points including end-to-end testing, GraphQL and CI/CD. This web application was the theme of Build on Serverless Season 2 on AWS Twitch running from April 24th until end of August in 2019.
Stars: ✭ 1,290 (+711.32%)
RemarksExtract highlights, scribbles, and annotations from PDFs marked with the reMarkable tablet. Export to Markdown, PDF, PNG, and SVG
Stars: ✭ 94 (-40.88%)
OcrtableRecognize tables and text from scanned images that contain tables. 从包含表格的扫描图片中识别表格和文字
Stars: ✭ 155 (-2.52%)
Qanswer【Deprecated】🥇🥇🥇 冲顶大会等游戏答题助手,提供答题辅助决策 ,帮助顺利吃鸡
Stars: ✭ 326 (+105.03%)
Card Ocr身份证识别OCR
Stars: ✭ 345 (+116.98%)
AwslambdaproxyAn AWS Lambda powered HTTP/SOCKS web proxy
Stars: ✭ 571 (+259.12%)
UnidocThis repository has moved! https://github.com/unidoc/unipdf
Stars: ✭ 694 (+336.48%)
Cogstack PipelineDistributed, fault tolerant batch processing for Natural Language Applications and Search, using remote partitioning
Stars: ✭ 26 (-83.65%)
EasyocrJava OCR 识别组件(基于Tesseract OCR 引擎)。能自动完成图片清理、识别 CAPTCHA 验证码图片内容的一体化工作。Java Image cleanup, OCR recognition component (based Tesseract OCR engine, automatically cleanup image and identification CAPTCHA verification code picture content).
Stars: ✭ 466 (+193.08%)
Ocr Electron Vue📇 A Simple OCR Application built on Electron, Vue.js & Tesseract.js
Stars: ✭ 67 (-57.86%)
Penteract Ocr⭐️ The native node.js bindings to the Tesseract OCR project.
Stars: ✭ 86 (-45.91%)
PapermergeOpen Source Document Management System for Digital Archives (Scanned Documents)
Stars: ✭ 1,177 (+640.25%)
Links Detector📖 👆🏻 Links Detector makes printed links clickable via your smartphone camera. No need to type a link in, just scan and click on it.
Stars: ✭ 106 (-33.33%)
TabuloTable Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)
Stars: ✭ 110 (-30.82%)
Lambda Toolkit*DO NOT USE* - This project was done during my initial python and lambda's studies. I would recommend you the `serverless framework`.
Stars: ✭ 114 (-28.3%)
DocspellAssist in organizing your piles of documents, resulting from scanners, e-mails and other sources with miminal effort.
Stars: ✭ 303 (+90.57%)
Serverless Photo RecognitionA collection of 3 lambda functions that are invoked by Amazon S3 or Amazon API Gateway to analyze uploaded images with Amazon Rekognition and save picture labels to ElasticSearch (written in Kotlin)
Stars: ✭ 345 (+116.98%)
PdfocrAdds text to PDF files using the cuneiform OCR software
Stars: ✭ 287 (+80.5%)
TessdataTrained models with support for legacy and LSTM OCR engine
Stars: ✭ 4,173 (+2524.53%)
CcextractorCCExtractor - Official version maintained by the core team
Stars: ✭ 356 (+123.9%)
Tesseract.jsPure Javascript OCR for more than 100 Languages 📖🎉🖥
Stars: ✭ 25,246 (+15777.99%)
DownloadthisvideoTwitter bot for easily downloading videos/GIFs off tweets
Stars: ✭ 530 (+233.33%)
Image Text Localization RecognitionA general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集 シーンテキストの位置認識と識別のための論文リソースの要約
Stars: ✭ 788 (+395.6%)
Webiny JsEnterprise open-source serverless CMS. Includes a headless CMS, page builder, form builder and file manager. Easy to customize and expand. Deploys to AWS.
Stars: ✭ 4,869 (+2962.26%)
Epsagon GoAutomated tracing library for Go 1.x ⚡️
Stars: ✭ 24 (-84.91%)
Pan card ocr projectTo extract details from Indian National Identification Cards such as PAN (completed) & Aadhar, Passport, Driving License (WIP) in a structured format
Stars: ✭ 39 (-75.47%)
breach-protocol-autosolverSolve breach protocol minigame in second(s). Windows/Linux/GeForce Now/Google Stadia. Every language.
Stars: ✭ 28 (-82.39%)
Pdfio.jlPDF Reader Library for Native Julia.
Stars: ✭ 56 (-64.78%)
IdmatchMatch faces on id cards with OCR capabilities.
Stars: ✭ 52 (-67.3%)
TextshotPython tool for grabbing text via screenshot
Stars: ✭ 1,163 (+631.45%)
Tesseract PythonExamples to implement OCR(Optical Character Recognition) using tesseract using Python
Stars: ✭ 49 (-69.18%)
Koreader BaseBase framework offering a Lua scriptable environment for creating document readers
Stars: ✭ 81 (-49.06%)
Php Apache TikaApache Tika bindings for PHP: extract text and metadata from documents, images and other formats
Stars: ✭ 76 (-52.2%)
Lambcycle🐑🛵 A declarative lambda middleware with life cycle hooks 🐑🛵
Stars: ✭ 88 (-44.65%)
TesseractThis package contains an OCR engine - libtesseract and a command line program - tesseract.
Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused
on line recognition, but also still supports the legacy Tesseract OCR engine of
Tesseract 3 which works by recognizing character patterns. Compatibility with
Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0).
It also needs traineddata files which support the legacy engine, for example
those from the tessdata repository.
Stars: ✭ 43,199 (+27069.18%)
GosseractGo package for OCR (Optical Character Recognition), by using Tesseract C++ library
Stars: ✭ 1,622 (+920.13%)
Aadhaar Card OcrExtract text information from Aadhaar Card using tesseract-ocr 😎
Stars: ✭ 112 (-29.56%)
TesserocrA Python wrapper for the tesseract-ocr API
Stars: ✭ 1,567 (+885.53%)
Aws Lambda ListA list of hopefully useful AWS lambdas and lambda-related resources.
Stars: ✭ 130 (-18.24%)
MyboxEasy tools of document, image, file, network, location, color, and media.
Stars: ✭ 45 (-71.7%)
Ambar🔍 Ambar: Document Search Engine
Stars: ✭ 1,829 (+1050.31%)