pygramsExtracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence
Stars: ✭ 52 (+79.31%)
Paddle2onnxPaddlePaddle to ONNX model converter
Stars: ✭ 185 (+537.93%)
TesseractBindings to Tesseract OCR engine for R
Stars: ✭ 192 (+562.07%)
Image2text📋 Python wrapper to grab text from images and save as text files using Tesseract Engine
Stars: ✭ 243 (+737.93%)
Ocr TableExtract tables from scanned image PDFs using Optical Character Recognition.
Stars: ✭ 165 (+468.97%)
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (+658.62%)
PdftabextractA set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Stars: ✭ 1,969 (+6689.66%)
Icdar 2019 SroieICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
Stars: ✭ 202 (+596.55%)
Crnn PytorchPytorch implementation of CRNN (CNN + RNN + CTCLoss) for all language OCR.
Stars: ✭ 248 (+755.17%)
Layout ParserA Python Library for Document Layout Understanding
Stars: ✭ 191 (+558.62%)
koolslaFood recommendation tool with Machine learning.
Stars: ✭ 21 (-27.59%)
Hms Ml DemoHMS ML Demo provides an example of integrating Huawei ML Kit service into applications. This example demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, asr, and tts.
Stars: ✭ 187 (+544.83%)
ImageocrPHP验证码识别[PHP CAPTCHA Recognition]
Stars: ✭ 241 (+731.03%)
KeywordExtractionImplementation of algorithm in keyword extraction,including TextRank,TF-IDF and the combination of both
Stars: ✭ 95 (+227.59%)
SwiftytesseractA Swift wrapper around Tesseract for use in iOS, macOS, and Linux applications
Stars: ✭ 170 (+486.21%)
Mayan EdmsFree Open Source Document Management System (mirror, no pull request or issues)
Stars: ✭ 226 (+679.31%)
AdelaidetAdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Stars: ✭ 2,565 (+8744.83%)
SharexShareX is a free and open source program that lets you capture or record any area of your screen and share it with a single press of a key. It also allows uploading images, text or other types of files to many supported destinations you can choose from.
Stars: ✭ 18,143 (+62462.07%)
Lambda Text ExtractorAWS Lambda functions to extract text from various binary formats.
Stars: ✭ 159 (+448.28%)
Tesseract4javaJava GUI and Tools for Tesseract OCR
Stars: ✭ 214 (+637.93%)
Tools Ocr树洞 OCR 文字识别(一款跨平台的 OCR 小工具)
Stars: ✭ 2,303 (+7841.38%)
Pytorchocr基于pytorch的ocr算法库,包括 psenet, pan, dbnet, sast , crnn
Stars: ✭ 198 (+582.76%)
EastA tensorflow implementation of EAST text detector
Stars: ✭ 2,804 (+9568.97%)
Ocr.pytorchA pure pytorch implemented ocr project including text detection and recognition
Stars: ✭ 196 (+575.86%)
CintruderCaptcha Intruder (CIntrud3r) is an automatic pentesting tool to bypass captchas.
Stars: ✭ 192 (+562.07%)
Opencv📷 Computer-Vision Demos
Stars: ✭ 244 (+741.38%)
Receipt ScannerReceipt scanner extracts information from your PDF or image receipts - built in NodeJS
Stars: ✭ 190 (+555.17%)
Handwritten-Names-RecognitionThe goal of this project is to solve the task of name transcription from handwriting images implementing a NN approach.
Stars: ✭ 54 (+86.21%)
Caffe OneclickUse caffe to train your own data in just one click
Stars: ✭ 187 (+544.83%)
Rrpn faster Rcnn tensorflowA tensorflow re-implementation of RRPN: Arbitrary-Oriented Scene Text Detection via Rotation Proposals.
Stars: ✭ 243 (+737.93%)
Android OcrExperimental optical character recognition app
Stars: ✭ 2,177 (+7406.9%)
ParsrTransforms PDF, Documents and Images into Enriched Structured Data
Stars: ✭ 2,736 (+9334.48%)
OCRmyPDF-webA tiny frontend for OCRing PDF files via the web.
Stars: ✭ 37 (+27.59%)
OcrThe Best Image OCR SDK For BAT
Stars: ✭ 173 (+496.55%)
Open PaperlessScan, index, and archive all of your paper documents (acquired by Mayan EDMS)
Stars: ✭ 2,538 (+8651.72%)
Text DetectionText detection with mainly MSER and SWT
Stars: ✭ 167 (+475.86%)
resume-parserThis site uses Lever's resume parsing API to parse resumes
Stars: ✭ 80 (+175.86%)
Open Semantic EtlPython based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
Stars: ✭ 165 (+468.97%)
Tessdata fastFast integer versions of trained LSTM models
Stars: ✭ 221 (+662.07%)
Card.io Android Sdkcard.io provides fast, easy credit card scanning in mobile apps
Stars: ✭ 1,942 (+6596.55%)
Kaku画 - Japanese OCR Dictionary
Stars: ✭ 160 (+451.72%)
Chinese Ocr[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别
Stars: ✭ 2,589 (+8827.59%)
Craft PytorchOfficial implementation of Character Region Awareness for Text Detection (CRAFT)
Stars: ✭ 2,220 (+7555.17%)
TesstrainTrain Tesseract LSTM with make
Stars: ✭ 251 (+765.52%)
bns-short-text-similarity📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.
Stars: ✭ 24 (-17.24%)
PaperworkPersonal document manager (Linux/Windows) -- Moved to Gnome's Gitlab
Stars: ✭ 2,392 (+8148.28%)
Wx Cardscanner名片扫描-微信小程序,包括腾讯 ai 开放平台的使用,以及在小程序中实现图片转 Base64 的方法。
Stars: ✭ 156 (+437.93%)