goSpider: Some small projects and some articles
Stars: ✭ 56 (+194.74%)
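TikTokDownloader PyWebIO: 🚀 「Douyin_TikTok_Download_API」 is an out-of-the-box, high-performance asynchronous Douyin|TikTok data scraping tool that supports API calls and online batch parsing and downloading.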
Stars: ✭ 919 (+4736.84%)
crnn.mxnet: CRNN in MXNet. Can train with Chinese characters
Stars: ✭ 47 (+147.37%)
MoMo: Uses the sharing feature of 墨墨背单词 (MoMo Beidanci) to claim the daily reward of 20 words toward the word limit (multithreaded)
Stars: ✭ 45 (+136.84%)
ddddocr: 带带弟弟 (DaiDaiDiDi), a general-purpose CAPTCHA recognition OCR, PyPI version
Stars: ✭ 4,093 (+21442.11%)
crawlerdetect: Golang module to detect bots and crawlers via the user agent
Stars: ✭ 22 (+15.79%)
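video-subtitle-extractor: Extracts hard-coded subtitles from videos and generates srt files. No third-party API needs to be applied for; text recognition runs locally. A deep-learning-based video subtitle extraction framework covering subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files.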
Stars: ✭ 1,763 (+9178.95%)
shape-context-ocr: The Shape Context is a shape descriptor that captures the relative positions of other points on the shape contours, and is used to recognize characters.
Stars: ✭ 20 (+5.26%)
blog: A day-to-day collection of technical material (contributions welcome)
Stars: ✭ 59 (+210.53%)
robotstxt: robots.txt file parsing and checking for R
Stars: ✭ 65 (+242.11%)
digdet: A real-time digit OCR in the browser using machine learning
Stars: ✭ 22 (+15.79%)
CLPR.pytorch: End-to-end Chinese license plate recognition
Stars: ✭ 75 (+294.74%)
omynote: 众山小笔记 (Zhongshanxiao Notes) - manage your reading notes in one place
Stars: ✭ 154 (+710.53%)
Snipping-Ocr: A simple snipping tool for Windows with OCR capabilities
Stars: ✭ 82 (+331.58%)
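scraper: Image scraping and download tool. Quickly scrapes and downloads the images, photos, and illustrations uploaded by designers and users on 站酷 (ZCOOL, https://www.zcool.com.cn/) and CNU 视觉 (http://www.cnu.cc/).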
Stars: ✭ 64 (+236.84%)
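zhihu-crawler: A from-scratch implementation that scrapes Zhihu on a schedule, mines valuable information from it, and visualizes the scraped data on a web page.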
Stars: ✭ 56 (+194.74%)
spider: Python crawlers (amazon, confluence ...)
Stars: ✭ 21 (+10.53%)
DocTr: The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
Stars: ✭ 202 (+963.16%)
Tess4Android: A new fork based on tess-two and Tesseract 4.0.0
Stars: ✭ 31 (+63.16%)
blinkid-in-browser: BlinkID In-browser SDK for WebAssembly-enabled browsers.
Stars: ✭ 40 (+110.53%)
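get LibSeat: Automatic reservation and check-in program for the 利昂 (Li'ang) library reservation system. Supports the library systems of 38 universities, including Renmin University of China, Beijing Normal University, University of Jinan, and Harbin Institute of Technology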
Stars: ✭ 39 (+105.26%)
feaplat: Crawler management system with cluster support and elastic scaling. Supports running feapder, scrapy, selenium, playwright, and other frameworks and scripts
Stars: ✭ 42 (+121.05%)
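js block: Research and study of various blocking techniques: anti-crawler measures, ad blocking, ad-injection prevention, fighting scalpers, and more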
Stars: ✭ 59 (+210.53%)
erpnext ocr: 🐍 ⚗️ Optical Character Recognition using Tesseract within Frappe.
Stars: ✭ 58 (+205.26%)
deep-text-recognition-benchmark: Provides the OCR models in ONNX format so that the OpenCV DNN module can use them directly and correctly.
Stars: ✭ 32 (+68.42%)
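ICP-Checker: ICP filing lookup. Queries the ICP filing information of a company or domain, completes the slider verification automatically, and saves the results to an Excel spreadsheet. Works with the 2022 version of the MIIT filing management system website. No more repeated slider dragging, and no more having to buy a VIP subscription on a certain 站* tools site just to view filing information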
Stars: ✭ 119 (+526.32%)
tibetan-ocr: Python OCR for handwritten Tibetan manuscripts
Stars: ✭ 19 (+0%)
form-segmentation: Let's explore how we can extract text from forms
Stars: ✭ 42 (+121.05%)
Printed-Chinese-Character-OCR: This is a Chinese character OCR system based on deep learning (a VGG-like CNN). This repo includes training set generation, image preprocessing, and NN model optimization based on the Keras high-level NN framework
Stars: ✭ 21 (+10.53%)
ocr2text: Convert a PDF via OCR to a TXT file in UTF-8 encoding
Stars: ✭ 90 (+373.68%)
ruzzle-solver: A Python script that solves Ruzzle boards
Stars: ✭ 46 (+142.11%)
seenreq: Generates an object for testing whether a request has been sent, where request is Mikeal's request library.
Stars: ✭ 42 (+121.05%)
fetchurls: A Bash script to spider a site, follow links, and fetch URLs (with built-in filtering) into a generated text file.
Stars: ✭ 97 (+410.53%)
webgrep: Grep web pages with extra features like JS deobfuscation and OCR
Stars: ✭ 86 (+352.63%)
yutto: 🧊 A cute and willful Bilibili video downloader (bilili V2)
Stars: ✭ 383 (+1915.79%)
paperbase: Open-source document organizer with automatic OCR and full-text search
Stars: ✭ 21 (+10.53%)
i-librarian-free: I, Librarian - open-source version of a PDF management SaaS.
Stars: ✭ 110 (+478.95%)
NScrapy: NScrapy is a .NET Core cross-platform distributed spider framework that provides an easy way to write your own spider
Stars: ✭ 88 (+363.16%)
ingest-file: Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.
Stars: ✭ 40 (+110.53%)
veryfi-go: Go module for communicating with the Veryfi OCR API
Stars: ✭ 18 (-5.26%)