wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-46.39%)
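gathertool
gathertool is a scripting-oriented development library for Go, intended to speed up development for its target scenarios. It includes a lightweight crawler library, an API testing and load testing library, a database operations library, and more.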
Stars: ✭ 36 (-62.89%)
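Infospider
INFO-SPIDER is a crawler toolbox 🧰 that brings many data sources together, designed to help users retrieve their own data safely and quickly. The tool's code is open source and its process is transparent. Supported data sources include GitHub, QQ Mail, NetEase Mail, Alibaba Mail, Sina Mail, Hotmail, Outlook, JD.com, Taobao, Alipay, China Mobile, China Unicom, China Telecom, Zhihu, Bilibili, NetEase Cloud Music, QQ friends, QQ groups, WeChat Moments album generation, browser history, 12306, Cnblogs, CSDN blogs, OSChina blogs, and Jianshu.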
Stars: ✭ 5,984 (+6069.07%)
gospider
⚡ Lightweight Golang spider framework
Stars: ✭ 183 (+88.66%)
Pspider
A simple distributed crawler framework
Stars: ✭ 102 (+5.15%)
Bitextor
Bitextor generates translation memories from multilingual websites.
Stars: ✭ 168 (+73.2%)
Fbcrawl
A Facebook crawler
Stars: ✭ 536 (+452.58%)
Proxy pool
Proxy IP pool for Python crawlers (proxy pool)
Stars: ✭ 13,964 (+14295.88%)
Grab Site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Stars: ✭ 680 (+601.03%)
Geetest
geetest, slider CAPTCHA
Stars: ✭ 293 (+202.06%)
Nodespider
[DEPRECATED] Simple, flexible, delightful web crawler/spider package
Stars: ✭ 33 (-65.98%)
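Novel Plus
小说精品屋-plus is a full-featured original-literature CMS for reading across platforms (PC, WAP). It consists of several subsystems, including a front-end portal, an author back-office system, a platform administration system, and a crawler management system. It supports multiple templates, member top-ups, subscriptions, news publishing, and real-time statistical reports. New books are added to the library automatically and existing books are updated automatically.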
Stars: ✭ 1,122 (+1056.7%)
Geetest
Slider CAPTCHA; hope it helps ❤️
Stars: ✭ 114 (+17.53%)
Zhihu Login
Simulated Zhihu login, with support for extracting CAPTCHAs and saving cookies
Stars: ✭ 340 (+250.52%)
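Python3 Spider
Python crawlers in practice: simulated login for major websites, including but not limited to slider verification, Pinduoduo, Meituan, Baidu, bilibili, Dianping, and Taobao. If you like it, please star ❤️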
Stars: ✭ 2,129 (+2094.85%)
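Crack Js Spider
Cracks JS anti-crawler encrypted parameters. Already cracked: China Judgments Online (updated 2020-06-30), Taobao password, Tianan Insurance login, Bilibili login, Fang.com login, WPS login, Weibo login, Youdao Translate, NetEase login, WeChat Official Account login, Kongzhong login, 今目标 login, student information management system login, 共赢金融 login, Chongqing Science and Technology Resource Sharing Platform login, NetEase Cloud Music download, one-click video link parsing, and Cailian Press login.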
Stars: ✭ 175 (+80.41%)
crawl
Lightweight library for scalable crawlers in Go.
Stars: ✭ 20 (-79.38%)
ZSpider
An Electron-based crawler program
Stars: ✭ 37 (-61.86%)
diffbot-php-client
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (-45.36%)
ant
A web crawler for Go
Stars: ✭ 264 (+172.16%)
Web-Iota
Iota is a web scraper which can find all of the images and links/suburls on a webpage
Stars: ✭ 60 (-38.14%)
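TikTokDownloader PyWebIO
🚀 "Douyin_TikTok_Download_API" is an out-of-the-box, high-performance asynchronous Douyin | TikTok data scraping tool that supports API calls as well as online batch parsing and downloading.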
Stars: ✭ 919 (+847.42%)
TaobaoSpider
This taobao spider has been archived
Stars: ✭ 28 (-71.13%)
grapy
Grapy, a fast high-level web crawling framework for Python 3.3 or later, based on asyncio.
Stars: ✭ 18 (-81.44%)
MoMo
Uses the sharing feature of 墨墨背单词 to claim the daily word-limit bonus of 20 words (multithreaded)
Stars: ✭ 45 (-53.61%)
scrapy helper
Dynamically configurable crawler
Stars: ✭ 84 (-13.4%)
main project
Projects including a Node.js-based web chat room and crawler, a Vue music player, and an admin system with a PHP back end
Stars: ✭ 49 (-49.48%)
crawlerdetect
Golang module to detect bots and crawlers via the user agent
Stars: ✭ 22 (-77.32%)
url-regex-safe
Regular expression matching for URLs. Maintained, safe, and browser-friendly version of url-regex. Resolves CVE-2020-7661 for Node.js servers.
Stars: ✭ 59 (-39.18%)
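scraper
Image scraping and download tool that quickly scrapes and downloads the images, photos, and illustrations uploaded by designers and users on ZCOOL (https://www.zcool.com.cn/) and CNU (http://www.cnu.cc/).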
Stars: ✭ 64 (-34.02%)
spider
Crawler for China Judgments Online (裁判文书网)
Stars: ✭ 19 (-80.41%)
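HTML-DEV-ToolLink
HTML Development Tool Link: commonly used online tools for string encoding and decoding, code minification and beautification, JSON formatting, regular expressions, time conversion, QR code generation and decoding, and more, with online search and a Chrome extension.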
Stars: ✭ 44 (-54.64%)
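DeadPool
This project is a crawler application built on Celery as its main framework. It lets crawler tasks be added flexibly and runs crawls for multiple sites at the same time. All components natively support scaled concurrency and distribution, and combined with Celery's native distributed invocation this enables large-scale concurrency.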
Stars: ✭ 38 (-60.82%)
Spider
News crawler app
Stars: ✭ 24 (-75.26%)
seenreq
Generates an object for testing whether a request has been sent; the request is Mikeal's request.
Stars: ✭ 42 (-56.7%)
glyphhanger
Your web font utility belt. It can subset web fonts. It can find unicode-ranges for you automatically. It makes julienne fries.
Stars: ✭ 422 (+335.05%)
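BaiduSpider
The project has moved to https://github.com/BaiduSpider/BaiduSpider. A crawler for Baidu search results, currently supporting Baidu web search, Baidu Images, Baidu Zhidao, Baidu Video, Baidu News, Baidu Wenku, Baidu Jingyan, and Baidu Baike.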
Stars: ✭ 29 (-70.1%)
sede
Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data
Stars: ✭ 83 (-14.43%)
simpyder
Ultra-fast asynchronous coroutine-based Python crawler
Stars: ✭ 74 (-23.71%)
imdb-spider
scrapy spider for scraping imdb {movie_id: [recommended, ...]}
Stars: ✭ 23 (-76.29%)
goSpider
Some small projects and some articles
Stars: ✭ 56 (-42.27%)
spider
Python crawlers (amazon, confluence ...)
Stars: ✭ 21 (-78.35%)