Crawlab: Distributed web crawler admin platform for managing spiders, regardless of language or framework.
Stars: ✭ 8,392 (+2458.54%)
arachnod: High performance crawler for Node.js
Stars: ✭ 17 (-94.82%)
App comments spider: Crawls game reviews from Baidu Tieba, TapTap, the App Store, and official Weibo accounts (based on redis_scrapy); filtering uses a Bloom filter.
Stars: ✭ 38 (-88.41%)
OpenScraper: An open source webapp for scraping: towards a public service for webscraping
Stars: ✭ 80 (-75.61%)
Nodespider: [DEPRECATED] Simple, flexible, delightful web crawler/spider package
Stars: ✭ 33 (-89.94%)
imdb-spider: Scrapy spider for scraping IMDb {movie_id: [recommended, ...]}
Stars: ✭ 23 (-92.99%)
Jspider: JSpider adds the JS decryption method for at least one website every week. Stars are welcome; WeChat contact for discussion: 13298307816
Stars: ✭ 914 (+178.66%)
Toapi: Every web site provides APIs.
Stars: ✭ 3,209 (+878.35%)
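Go Demo: Go tutorial by example, from beginner to advanced, covering basic library usage, design patterns, common interview pitfalls, utility classes, third-party integrations, and more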
Stars: ✭ 881 (+168.6%)
Go spider: A Golang spider
Stars: ✭ 25 (-92.38%)
crawler: A simple and flexible web crawler framework for Java.
Stars: ✭ 20 (-93.9%)
Seeker: Seeker - another job board aggregator.
Stars: ✭ 16 (-95.12%)
Torbot: Dark Web OSINT Tool
Stars: ✭ 821 (+150.3%)
Ppspider: Web spider framework built on Puppeteer; supports decorator-based task queues and task scheduling, convenient data storage (nedb / mongodb), and data visualization with user interaction
Stars: ✭ 237 (-27.74%)
Funpyspidersearchengine: Word2vec personalized search (results tailored to each user) + Scrapy 2.3.0 (data crawling) + ElasticSearch 7.9.1 (data storage and external RESTful API) + Django 3.1.1 search
Stars: ✭ 782 (+138.41%)
Dumpall: An information-leak exploitation tool for .git/.svn source code leaks and .DS_Store leaks
Stars: ✭ 250 (-23.78%)
Creeper: 🐾 Creeper - The Next Generation Crawler Framework (Go)
Stars: ✭ 762 (+132.32%)
Querido Diario: 📰 Brazilian government gazettes, accessible to everyone.
Stars: ✭ 681 (+107.62%)
Oneblog: 👽 OneBlog, a clean, attractive, feature-rich, and responsive Java blog
Stars: ✭ 678 (+106.71%)
Spiderkeeper: Admin UI for Scrapy / open source Scrapinghub
Stars: ✭ 2,562 (+681.1%)
wb wx zh tt: Crawlers for Sina Weibo, WeChat, Zhihu, and Toutiao; supports Sina login, using captcha solving to obtain the login cookie
Stars: ✭ 16 (-95.12%)
Istock: 👉 A Java stock crawler built on Spring Boot (A-shares only). If you ❤️ it, please ⭐️. An upgraded V2 is in development!
Stars: ✭ 622 (+89.63%)
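Infospider: INFO-SPIDER is a crawler toolbox 🧰 that brings together many data sources, designed to help users retrieve their own data safely and quickly. The code is open source and the process is transparent. Supported data sources include GitHub, QQ Mail, NetEase Mail, Alibaba Mail, Sina Mail, Hotmail, Outlook, JD.com, Taobao, Alipay, China Mobile, China Unicom, China Telecom, Zhihu, Bilibili, NetEase Cloud Music, QQ friends, QQ groups, WeChat Moments album generation, browser history, 12306, Cnblogs, CSDN blogs, OSChina blogs, and Jianshu.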
Stars: ✭ 5,984 (+1724.39%)
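spider: 🌟 Powered by Python 3 (simple spider learning examples). Covers Baidu Wenku, NetEase Cloud Music songs, Douban movies, GitHub, JD.com, Qzone, weather, a VIP parsing helper, TED transcripts, a Wi-Fi cracking script, setting Bing images as the desktop wallpaper, and more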
Stars: ✭ 124 (-62.2%)
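Jd mask robot: JD.com face mask stock monitoring crawler (no Selenium) with QR-code login, price checks, add to cart, ordering, and flash-sale purchasing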
Stars: ✭ 216 (-34.15%)
Douyin: API of DouYin for Humans, used to crawl popular videos and music
Stars: ✭ 580 (+76.83%)
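Gospider: A crawler framework implemented in Golang; users only need to care about page rules, and a web management interface is provided. Built on colly.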
Stars: ✭ 285 (-13.11%)
Lspider: LSpider, a front-end crawler tailored for passive scanners
Stars: ✭ 214 (-34.76%)
91porn php: The simplest 91porn crawler, PHP version
Stars: ✭ 557 (+69.82%)
163Music: 163music spider by Scrapy.
Stars: ✭ 60 (-81.71%)
Fbcrawl: A Facebook crawler
Stars: ✭ 536 (+63.41%)
Dht: BitTorrent DHT Protocol && DHT Spider.
Stars: ✭ 2,459 (+649.7%)
Go jobs: A look at the Golang job market
Stars: ✭ 526 (+60.37%)
talospider: talospider - A simple, lightweight scraping micro-framework
Stars: ✭ 57 (-82.62%)
Elves: 🎊 Design and implementation of a lightweight crawler framework.
Stars: ✭ 315 (-3.96%)
Dhtspider: BitTorrent DHT network spider
Stars: ✭ 302 (-7.93%)
Geetest: geetest, slider CAPTCHA
Stars: ✭ 293 (-10.67%)
Gopa: [WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-15.55%)
TwEater: A Python Bot for Scraping Conversations from Twitter
Stars: ✭ 16 (-95.12%)
rb-spider: A Ruby implementation of a crawler based on RabbitMQ middleware [Developing]
Stars: ✭ 13 (-96.04%)
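Douyin Api: Douyin API, Douyin data, Douyin live-stream data, Douyin live-stream API, Douyin video API, Douyin crawler, Douyin watermark removal, Douyin video download, Douyin video parsing, Douyin live-stream monitoring, Douyin data collection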
Stars: ✭ 112 (-65.85%)