SpidrA versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+1266.67%)
NewcrawlerFree Web Scraping Tool with Java
Stars: ✭ 589 (+1127.08%)
Divergence MeterDivergence Meter is an application based on Steins;Gate. Unmaintained, feel free to contribute
Stars: ✭ 18 (-62.5%)
Qt.pyMinimal Python 2 & 3 shim around all Qt bindings - PySide, PySide2, PyQt4 and PyQt5.
Stars: ✭ 684 (+1325%)
XsrfprobeThe Prime Cross Site Request Forgery (CSRF) Audit and Exploitation Toolkit.
Stars: ✭ 532 (+1008.33%)
Go spiderA golang spider
Stars: ✭ 25 (-47.92%)
Python Spider豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章
Stars: ✭ 615 (+1181.25%)
Nodespider[DEPRECATED] Simple, flexible, delightful web crawler/spider package
Stars: ✭ 33 (-31.25%)
Xxl CrawlerA distributed web crawler framework.(分布式爬虫框架XXL-CRAWLER)
Stars: ✭ 561 (+1068.75%)
Anti Anti Spider越来越多的网站具有反爬虫特性,有的用图片隐藏关键数据,有的使用反人类的验证码,建立反反爬虫的代码仓库,通过与不同特性的网站做斗争(无恶意)提高技术。(欢迎提交难以采集的网站)(因工作原因,项目暂停)
Stars: ✭ 6,907 (+14289.58%)
Creeper🐾 Creeper - The Next Generation Crawler Framework (Go)
Stars: ✭ 762 (+1487.5%)
Grab SiteThe archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Stars: ✭ 680 (+1316.67%)
App comments spider爬取百度贴吧、TapTap、appstore、微博官方博主上的游戏评论(基于redis_scrapy),过滤器采用了bloomfilter。
Stars: ✭ 38 (-20.83%)
IcrawlerA multi-thread crawler framework with many builtin image crawlers provided.
Stars: ✭ 629 (+1210.42%)
Domain hunterA Burp Suite Extension that try to find all sub-domain, similar-domain and related-domain of an organization automatically! 基于流量自动收集整个企业或组织的子域名、相似域名、相关域名的burp插件
Stars: ✭ 594 (+1137.5%)
CrawlabDistributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Stars: ✭ 8,392 (+17383.33%)
NetdiscoveryNetDiscovery 是一款基于 Vert.x、RxJava 2 等框架实现的通用爬虫框架/中间件。
Stars: ✭ 573 (+1093.75%)
Zhihu Crawlerzhihu-crawler是一个基于Java的高性能、支持免费http代理池、支持横向扩展、分布式爬虫项目
Stars: ✭ 890 (+1754.17%)
Web kg爬取百度百科中文页面,抽取三元组信息,构建中文知识图谱
Stars: ✭ 549 (+1043.75%)
JspiderJSpider会每周更新至少一个网站的JS解密方式,欢迎 Star,交流微信:13298307816
Stars: ✭ 914 (+1804.17%)
Haipproxy💖 High available distributed ip proxy pool, powerd by Scrapy and Redis
Stars: ✭ 4,993 (+10302.08%)
TorbotDark Web OSINT Tool
Stars: ✭ 821 (+1610.42%)
CrawlerA high performance web crawler in Elixir.
Stars: ✭ 781 (+1527.08%)
PholcusPholcus is a distributed high-concurrency crawler software written in pure golang
Stars: ✭ 6,990 (+14462.5%)
MamanRust Web Crawler saving pages on Redis
Stars: ✭ 39 (-18.75%)
Querido Diario📰 Brazilian government gazettes, accessible to everyone.
Stars: ✭ 681 (+1318.75%)
EasyloginA python3 package for writing spider more easily.
Stars: ✭ 26 (-45.83%)
Oneblog👽 OneBlog,一个简洁美观、功能强大并且自适应的Java博客
Stars: ✭ 678 (+1312.5%)
AvbookAV 电影管理系统, avmoo , javbus , javlibrary 爬虫,线上 AV 影片图书馆,AV 磁力链接数据库,Japanese Adult Video Library,Adult Video Magnet Links - Japanese Adult Video Database
Stars: ✭ 8,133 (+16843.75%)
ScrapitScraping scripts for various websites.
Stars: ✭ 25 (-47.92%)
Istock👉一个基于spring boot 实现的java股票爬虫(仅支持A股),如果你❤️请⭐️ . V2升级版正在开发中!
Stars: ✭ 622 (+1195.83%)
NetcloudNetCloud Web Spider
Stars: ✭ 37 (-22.92%)
InfospiderINFO-SPIDER 是一个集众多数据源于一身的爬虫工具箱🧰,旨在安全快捷的帮助用户拿回自己的数据,工具代码开源,流程透明。支持数据源包括GitHub、QQ邮箱、网易邮箱、阿里邮箱、新浪邮箱、Hotmail邮箱、Outlook邮箱、京东、淘宝、支付宝、中国移动、中国联通、中国电信、知乎、哔哩哔哩、网易云音乐、QQ好友、QQ群、生成朋友圈相册、浏览器浏览历史、12306、博客园、CSDN博客、开源中国博客、简书。
Stars: ✭ 5,984 (+12366.67%)
FbiwarningNode.js seed downloader (Node.js 种子神器)
Stars: ✭ 44 (-8.33%)
DouyinAPI of DouYin for Humans used to Crawl Popular Videos and Musics
Stars: ✭ 580 (+1108.33%)
Spider163抓取网易云音乐热门评论
Stars: ✭ 569 (+1085.42%)
SpiderA configurable web spider with a easy-to-use web console
Stars: ✭ 954 (+1887.5%)
91porn php最简单的91porn爬虫php版本
Stars: ✭ 557 (+1060.42%)
SeekerSeeker - another job board aggregator.
Stars: ✭ 16 (-66.67%)
FbcrawlA Facebook crawler
Stars: ✭ 536 (+1016.67%)
Lizard💐 Full Amazon Automatic Download
Stars: ✭ 41 (-14.58%)
Go jobs带你了解一下Golang的市场行情
Stars: ✭ 526 (+995.83%)
Web2executableUses NW.js to generate "native" apps for already existing web apps.
Stars: ✭ 824 (+1616.67%)
Awesome CrawlerA collection of awesome web crawler,spider in different languages
Stars: ✭ 4,793 (+9885.42%)
BlackwidowA Python based web application scanner to gather OSINT and fuzz for OWASP vulnerabilities on a target website.
Stars: ✭ 887 (+1747.92%)
GospiderGospider - Fast web spider written in Go
Stars: ✭ 785 (+1535.42%)
PhotonIncredibly fast crawler designed for OSINT.
Stars: ✭ 8,332 (+17258.33%)
Go DemoGo语言实例教程从入门到进阶,包括基础库使用、设计模式、面试易错点、工具类、对接第三方等
Stars: ✭ 881 (+1735.42%)
FunpyspidersearchengineWord2vec 千人千面 个性化搜索 + Scrapy2.3.0(爬取数据) + ElasticSearch7.9.1(存储数据并提供对外Restful API) + Django3.1.1 搜索
Stars: ✭ 782 (+1529.17%)