web spider built by puppeteer, support task-queue and task-scheduling by decorators，support nedb / mongodb, support data visualization; 基于puppeteer的web爬虫框架，提供灵活的任务队列管理调度方案，提供便捷的数据保存方案（nedb/mongodb），提供数据可视化和用户交互的实现方案

Stars: ✭ 237 (+16.75%)

Mutual labels: crawler, spider

Lianjia Beike Spider

链家网和贝壳网房价爬虫，采集北京上海广州深圳等21个中国主要城市的房价数据（小区，二手房，出租房，新房），稳定可靠快速！支持csv,MySQL, MongoDB,Excel, json存储，支持Python2和3，图表展示数据，注释丰富，点星支持，仅供学习参考，请勿用于商业用途，后果自负。

Stars: ✭ 2,257 (+1011.82%)

Mutual labels: crawler, spider

Scrapit

Scraping scripts for various websites.

Stars: ✭ 25 (-87.68%)

Mutual labels: crawler, spider

Universityrecruitment Ssurvey

用严肃的数据来回答“什么样的企业会到什么样的大学招聘”？

Stars: ✭ 30 (-85.22%)

Mutual labels: crawler, beautifulsoup

Zhihu Crawler

zhihu-crawler是一个基于Java的高性能、支持免费http代理池、支持横向扩展、分布式爬虫项目

Stars: ✭ 890 (+338.42%)

Mutual labels: crawler, spider

Crawlab

Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台，支持任何语言和框架

Stars: ✭ 8,392 (+4033.99%)

Mutual labels: crawler, spider

Lizard

💐 Full Amazon Automatic Download

Stars: ✭ 41 (-79.8%)

Mutual labels: crawler, spider

Photon

Incredibly fast crawler designed for OSINT.

Stars: ✭ 8,332 (+4004.43%)

Mutual labels: crawler, spider

Torbot

Dark Web OSINT Tool

Stars: ✭ 821 (+304.43%)

Mutual labels: crawler, spider

Spider

python crawler spider

Stars: ✭ 70 (-65.52%)

Mutual labels: crawler, spider

Arachnid

Powerful web scraping framework for Crystal

Stars: ✭ 68 (-66.5%)

Mutual labels: crawler, spider

Js Reverse

JS逆向研究

Stars: ✭ 159 (-21.67%)

Mutual labels: crawler, spider

Beanbun

Beanbun 是用 PHP 编写的多进程网络爬虫框架，具有良好的开放性、高可扩展性，基于 Workerman。

Stars: ✭ 1,096 (+439.9%)

Mutual labels: crawler, spider

Gopa Abandoned

GOPA, a spider written in Go.（NOTE: this project moved to https://github.com/infinitbyte/gopa ）

Stars: ✭ 98 (-51.72%)

Mutual labels: crawler, spider

Geziyor

Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.

Stars: ✭ 1,246 (+513.79%)

Mutual labels: crawler, spider

Skycaiji

蓝天采集器是一款免费的数据采集发布爬虫软件，采用php+mysql开发，可部署在云服务器，几乎能采集所有类型的网页，无缝对接各类CMS建站程序，免登录实时发布数据，全自动无需人工干预！是网页大数据采集软件中完全跨平台的云端爬虫系统

Stars: ✭ 1,514 (+645.81%)

Mutual labels: crawler, spider

Gospider

Gospider - Fast web spider written in Go

Stars: ✭ 785 (+286.7%)

Mutual labels: crawler, spider

Pkulaw spider

爬取北大法宝网http://www.pkulaw.cn/Case/

Stars: ✭ 113 (-44.33%)

Mutual labels: crawler, spider

Baiduspider

BaiduSpider，一个爬取百度搜索结果的爬虫，目前支持百度网页搜索，百度图片搜索，百度知道搜索，百度视频搜索，百度资讯搜索，百度文库搜索，百度经验搜索和百度百科搜索。

Stars: ✭ 105 (-48.28%)

Mutual labels: crawler, spider

Abot

Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.

Stars: ✭ 1,961 (+866.01%)

Mutual labels: crawler, spider

Hive

lots of spider (很多爬虫）

Stars: ✭ 110 (-45.81%)

Mutual labels: spider, beautifulsoup

Pspider

简单易用的Python爬虫框架，QQ交流群：597510560

Stars: ✭ 1,611 (+693.6%)

Mutual labels: crawler, spider

Free proxy website

获取免费socks/https/http代理的网站集合

Stars: ✭ 119 (-41.38%)

Mutual labels: crawler, spider

Weibo Topic Spider

微博超级话题爬虫，微博词频统计+情感分析+简单分类，新增肺炎超话爬取数据

Stars: ✭ 128 (-36.95%)

Mutual labels: crawler, spider

Not Your Average Web Crawler

A web crawler (for bug hunting) that gathers more than you can imagine.

Stars: ✭ 107 (-47.29%)

Mutual labels: crawler, spider

Chromium for spider

dynamic crawler for web vulnerability scanner

Stars: ✭ 220 (+8.37%)

Mutual labels: crawler, spider

Laravel Crawler Detect

A Laravel wrapper for CrawlerDetect - the web crawler detection library

Stars: ✭ 227 (+11.82%)

Mutual labels: crawler, spider

Crawler

A high performance web crawler in Elixir.

Stars: ✭ 781 (+284.73%)

Mutual labels: crawler, spider

Crawler Detect

🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent

Stars: ✭ 1,549 (+663.05%)

Mutual labels: crawler, spider

Mm131

MM131网站图片爬取 🚨

Stars: ✭ 129 (-36.45%)

Mutual labels: crawler, spider

Jlitespider

A lite distributed Java spider framework :-)

Stars: ✭ 151 (-25.62%)

Mutual labels: crawler, spider

Zhihuspider

多线程知乎用户爬虫，基于python3

Stars: ✭ 201 (-0.99%)

Mutual labels: crawler, spider

61-120 of 1221 similar projects

‹

›

next*5