稳健高效的评分制-针对性- IP代理池 + API服务，可以自己插入采集器进行代理IP的爬取，针对你的爬虫的一个或多个目标网站分别生成有效的IP代理数据库，支持MongoDB 4.0 使用 Python3.7（Scored IP proxy pool ,customise proxy data crawler can be added anytime）

Stars: ✭ 195 (-55.78%)

Mutual labels: crawler, spider

91porn Api

🌭💦 91porn爬虫在线无限制API接口（永久有效，口令每日更新）及在线web预览

Stars: ✭ 341 (-22.68%)

Mutual labels: crawler, spider

Ncov2019 data crawler

疫情数据爬虫，2019新型冠状病毒数据仓库，轨迹数据，同乘数据，报道

Stars: ✭ 175 (-60.32%)

Mutual labels: crawler, spider

Signature algorithm

各种App、小程序、网站的请求签名或加密算法。现已有：自如、小红书、蛋壳公寓、luckin coffee(瑞幸咖啡)、bangkokair(曼谷航空)

Stars: ✭ 380 (-13.83%)

Mutual labels: crawler, spider

Ppspider

web spider built by puppeteer, support task-queue and task-scheduling by decorators，support nedb / mongodb, support data visualization; 基于puppeteer的web爬虫框架，提供灵活的任务队列管理调度方案，提供便捷的数据保存方案（nedb/mongodb），提供数据可视化和用户交互的实现方案

Stars: ✭ 237 (-46.26%)

Mutual labels: crawler, spider

Moodle Downloader 2

A Moodle downloader that downloads course content fast from Moodle (eg. lecture pdfs)

Stars: ✭ 118 (-73.24%)

Mutual labels: content, crawler

flink-crawler

Continuous scalable web crawler built on top of Flink and crawler-commons

Stars: ✭ 48 (-89.12%)

Mutual labels: crawler, spider

arachnod

High performance crawler for Nodejs

Stars: ✭ 17 (-96.15%)

Mutual labels: crawler, spider

WebCrawler

一个轻量级、快速、多线程、多管道、灵活配置的网络爬虫。

Stars: ✭ 39 (-91.16%)

Mutual labels: crawler, spider

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

Stars: ✭ 277 (-37.19%)

Mutual labels: crawler, spider

Abot

Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.

Stars: ✭ 1,961 (+344.67%)

Mutual labels: crawler, spider

Python3 Spider

Python爬虫实战 - 模拟登陆各大网站包含但不限于：滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝，如果喜欢请start ❤️

Stars: ✭ 2,129 (+382.77%)

Mutual labels: crawler, spider

Fictiondown

Stars: ✭ 362 (-17.91%)

Mutual labels: crawler, spider

Jlitespider

A lite distributed Java spider framework :-)

Stars: ✭ 151 (-65.76%)

Mutual labels: crawler, spider

Gain

Web crawling framework based on asyncio.

Stars: ✭ 2,002 (+353.97%)

Mutual labels: crawler, spider

Fun crawler

Crawl some picture for fun

Stars: ✭ 169 (-61.68%)

Mutual labels: crawler, spider

Freshonions Torscraper

Fresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion

Stars: ✭ 348 (-21.09%)

Mutual labels: crawler, spider

Crawler China Mainland Universities

中国大陆大学列表爬虫

Stars: ✭ 143 (-67.57%)

Mutual labels: crawler, spider

Marmot

💐Marmot | Web Crawler/HTTP protocol Download Package 🐭

Stars: ✭ 186 (-57.82%)

Mutual labels: crawler, spider

Lianjia Beike Spider

链家网和贝壳网房价爬虫，采集北京上海广州深圳等21个中国主要城市的房价数据（小区，二手房，出租房，新房），稳定可靠快速！支持csv,MySQL, MongoDB,Excel, json存储，支持Python2和3，图表展示数据，注释丰富，点星支持，仅供学习参考，请勿用于商业用途，后果自负。

Stars: ✭ 2,257 (+411.79%)

Mutual labels: crawler, spider

Ok ip proxy pool

🍿爬虫代理IP池(proxy pool) python🍟一个还ok的IP代理池

Stars: ✭ 196 (-55.56%)

Mutual labels: crawler, spider

Zhihu Crawler People

A simple distributed crawler for zhihu && data analysis

Stars: ✭ 182 (-58.73%)

Mutual labels: crawler, spider

Ttbot

今日头条机器人，支持用户登陆、关注、取消关注、获取关注粉丝、发文、发悟空问答、点赞、评论、采集各种类型新闻讯息等，使用今日头条网页版API实现

Stars: ✭ 338 (-23.36%)

Mutual labels: crawler, spider

Querylist

🕷️ The progressive PHP crawler framework! 优雅的渐进式PHP采集框架。

Stars: ✭ 2,392 (+442.4%)

Mutual labels: crawler, spider

Colly

Elegant Scraper and Crawler Framework for Golang

Stars: ✭ 15,535 (+3422.68%)

Mutual labels: crawler, spider

Amazonbigspider

😱Full Automatic Amazon Distributed Spider | 亚马逊分布式四国际站采集选款产品|账号admin,密码adminadmin

Stars: ✭ 140 (-68.25%)

Mutual labels: crawler, spider

Zhihu Login

知乎模拟登录，支持提取验证码和保存 Cookies

Stars: ✭ 340 (-22.9%)

Mutual labels: crawler, spider

Laravel Crawler Detect

A Laravel wrapper for CrawlerDetect - the web crawler detection library

Stars: ✭ 227 (-48.53%)

Mutual labels: crawler, spider

Fast Lianjia Crawler

直接通过链家 API 抓取数据的极速爬虫，宇宙最快~~ 🚀

Stars: ✭ 247 (-43.99%)

Mutual labels: crawler, spider

Chromium for spider

dynamic crawler for web vulnerability scanner

Stars: ✭ 220 (-50.11%)

Mutual labels: crawler, spider

crawler

A simple and flexible web crawler framework for java.

Stars: ✭ 20 (-95.46%)

Mutual labels: crawler, spider

weixin article spiders

A spiders' program for weixin which made by Express & cheerio

Stars: ✭ 33 (-92.52%)

Mutual labels: spider, article

slime

🍰 一个可视化的爬虫平台

Stars: ✭ 27 (-93.88%)

Mutual labels: crawler, spider

Jd mask robot

京东口罩库存监控爬虫(非selenium)，扫码登录、查价、加购、下单、秒杀

Stars: ✭ 216 (-51.02%)

Mutual labels: crawler, spider

Bt Btt

磁力網站U3C3介紹以及域名更新

Stars: ✭ 261 (-40.82%)

Mutual labels: crawler, spider

Crawlertutorial

爬蟲極簡教學（fetch, parse, search, multiprocessing, API）- PTT 為例

Stars: ✭ 282 (-36.05%)

Mutual labels: crawler, spider

Gospider

golang实现的爬虫框架，使用者只需关心页面规则，提供web管理界面。基于colly开发。

Stars: ✭ 285 (-35.37%)

Mutual labels: crawler, spider

galer

A fast tool to fetch URLs from HTML attributes by crawl-in.

Stars: ✭ 138 (-68.71%)

Mutual labels: crawler, spider

Spider Flow

新一代爬虫平台，以图形化方式定义爬虫流程，不写代码即可完成爬虫。

Stars: ✭ 365 (-17.23%)

Mutual labels: crawler, spider

Mm131

MM131网站图片爬取 🚨

Stars: ✭ 129 (-70.75%)

Mutual labels: crawler, spider

Go spider

[爬虫框架 (golang)] An awesome Go concurrent Crawler(spider) framework. The crawler is flexible and modular. It can be expanded to an Individualized crawler easily or you can use the default crawl components only.

Stars: ✭ 1,745 (+295.69%)

Mutual labels: crawler, spider

Webvideobot

Web crawler.

Stars: ✭ 214 (-51.47%)

Mutual labels: crawler, spider

ZhengFang System Spider

🐛一只登录正方教务管理系统，爬取数据的小爬虫

Stars: ✭ 21 (-95.24%)

Mutual labels: crawler, spider

Toapi

Every web site provides APIs.

Stars: ✭ 3,209 (+627.66%)

Mutual labels: crawler, spider

1-60 of 952 similar projects

›

next*5