1428 open source projects that are alternatives to or similar to Marmot

Ok ip proxy pool

🍿 Crawler proxy IP pool (proxy pool) in Python 🍟 A decent IP proxy pool

Stars: ✭ 196 (+5.38%)

Mutual labels: crawler, spider, proxy

Free proxy website

A collection of websites for obtaining free SOCKS/HTTPS/HTTP proxies

Stars: ✭ 119 (-36.02%)

Mutual labels: crawler, spider, proxy

Scrapoxy

Scrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!

Stars: ✭ 1,322 (+610.75%)

Mutual labels: crawler, scrapy, proxy

Spoon

🥄 A package for building specific Proxy Pool for different Sites.

Stars: ✭ 173 (-6.99%)

Mutual labels: crawler, spider, proxy

Python3 Spider

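Python crawlers in practice: simulated login to major websites, including but not limited to slider CAPTCHA verification, Pinduoduo, Meituan, Baidu, bilibili, Dianping, and Taobao. If you like it, please star ❤️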

Stars: ✭ 2,129 (+1044.62%)

Mutual labels: crawler, spider, scrapy

Ppspider

Web spider framework built on puppeteer. Supports flexible task queues and task scheduling via decorators, convenient data storage (nedb / mongodb), and data visualization with user interaction.

Stars: ✭ 237 (+27.42%)

Mutual labels: crawler, spider, proxy

Haipproxy

💖 Highly available distributed IP proxy pool, powered by Scrapy and Redis

Stars: ✭ 4,993 (+2584.41%)

Mutual labels: crawler, spider, scrapy

Fbcrawl

A Facebook crawler

Stars: ✭ 536 (+188.17%)

Mutual labels: crawler, spider, scrapy

Fp Server

Free proxy server, continuously crawling and providing proxies, based on Tornado and Scrapy. Build your own proxy pool locally.

Stars: ✭ 154 (-17.2%)

Mutual labels: spider, scrapy, proxy

Proxy pool

Python crawler proxy IP pool (proxy pool)

Stars: ✭ 13,964 (+7407.53%)

Mutual labels: crawler, spider, proxy

Crawlab Lite

Lite version of Crawlab, a crawler management platform.

Stars: ✭ 122 (-34.41%)

Mutual labels: crawler, spider, scrapy

Icrawler

A multi-thread crawler framework with many builtin image crawlers provided.

Stars: ✭ 629 (+238.17%)

Mutual labels: crawler, spider, scrapy

Scrapy Crawlera

Crawlera middleware for Scrapy

Stars: ✭ 281 (+51.08%)

Mutual labels: crawler, scrapy, proxy

Goribot

[Crawler/Scraper for Golang] 🕷 A lightweight, distributed-friendly Golang crawler framework.

Stars: ✭ 190 (+2.15%)

Mutual labels: crawler, spider, scrapy

Crawlab

Distributed web crawler admin platform for spider management, regardless of languages and frameworks.

Stars: ✭ 8,392 (+4411.83%)

Mutual labels: crawler, spider, scrapy

Scrapingoutsourcing

ScrapingOutsourcing focuses on sharing crawler code, aiming to add one new crawler each week

Stars: ✭ 164 (-11.83%)

Mutual labels: crawler, spider, scrapy

Cracker

tunnel over http[s]

Stars: ✭ 107 (-42.47%)

Mutual labels: proxy, socks5

V2ray Core

A platform for building proxies to bypass network restrictions.

Stars: ✭ 38,782 (+20750.54%)

Mutual labels: proxy, socks5

Baiduspider

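BaiduSpider, a crawler for Baidu search results. It currently supports Baidu web search, image search, Zhidao (Q&A) search, video search, news search, Wenku (documents) search, Jingyan (how-to) search, and Baike (encyclopedia) search.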

Stars: ✭ 105 (-43.55%)

Mutual labels: crawler, spider

Patentcrawler

Scrapy patent crawler (no longer maintained)

Stars: ✭ 114 (-38.71%)

Mutual labels: crawler, scrapy

Pkulaw spider

Crawls the PKU Law (北大法宝) website http://www.pkulaw.cn/Case/

Stars: ✭ 113 (-39.25%)

Mutual labels: crawler, spider

Lianjia Beike Spider

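House price crawler for Lianjia and Beike. Collects house price data (residential communities, second-hand homes, rentals, new homes) for 21 major Chinese cities, including Beijing, Shanghai, Guangzhou, and Shenzhen. Stable, reliable, and fast! Supports CSV, MySQL, MongoDB, Excel, and JSON storage, supports Python 2 and 3, displays data in charts, and is richly commented. Stars are appreciated. For learning and reference only; do not use for commercial purposes, at your own risk.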

Stars: ✭ 2,257 (+1113.44%)

Mutual labels: crawler, spider

Php Whois

PHP WHOIS provides parsed and raw whois lookup of domains and ASN routes. PHP 5.4+ and 7+ compatible

Stars: ✭ 179 (-3.76%)

Mutual labels: proxy, socks5

Copybook

Crawls all novels from novel websites, stores them in a database, and uses the crawled data to build your own novel website

Stars: ✭ 117 (-37.1%)

Mutual labels: spider, scrapy

Decryptlogin

APIs for logging in to some websites using requests.

Stars: ✭ 1,861 (+900.54%)

Mutual labels: crawler, spider

Docs

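Source code for the book 《数据采集从入门到放弃》 (Data Collection: From Getting Started to Giving Up). Contents: introduction to crawlers, the job market, and crawler engineer interview questions; introduction to the HTTP protocol; using Requests; introduction to the XPath parser; MongoDB and MySQL; multithreaded crawlers; introduction to Scrapy; introduction to Scrapy-redis; deploying with Docker; managing a Docker cluster with Nomad; querying Docker logs with EFK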

Stars: ✭ 118 (-36.56%)

Mutual labels: crawler, scrapy

Not Your Average Web Crawler

A web crawler (for bug hunting) that gathers more than you can imagine.

Stars: ✭ 107 (-42.47%)

Mutual labels: crawler, spider

Crawler Detect

🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent

Stars: ✭ 1,549 (+732.8%)

Mutual labels: crawler, spider

Hive

Lots of spiders

Stars: ✭ 110 (-40.86%)

Mutual labels: spider, scrapy

Crawler

Crawlers, HTTP proxies, simulated login!

Stars: ✭ 106 (-43.01%)

Mutual labels: crawler, scrapy

Douban Movie

Golang crawler that scrapes the Douban Movie Top 250

Stars: ✭ 114 (-38.71%)

Mutual labels: crawler, spider

Scrala

Unmaintained 🐳 ☕️ 🕷 Scala crawler(spider) framework, inspired by scrapy, created by @gaocegege

Stars: ✭ 113 (-39.25%)

Mutual labels: spider, scrapy

Ncov2019 data crawler

Epidemic data crawler; 2019 novel coronavirus data repository: trajectory data, co-passenger data, news reports

Stars: ✭ 175 (-5.91%)

Mutual labels: crawler, spider

Skycaiji

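Skycaiji (蓝天采集器) is a free data collection and publishing crawler application built with PHP + MySQL. It can be deployed on cloud servers, collects almost all types of web pages, integrates seamlessly with various CMS site builders, publishes data in real time without login, and runs fully automatically without manual intervention. It is a fully cross-platform cloud crawler system among web big-data collection software.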

Stars: ✭ 1,514 (+713.98%)

Mutual labels: crawler, spider

Examples Of Web Crawlers

Some interesting examples of Python crawlers that are friendly to beginners, mainly crawling sites such as Taobao, Tmall, WeChat, Douban, and QQ.

Stars: ✭ 10,724 (+5665.59%)

Mutual labels: crawler, spider

Baiducrawler

Sample of using proxies to crawl baidu search results.

Stars: ✭ 116 (-37.63%)

Mutual labels: crawler, proxy

Bilibili member crawler

Bilibili user crawler. Yay, it's a crawler!

Stars: ✭ 115 (-38.17%)

Mutual labels: crawler, spider

Qqmusicspider

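Scrapy-based QQ Music spider that crawls song information, lyrics, and top comments, and shares a music corpus of 490,000+ entries from the top 6,400 mainland Chinese and Hong Kong/Taiwan singers on QQ Music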

Stars: ✭ 120 (-35.48%)

Mutual labels: crawler, scrapy

Glider

glider is a forward proxy with multi-protocol support, and also a DNS/DHCP server with ipset management features (like dnsmasq).

Stars: ✭ 1,710 (+819.35%)

Mutual labels: proxy, socks5

Pspider

A simple, easy-to-use Python crawler framework. QQ discussion group: 597510560

Stars: ✭ 1,611 (+766.13%)

Mutual labels: crawler, spider

Flynet

A powerful TCP/UDP tool that supports SOCKS5 proxying over TCP and UDP, HTTP proxying, and NAT traversal. This tool can help you bypass the GFW easily

Stars: ✭ 124 (-33.33%)

Mutual labels: proxy, socks5

Feapder

feapder is a Python crawler framework that supports distributed crawling, batch collection, task-loss prevention, and rich alerting

Stars: ✭ 110 (-40.86%)

Mutual labels: spider, scrapy

Linkedin Profile Scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

Stars: ✭ 171 (-8.06%)

Mutual labels: crawler, spider

Websocks

A secure proxy based on WebSocket.

Stars: ✭ 102 (-45.16%)

Mutual labels: proxy, socks5

V2ray Panel Master

Deprecated

Stars: ✭ 136 (-26.88%)

Mutual labels: proxy, socks5

Mm131

Image crawler for the MM131 website 🚨

Stars: ✭ 129 (-30.65%)

Mutual labels: crawler, spider

Go spider

[Crawler framework (Golang)] An awesome concurrent Go crawler (spider) framework. The crawler is flexible and modular. It can easily be extended into a customized crawler, or you can use only the default crawl components.

Stars: ✭ 1,745 (+838.17%)

Mutual labels: crawler, spider

Crawler China Mainland Universities

Crawler for the list of universities in mainland China

Stars: ✭ 143 (-23.12%)

Mutual labels: crawler, spider

Digger

Digger is a powerful and flexible web crawler implemented in pure Go

Stars: ✭ 130 (-30.11%)

Mutual labels: crawler, spider

Amazonbigspider

😱 Fully automatic distributed Amazon spider | Distributed product collection and selection across four Amazon international sites | account: admin, password: adminadmin