AV movie management system; crawler for avmoo, javbus, and javlibrary; online AV film library; AV magnet link database. Japanese Adult Video Library, Adult Video Magnet Links - Japanese Adult Video Database

Stars: ✭ 8,133 (+4180.53%)

Mutual labels: crawler, spider, scraper

Colly

Elegant Scraper and Crawler Framework for Golang

Stars: ✭ 15,535 (+8076.32%)

Mutual labels: crawler, spider, scraper

Scrapoxy

Scrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!

Stars: ✭ 1,322 (+595.79%)

Mutual labels: crawler, scraper, scrapy

Spidr

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (+245.26%)

Mutual labels: crawler, spider, scraper

Geziyor

Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.

Stars: ✭ 1,246 (+555.79%)

Mutual labels: crawler, spider, scraper

Python3 Spider

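Python crawlers in practice: simulated login for major websites, including but not limited to slider CAPTCHA verification, Pinduoduo, Meituan, Baidu, bilibili, Dianping, and Taobao. If you like it, please star ❤️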

Stars: ✭ 2,129 (+1020.53%)

Mutual labels: crawler, spider, scrapy

Crawler

Crawlers, HTTP proxies, simulated login!

Stars: ✭ 106 (-44.21%)

Mutual labels: crawler, scrapy

Skycaiji

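SkyCaiji (蓝天采集器) is free crawler software for data collection and publishing, developed with PHP + MySQL and deployable on cloud servers. It can collect almost any type of web page, integrates seamlessly with various CMS site builders, publishes data in real time without logging in, and runs fully automatically without manual intervention. It is a fully cross-platform, cloud-based crawler system for large-scale web data collection.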

Stars: ✭ 1,514 (+696.84%)

Mutual labels: crawler, spider

Crawler Detect

🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent

Stars: ✭ 1,549 (+715.26%)

Mutual labels: crawler, spider

Dotnetcrawler

DotnetCrawler is a straightforward, lightweight web crawling/scraping library based on .NET Core with Entity Framework Core output. The library is designed like other strong crawler libraries such as WebMagic and Scrapy, but is extensible to your custom requirements. Medium link: https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

Stars: ✭ 100 (-47.37%)

Mutual labels: crawler, scrapy

Hive

Lots of spiders

Stars: ✭ 110 (-42.11%)

Mutual labels: spider, scrapy

Proxy pool

Proxy IP pool for Python crawlers (proxy pool)

Stars: ✭ 13,964 (+7249.47%)

Mutual labels: crawler, spider

Gain

Web crawling framework based on asyncio.

Stars: ✭ 2,002 (+953.68%)

Mutual labels: crawler, spider

Instagram Crawler

Crawl Instagram photos, posts, and videos for download.

Stars: ✭ 178 (-6.32%)

Mutual labels: crawler, scraper

Fun crawler

Crawl some pictures for fun

Stars: ✭ 169 (-11.05%)

Mutual labels: crawler, spider

Pkulaw spider

Crawls the PKULaw (北大法宝) website: http://www.pkulaw.cn/Case/

Stars: ✭ 113 (-40.53%)

Mutual labels: crawler, spider

Ruia

Async Python 3.6+ web scraping micro-framework based on asyncio

Stars: ✭ 1,366 (+618.95%)

Mutual labels: crawler, spider

Google Play Scraper

Node.js scraper to get data from Google Play

Stars: ✭ 1,606 (+745.26%)

Mutual labels: crawler, scraper

Baiduspider

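BaiduSpider, a crawler for Baidu search results. It currently supports Baidu web search, image search, Zhidao search, video search, news search, Wenku search, Jingyan search, and Baike search.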

Stars: ✭ 105 (-44.74%)

Mutual labels: crawler, spider

Scrapydweb

Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI.

Stars: ✭ 2,385 (+1155.26%)

Mutual labels: spider, scrapy

Bilibili member crawler

Bilibili user crawler. Yay, it's a crawler~

Stars: ✭ 115 (-39.47%)

Mutual labels: crawler, spider

Patentcrawler

Scrapy patent crawler (no longer maintained)

Stars: ✭ 114 (-40%)

Mutual labels: crawler, scrapy

Examples Of Web Crawlers

Some very interesting examples of Python crawlers that are friendly to beginners, mainly crawling sites such as Taobao, Tmall, WeChat, Douban, and QQ.

Stars: ✭ 10,724 (+5544.21%)

Mutual labels: crawler, spider

Decryptlogin

APIs for logging in to some websites using requests.

Stars: ✭ 1,861 (+879.47%)

Mutual labels: crawler, spider

Douban Movie

Golang crawler that scrapes the Douban Movie Top 250

Stars: ✭ 114 (-40%)

Mutual labels: crawler, spider

Copybook

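Uses a crawler to scrape all the novels from novel websites, stores them in a database, and builds your own novel website from the crawled data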

Stars: ✭ 117 (-38.42%)

Mutual labels: spider, scrapy

Seleniumcrawler

An example using Selenium WebDriver for Python and the Scrapy framework to create a web scraper that crawls an ASP site

Stars: ✭ 117 (-38.42%)

Mutual labels: scraper, scrapy

Free proxy website

A collection of websites for obtaining free SOCKS/HTTPS/HTTP proxies

Stars: ✭ 119 (-37.37%)

Mutual labels: crawler, spider

Pspider

A simple, easy-to-use Python crawler framework. QQ discussion group: 597510560

Stars: ✭ 1,611 (+747.89%)

Mutual labels: crawler, spider

Ncov2019 data crawler

Epidemic data crawler: 2019 novel coronavirus data warehouse, trajectory data, co-passenger data, news reports

Stars: ✭ 175 (-7.89%)

Mutual labels: crawler, spider

Douyinsdk

Douyin SDK: data collection and crawler scraping are no longer a dream

Stars: ✭ 99 (-47.89%)

Mutual labels: crawler, spider

Scrala

Unmaintained 🐳 ☕️ 🕷 Scala crawler(spider) framework, inspired by scrapy, created by @gaocegege

Stars: ✭ 113 (-40.53%)

Mutual labels: spider, scrapy

Docs

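Source code for "Data Collection: From Getting Started to Giving Up" (《数据采集从入门到放弃》). Contents: introduction to crawlers, the job market, and interview questions for crawler engineers; introduction to the HTTP protocol; using Requests; introduction to the XPath parser; MongoDB and MySQL; multithreaded crawlers; introduction to Scrapy; introduction to Scrapy-redis; deploying with Docker; managing Docker clusters with Nomad; querying Docker logs with EFK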

Stars: ✭ 118 (-37.89%)

Mutual labels: crawler, scrapy

Lianjia Beike Spider

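House price crawler for Lianjia and Beike. Collects house price data (residential communities, second-hand homes, rentals, new homes) for 21 major Chinese cities, including Beijing, Shanghai, Guangzhou, and Shenzhen. Stable, reliable, and fast! Supports CSV, MySQL, MongoDB, Excel, and JSON storage; supports Python 2 and 3; displays data in charts; richly commented. Please star to show support. For learning and reference only; do not use for commercial purposes, and use at your own risk.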

Stars: ✭ 2,257 (+1087.89%)

Mutual labels: crawler, spider

Datmusic Api

Alternative for VK Audio API

Stars: ✭ 160 (-15.79%)

Mutual labels: crawler, scraper

Js Reverse

JS reverse-engineering research

Stars: ✭ 159 (-16.32%)

Mutual labels: crawler, spider

1-60 of 1142 similar projects