A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (+412.5%)

Mutual labels: crawler, spider

Grab Site

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

Stars: ✭ 680 (+431.25%)

Mutual labels: crawler, spider

Torbot

Dark Web OSINT Tool

Stars: ✭ 821 (+541.41%)

Mutual labels: crawler, spider

Douyin

API of DouYin for Humans, used to crawl popular videos and music

Stars: ✭ 580 (+353.13%)

Mutual labels: crawler, spider

Avbook

AV movie management system; crawlers for avmoo, javbus, and javlibrary; online AV film library; AV magnet link database. Japanese Adult Video Library, Adult Video Magnet Links - Japanese Adult Video Database

Stars: ✭ 8,133 (+6253.91%)

Mutual labels: crawler, spider

Photon

Incredibly fast crawler designed for OSINT.

Stars: ✭ 8,332 (+6409.38%)

Mutual labels: crawler, spider

Car Prices

Golang crawler that scrapes the used-car product catalog of Autohome (汽车之家)

Stars: ✭ 57 (-55.47%)

Mutual labels: crawler, spider

Lxspider

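A collection of crawler examples, including but not limited to: Taobao, JD.com, Tmall, Douban, Douyin, Kuaishou, Weibo, WeChat, Alibaba, Toutiao, pdd, Youku, iQIYI, Ctrip, 12306, 58, Sohu, Baidu Index, VIP and Wanfang, Zlibraty, Oalib, novels, bidding sites, procurement sites, and Xiaohongshu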

Stars: ✭ 60 (-53.12%)

Mutual labels: crawler, weibo

Arachnid

Powerful web scraping framework for Crystal

Stars: ✭ 68 (-46.87%)

Mutual labels: crawler, spider

Crawler examples

Some classic web crawler projects.

Stars: ✭ 74 (-42.19%)

Mutual labels: crawler, spider

Examples Of Web Crawlers

Some interesting examples of Python crawlers that are friendly to beginners, mainly scraping sites such as Taobao, Tmall, WeChat, Douban, and QQ.

Stars: ✭ 10,724 (+8278.13%)

Mutual labels: crawler, spider

Crawly

Crawly, a high-level web crawling & scraping framework for Elixir.

Stars: ✭ 440 (+243.75%)

Mutual labels: crawler, spider

Douyinsdk

Douyin SDK for data collection; crawling is no longer just a dream

Stars: ✭ 99 (-22.66%)

Mutual labels: crawler, spider

Not Your Average Web Crawler

A web crawler (for bug hunting) that gathers more than you can imagine.

Stars: ✭ 107 (-16.41%)

Mutual labels: crawler, spider

Bilibili member crawler

Bilibili user crawler. Yay, it's a crawler!

Stars: ✭ 115 (-10.16%)

Mutual labels: crawler, spider

Weibo Analyst

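Toolbox for analyzing Chinese social media (Weibo) comments. Features: 1. Weibo comment data crawling; 2. word segmentation and keyword extraction; 3. word clouds and word-frequency statistics; 4. sentiment analysis; 5. topic clustering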

Stars: ✭ 430 (+235.94%)

Mutual labels: crawler, weibo

Fbcrawl

A Facebook crawler

Stars: ✭ 536 (+318.75%)

Mutual labels: crawler, spider

Xsrfprobe

The Prime Cross Site Request Forgery (CSRF) Audit and Exploitation Toolkit.

Stars: ✭ 532 (+315.63%)

Mutual labels: crawler, spider

Netdiscovery

NetDiscovery is a general-purpose crawler framework/middleware built on Vert.x, RxJava 2, and other frameworks.

Stars: ✭ 573 (+347.66%)

Mutual labels: crawler, spider

Gosint

OSINT Swiss Army Knife

Stars: ✭ 401 (+213.28%)

Mutual labels: crawler, spider

Icrawler

A multi-thread crawler framework with many builtin image crawlers provided.

Stars: ✭ 629 (+391.41%)

Mutual labels: crawler, spider

Baiduimagespider

A super lightweight Baidu image crawler

Stars: ✭ 591 (+361.72%)

Mutual labels: crawler, spider

Pkulaw spider

Crawls the PKU Law (北大法宝) site http://www.pkulaw.cn/Case/

Stars: ✭ 113 (-11.72%)

Mutual labels: crawler, spider

Newcrawler

Free Web Scraping Tool with Java

Stars: ✭ 589 (+360.16%)

Mutual labels: crawler, spider

Gospider

Gospider - Fast web spider written in Go

Stars: ✭ 785 (+513.28%)

Mutual labels: crawler, spider

Crawler

A high performance web crawler in Elixir.

Stars: ✭ 781 (+510.16%)

Mutual labels: crawler, spider

Baiduspider

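BaiduSpider, a crawler for Baidu search results. It currently supports Baidu web search, Baidu image search, Baidu Zhidao search, Baidu video search, Baidu news search, Baidu Wenku search, Baidu Jingyan search, and Baidu Baike search.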

Stars: ✭ 105 (-17.97%)

Mutual labels: crawler, spider

Bilili

🍻 bilibili video (including bangumi) and danmaku downloader

Stars: ✭ 379 (+196.09%)

Mutual labels: crawler, spider

Crawlab

Distributed web crawler admin platform for managing spiders, regardless of language or framework.

Stars: ✭ 8,392 (+6456.25%)

Mutual labels: crawler, spider

Lizard

💐 Full Amazon Automatic Download

Stars: ✭ 41 (-67.97%)

Mutual labels: crawler, spider

Weibo Crawler

Sina Weibo crawler: uses Python to crawl Sina Weibo data and download Weibo images and videos

Stars: ✭ 1,019 (+696.09%)

Mutual labels: crawler, weibo

Maman

Rust Web Crawler saving pages on Redis

Stars: ✭ 39 (-69.53%)

Mutual labels: crawler, spider

Beanbun

Beanbun is a multi-process web crawler framework written in PHP, with good openness and high extensibility, based on Workerman.

Stars: ✭ 1,096 (+756.25%)

Mutual labels: crawler, spider

Spider

python crawler spider

Stars: ✭ 70 (-45.31%)

Mutual labels: crawler, spider

Nodespider

[DEPRECATED] Simple, flexible, delightful web crawler/spider package

Stars: ✭ 33 (-74.22%)

Mutual labels: crawler, spider

Gopa Abandoned

GOPA, a spider written in Go. (NOTE: this project moved to https://github.com/infinitbyte/gopa)

Stars: ✭ 98 (-23.44%)

Mutual labels: crawler, spider

Weibo Album Crawler

Multi-threaded crawler for full-size images from Sina Weibo albums.

Stars: ✭ 83 (-35.16%)

Mutual labels: crawler, weibo

Pspider

A simple, easy-to-use Python crawler framework. QQ discussion group: 597510560

Stars: ✭ 1,611 (+1158.59%)

Mutual labels: crawler, spider

Geziyor

Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.

Stars: ✭ 1,246 (+873.44%)

Mutual labels: crawler, spider

Skycaiji

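Skycaiji (蓝天采集器) is free crawler software for data collection and publishing, built with PHP and MySQL and deployable on cloud servers. It can collect almost any type of web page, integrates seamlessly with various CMS site builders, publishes data in real time without login, and runs fully automatically without manual intervention. It is a fully cross-platform, cloud-based crawler system for large-scale web data collection.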

Stars: ✭ 1,514 (+1082.81%)

Mutual labels: crawler, spider

Spider Flow

A next-generation crawler platform that defines crawler workflows graphically, so a crawler can be built without writing code.

Stars: ✭ 365 (+185.16%)

Mutual labels: crawler, spider

Signature algorithm

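Request signing or encryption algorithms for various apps, mini programs, and websites. Currently covers: Ziroom, Xiaohongshu, Danke Apartment, luckin coffee, and bangkokair (Bangkok Airways)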

Stars: ✭ 380 (+196.88%)

Mutual labels: crawler, spider

Scrapit

Scraping scripts for various websites.

Stars: ✭ 25 (-80.47%)

Mutual labels: crawler, spider

Puppeteer Walker

a puppeteer walker 🕷 🕸

Stars: ✭ 78 (-39.06%)

Mutual labels: crawler, spider

Ruia

Async Python 3.6+ web scraping micro-framework based on asyncio

Stars: ✭ 1,366 (+967.19%)

Mutual labels: crawler, spider
