A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (+165.59%)

Mutual labels: crawler, spider

Awesome Python Primer

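An index of high-quality Chinese-language resources for self-taught Python beginners, covering books, documentation, and videos, suited to web crawling, web development, data analysis, and machine learning.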

Stars: ✭ 57 (-76.92%)

Mutual labels: crawler, spider

Photon

Incredibly fast crawler designed for OSINT.

Stars: ✭ 8,332 (+3273.28%)

Mutual labels: crawler, spider

Beanbun

Beanbun is a multi-process web crawler framework written in PHP and built on Workerman, designed to be open and highly extensible.

Stars: ✭ 1,096 (+343.72%)

Mutual labels: crawler, spider

Avbook

AV movie management system; crawler for avmoo, javbus, and javlibrary; online AV video library; AV magnet link database. Japanese Adult Video Library, Adult Video Magnet Links - Japanese Adult Video Database

Stars: ✭ 8,133 (+3192.71%)

Mutual labels: crawler, spider

Puppeteer Walker

a puppeteer walker 🕷 🕸

Stars: ✭ 78 (-68.42%)

Mutual labels: crawler, spider

Crawler examples

Some classic web crawler projects.

Stars: ✭ 74 (-70.04%)

Mutual labels: crawler, spider

Laravel Crawler Detect

A Laravel wrapper for CrawlerDetect - the web crawler detection library

Stars: ✭ 227 (-8.1%)

Mutual labels: crawler, spider

Icrawler

A multi-threaded crawler framework with many built-in image crawlers.

Stars: ✭ 629 (+154.66%)

Mutual labels: crawler, spider

Baiduimagespider

A super lightweight Baidu image crawler.

Stars: ✭ 591 (+139.27%)

Mutual labels: crawler, spider

Douyinsdk

Douyin SDK for data collection; crawling and scraping is no longer just a dream.

Stars: ✭ 99 (-59.92%)

Mutual labels: crawler, spider

Digger

Digger is a powerful and flexible web crawler implemented in pure Go.

Stars: ✭ 130 (-47.37%)

Mutual labels: crawler, spider

Crawler China Mainland Universities

A crawler for the list of universities in mainland China.

Stars: ✭ 143 (-42.11%)

Mutual labels: crawler, spider

Ppspider

A web spider framework built on puppeteer. It provides flexible task-queue management and task scheduling via decorators, convenient data storage (nedb / mongodb), and support for data visualization and user interaction.

Stars: ✭ 237 (-4.05%)

Mutual labels: crawler, spider

Algoliasearch Netlify

Official Algolia Plugin for Netlify. Index your website to Algolia when deploying your project to Netlify with the Algolia Crawler

Stars: ✭ 208 (-15.79%)

Mutual labels: crawler

Crawler illegal cases in china

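A collection of legal cases in China involving web crawlers. This project gathers news, materials, and laws and regulations related to lawsuits and violations involving crawler developers in mainland China. It aims to help crawler practitioners working in mainland China understand the relevant laws and avoid crossing data-compliance red lines. [AD] Chinese knowledge graph portal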

Stars: ✭ 2,448 (+891.09%)

Mutual labels: crawler

Grab

Web Scraping Framework

Stars: ✭ 2,147 (+769.23%)

Mutual labels: spider

Proxybroker

Proxy [Finder | Checker | Server]. HTTP(S) & SOCKS 🎭

Stars: ✭ 2,767 (+1020.24%)

Mutual labels: crawler

Fiction house

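Fiction House is a multi-platform (web, Android app, WeChat mini program), full-featured, screen-adaptive system for serialized novels and comics, with dedicated sections for premium novels, light novels, and comics. Features include novel/comic categories, search, rankings, completed works, ratings, online reading, bookshelves, and reading history, as well as novel downloads, novel bullet comments (danmaku), automatic collection, updating, and error correction of novels and comics, automatic sharing of novel content to Weibo, automated email promotion, and automatic link submission to the Baidu search engine.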

Stars: ✭ 2,710 (+997.17%)

Mutual labels: spider

Nosmoke

A cross-platform UI crawler that scans view trees, then generates and executes UI test cases.

Stars: ✭ 178 (-27.94%)

Mutual labels: crawler

Instagram Crawler

Crawl Instagram photos, posts, and videos for download.

Stars: ✭ 178 (-27.94%)

Mutual labels: crawler

Media Scraper

Scrapes all photos and videos in a web page / Instagram / Twitter / Tumblr / Reddit / pixiv / TikTok