AnticrawlersolutionIt covers the blockade principle of most anti-climbing strategies and corresponding solutions.👽👽👽👽(涵盖了大部分的反爬策略的封锁原理以及对应的解决方案。)
Bee UniversityProject thu thập điểm chuẩn đại học 2014 - 2018 và phân tích dữ liệu
GoscraperGolang pkg to quickly return a preview of a webpage (title/description/images)
ArachnidPowerful web scraping framework for Crystal
Lxspider爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、百度指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书》
Tumblr CrawlerEasily download all the photos/videos from tumblr blogs. 下载指定的 Tumblr 博客中的图片,视频
Hproxyhproxy - Asynchronous IP proxy pool, aims to make getting proxy as convenient as possible.(异步爬虫代理池)
Boj AutocommitWhen you solve the problem of Baekjoon Online Judge, it automatically commits and pushes to the remote repository.
ChemrtronA document viewer; fuzzy match incremental search.
BeanbunBeanbun 是用 PHP 编写的多进程网络爬虫框架,具有良好的开放性、高可扩展性,基于 Workerman。
CrawlergoA powerful dynamic crawler for web vulnerability scanners
Images Web CrawlerThis package is a complete tool for creating a large dataset of images (specially designed -but not only- for machine learning enthusiasts). It can crawl the web, download images, rename / resize / covert the images and merge folders..
Lyrics CrawlerGet the lyrics for the song currently playing on Spotify
Fund Crawler基于NodeJS的基金数据爬虫,爬取的数据存于github的@nullpointer/fund-data。
Social ScraperTổng hợp script crawl dữ liệu từ các mạng xã hội & website tiếng Việt
PixevalA Strong, Fast and Flexible Pixiv Client based on .NET Core and WPF
PhotonIncredibly fast crawler designed for OSINT.
AvbookAV 电影管理系统, avmoo , javbus , javlibrary 爬虫,线上 AV 影片图书馆,AV 磁力链接数据库,Japanese Adult Video Library,Adult Video Magnet Links - Japanese Adult Video Database
CrawlabDistributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Vulnxvulnx 🕷️ is an intelligent bot auto shell injector that detect vulnerabilities in multiple types of cms { `wordpress , joomla , drupal , prestashop .. `}
Lizard💐 Full Amazon Automatic Download
MamanRust Web Crawler saving pages on Redis
PixivcrawleriiiA python3 crawler for crawling Pixiv ranking top and any illustrator all artworks
DirhuntFind web directories without bruteforce
Schannel Qt5A GUI client of schannel powered by therecipe/qt and golang
DiskoverFile system crawler, disk space usage, file search engine and file system analytics powered by Elasticsearch
News Pleasenews-please - an integrated web crawler and information extractor for news that just works.
Nodespider[DEPRECATED] Simple, flexible, delightful web crawler/spider package
Vw Crawler🐞简单轻便的Java爬虫框架,只要会一点简单的正则表达式和简单的css选择器就能轻松的采集数据。
AutocrawlerGoogle, Naver multiprocess image web crawler (Selenium)
Onion CrawlerTor website crawler (specific for Alphabay at the time)
PypergrabberFetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.
AxegrinderCrawl websites for accessibility issues from the command line.
CcrawlSimple CORPORA list crawler