Email ExtractorThe main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url
Capturercapture pictures from website like sina, lofter, huaban and so on
Warta ScrapIndonesia Index News Crawler, including 10 online media
Scrapy S3pipelineScrapy pipeline to store chunked items into Amazon S3 or Google Cloud Storage bucket.
Reptile🏀 Python3 网络爬虫实战(部分含详细教程)猫眼 腾讯视频 豆瓣 研招网 微博 笔趣阁小说 百度热点 B站 CSDN 网易云阅读 阿里文学 百度股票 今日头条 微信公众号 网易云音乐 拉勾 有道 unsplash 实习僧 汽车之家 英雄联盟盒子 大众点评 链家 LPL赛程 台风 梦幻西游、阴阳师藏宝阁 天气 牛客网 百度文库 睡前故事 知乎 Wish
CrawlabDistributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
ScrapymonSimple Web UI for Scrapy spider management via Scrapyd
Place2liveAnalysis of the characteristics of different countries
JspiderJSpider会每周更新至少一个网站的JS解密方式,欢迎 Star,交流微信:13298307816
Voyages Sncf ApiA scrapy spider that scraps times and prices from Voyages Sncf. It uses scrapyrt to provide an API interface.
Scrapy ClusterThis Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
Pdf downloaderA Scrapy Spider for downloading PDF files from a webpage.
Scrapy Finance[OUTDATED] scrapy spiders to crawl the financial text data 📚 📜 pertinent to train word vectors 🚀
SeekerSeeker - another job board aggregator.
FunpyspidersearchengineWord2vec 千人千面 个性化搜索 + Scrapy2.3.0(爬取数据) + ElasticSearch7.9.1(存储数据并提供对外Restful API) + Django3.1.1 搜索
House RentingPossibly the best practice of Scrapy 🕷 and renting a house 🏡
TweetscraperTweetScraper is a simple crawler/spider for Twitter Search without using API
WebhubbotPython + Scrapy + MongoDB . 5 million data per day !!!💥 The world's largest website.
IcrawlerA multi-thread crawler framework with many builtin image crawlers provided.
Python Spider豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章
Haipproxy💖 High available distributed ip proxy pool, powerd by Scrapy and Redis
ScrappleA framework for creating semi-automatic web content extractors
FilesDocs and files for ScrapydWeb, Scrapyd, Scrapy, and other projects
Awesome ScrapyA curated list of awesome packages, articles, and other cool resources from the Scrapy community.
Vaultswiss army knife for hackers