Top 228 scrapy open source projects

Distributed Multi User Scrapy System With A Web Ui
Django based application that allows creating, deploying and running Scrapy spiders in a distributed manner
Taiwan News Crawlers
Scrapy-based Crawlers for news of Taiwan
Email Extractor
The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url
Olxscraper
OLX Scraper in Python Scrapy
Capturer
capture pictures from website like sina, lofter, huaban and so on
Scrapy Examples
Some scrapy and web.py exmaples
Image Downloader
Download images from Google, Bing, Baidu. 谷歌、百度、必应图片下载.
Alipayspider Scrapy
AlipaySpider on Scrapy(use chrome driver); 支付宝爬虫(基于Scrapy)
Taobao duoshou
使用Scrapy采集淘宝数据,Flask展示
Terpene Profile Parser For Cannabis Strains
Parser and database to index the terpene profile of different strains of Cannabis from online databases
Warta Scrap
Indonesia Index News Crawler, including 10 online media
Scrapy S3pipeline
Scrapy pipeline to store chunked items into Amazon S3 or Google Cloud Storage bucket.
Scrapy Craigslist
Web Scraping Craigslist's Engineering Jobs in NY with Scrapy
Reptile
🏀 Python3 网络爬虫实战(部分含详细教程)猫眼 腾讯视频 豆瓣 研招网 微博 笔趣阁小说 百度热点 B站 CSDN 网易云阅读 阿里文学 百度股票 今日头条 微信公众号 网易云音乐 拉勾 有道 unsplash 实习僧 汽车之家 英雄联盟盒子 大众点评 链家 LPL赛程 台风 梦幻西游、阴阳师藏宝阁 天气 牛客网 百度文库 睡前故事 知乎 Wish
Scrapy Pyppeteer
Pyppeteer integration for Scrapy
Wescraper
依赖Scrapy和搜狗搜索微信公众号文章
Pixiv Crawler
Scrapy框架下的pixiv多功能爬虫
Django Dynamic Scraper
Creating Scrapy scrapers via the Django admin interface
Crawlab
Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Articlespider
慕课网python分布式爬虫源码-长期更新维护
App comments spider
爬取百度贴吧、TapTap、appstore、微博官方博主上的游戏评论(基于redis_scrapy),过滤器采用了bloomfilter。
Scrapymon
Simple Web UI for Scrapy spider management via Scrapyd
Jspider
JSpider会每周更新至少一个网站的JS解密方式,欢迎 Star,交流微信:13298307816
Scrapy Azuresearch Crawler Samples
Scrapy as a Web Crawler for Azure Search Samples
Voyages Sncf Api
A scrapy spider that scraps times and prices from Voyages Sncf. It uses scrapyrt to provide an API interface.
Scrapy Cluster
This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
Mailinglistscraper
A python web scraper for public email lists.
Pdf downloader
A Scrapy Spider for downloading PDF files from a webpage.
Scrapy Finance
[OUTDATED] scrapy spiders to crawl the financial text data 📚 📜 pertinent to train word vectors 🚀
Seeker
Seeker - another job board aggregator.
Py3 scripts
Life is short, *****.
Funpyspidersearchengine
Word2vec 千人千面 个性化搜索 + Scrapy2.3.0(爬取数据) + ElasticSearch7.9.1(存储数据并提供对外Restful API) + Django3.1.1 搜索
House Renting
Possibly the best practice of Scrapy 🕷 and renting a house 🏡
Jd spider
两只蠢萌京东的分布式爬虫.
Tweetscraper
TweetScraper is a simple crawler/spider for Twitter Search without using API
Webhubbot
Python + Scrapy + MongoDB . 5 million data per day !!!💥 The world's largest website.
Scrapyrt
HTTP API for Scrapy spiders
Icrawler
A multi-thread crawler framework with many builtin image crawlers provided.
Python Spider
豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章
Wechatsogou
基于搜狗微信搜索的微信公众号爬虫接口
Scrapy Selenium
Scrapy middleware to handle javascript pages using selenium
Scrapy Redis
Redis-based components for Scrapy.
Haipproxy
💖 High available distributed ip proxy pool, powerd by Scrapy and Redis
Scrapy Fake Useragent
Random User-Agent middleware based on fake-useragent
Scrapy Rotating Proxies
use multiple proxies with Scrapy
Scrapple
A framework for creating semi-automatic web content extractors
Scrapydouban
豆瓣电影/豆瓣读书 Scarpy 爬虫
Spiderman
基于 scrapy-redis 的通用分布式爬虫框架
Files
Docs and files for ScrapydWeb, Scrapyd, Scrapy, and other projects
✭ 390
scrapy
Advanced Web Scraping Tutorial
The Zipru scraper developed in the Advanced Web Scraping Tutorial.
E Commerce Crawlers
🚀电商网站爬虫合集,淘宝京东亚马逊等
Post Tuto Deployment
Build and deploy a machine learning app from scratch 🚀
Awesome Scrapy
A curated list of awesome packages, articles, and other cool resources from the Scrapy community.
61-120 of 228 scrapy projects