Top 228 scrapy open source projects

Elves
🎊 Design and implement of lightweight crawler framework.
Linkedin
Linkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy
Scrapy Crawlera
Crawlera middleware for Scrapy
Alltheplaces
A set of spiders and scrapers to extract location information from places that post their location on the internet.
Happy Spiders
🔧 🔩 🔨 收集整理了爬虫相关的工具、模拟登陆技术、代理IP、scrapy模板代码等内容。
Tieba spider
百度贴吧爬虫(基于scrapy和mysql)
Douban Crawler
Uno Crawler por https://douban.com
ip proxy pool
Generating spiders dynamically to crawl and check those free proxy ip on the internet with scrapy.
PttImageSpider
PTT 圖片下載器 (抓取整個看板的圖片,並用文章標題作為資料夾的名稱 ) (使用Scrapy)
ARGUS
ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
scrapyr
a simple & tiny scrapy clustering solution, considered a drop-in replacement for scrapyd
toutiao
今日头条科技新闻接口爬虫
policy-data-analyzer
Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.
memes-api
API for scrapping common meme sites
douban-spider
基于Scrapy框架的豆瓣电影爬虫
scrapy-pipelines
A collection of pipelines for Scrapy
allitebooks.com
Download all the ebooks with indexed csv of "allitebooks.com"
pythonSpider
🕷️some python spiders with BeautifulSoup or scarpy
scrapy-zyte-smartproxy
Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy
XMQ-BackUp
小密圈备份,圈子/话题/图片/文件。
scrapy xiuren
秀人网爬虫 55156爬虫
GPlayCrawler
No description or website provided.
scrapy-admin
A django admin site for scrapy
scrapy facebooker
Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
hk0weather
Web scraper project to collect the useful Hong Kong weather data from HKO website
ImageGrabber
A Scrapy demo : Download all images from a site
restaurant-finder-featureReviews
Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
Scrapy-Spiders
一个基于Scrapy的数据采集爬虫代码库
BOC FER Spider
Use Scrapy crawl foreign exchange rate from BOC (Bank of China)
NovelCrawler
基于Scrapy的爬虫demo
JustDownlink
基于Scrapy+Elasticsearch+Django搭建的分布式电影搜索
python-fxxk-spider
收集各种免费的 Python 爬虫项目
python-spider
python爬虫小项目【持续更新】【笔趣阁小说下载、Tweet数据抓取、天气查询、网易云音乐逆向、天天基金网查询、微博数据抓取(生成cookie)、有道翻译逆向、企查查免登陆爬虫、大众点评svg加密破解、B站用户爬虫、拉钩免登录爬虫、自如租房字体加密、知乎问答
proxi
Proxy pool. Finds and checks proxies with rest api for querying results. Can find over 25k proxies in under 5 minutes.
scraping-ebay
Scraping Ebay's products using Scrapy Web Crawling Framework
logparser
A tool for parsing Scrapy log files periodically and incrementally, extending the HTTP JSON API of Scrapyd.
scrapy-distributed
A series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy.
IMDB-Scraper
Scrapy project for scraping data from IMDB with Movie Dataset including 58,623 movies' data.
OpenScraper
An open source webapp for scraping: towards a public service for webscraping
factory
Docker microservice & Crawler by scrapy
scrapy.dart
Scrapy, a fast high-level web crawling & scraping framework for dart and Flutter
Scrapy IPProxyPool
免费 IP 代理池。Scrapy 爬虫框架插件
elves
🎊 Design and implement of lightweight crawler framework.
scrapy-cookies
A middleware of cookies persistence for Scrapy
163Music
163music spider by scrapy.
121-180 of 228 scrapy projects