ARGUS is an easy-to-use web scraping tool based on the Scrapy Python framework. It can crawl a broad range of websites and perform tasks such as scraping text or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9

Stars: ✭ 68 (-41.88%)

Mutual labels: scrapy

Decoration Design Crawler

Crawler for the home-decoration websites Tubatu (土巴兔) and Guju (谷居)

Stars: ✭ 105 (-10.26%)

Mutual labels: scrapy

toutiao

Crawler for the Toutiao (今日头条) technology news API

Stars: ✭ 17 (-85.47%)

Mutual labels: scrapy

Scrapy Finance

[OUTDATED] Scrapy spiders to crawl financial text data 📚 📜 for training word vectors 🚀

Stars: ✭ 17 (-85.47%)

Mutual labels: scrapy

memes-api

API for scraping common meme sites

Stars: ✭ 17 (-85.47%)

Mutual labels: scrapy

Alipayspider Scrapy

AlipaySpider on Scrapy (uses Chrome driver); an Alipay crawler based on Scrapy

Stars: ✭ 70 (-40.17%)

Mutual labels: scrapy

ptt-web-crawler

Crawler for the web version of PTT

Stars: ✭ 20 (-82.91%)

Mutual labels: scrapy

Py3 scripts

Life is short, *****.

Stars: ✭ 5 (-95.73%)

Mutual labels: scrapy

dannyAVgleDownloader

Downloader for the well-known website avgle

Stars: ✭ 27 (-76.92%)

Mutual labels: scrapy

Patentcrawler

Scrapy patent crawler (no longer maintained)

Stars: ✭ 114 (-2.56%)

Mutual labels: scrapy

SpiderManager

Crawler management platform

Stars: ✭ 27 (-76.92%)

Mutual labels: scrapy

House Renting

Possibly the best practice of Scrapy 🕷 and renting a house 🏡

Stars: ✭ 741 (+533.33%)

Mutual labels: scrapy

pythonSpider

🕷️ Some Python spiders using BeautifulSoup or Scrapy

Stars: ✭ 28 (-76.07%)

Mutual labels: scrapy

Terpene Profile Parser For Cannabis Strains

Parser and database to index the terpene profile of different strains of Cannabis from online databases

Stars: ✭ 63 (-46.15%)

Mutual labels: scrapy

XMQ-BackUp

Backup tool for Xiaomiquan (小密圈): groups, topics, images, and files.

Stars: ✭ 22 (-81.2%)

Mutual labels: scrapy

Tweetscraper

TweetScraper is a simple crawler/spider for Twitter Search that does not use the API

Stars: ✭ 694 (+493.16%)

Mutual labels: scrapy

GPlayCrawler

No description or website provided.

Stars: ✭ 47 (-59.83%)

Mutual labels: scrapy

Experiments

Some research experiments

Stars: ✭ 95 (-18.8%)

Mutual labels: scrapy

scrapy facebooker

Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.

Stars: ✭ 22 (-81.2%)

Mutual labels: scrapy

Faster Than Requests

Faster requests on Python 3

Stars: ✭ 639 (+446.15%)

Mutual labels: scrapy

V2EX Spider

V2EX crawler

Stars: ✭ 21 (-82.05%)

Mutual labels: scrapy

Scrapy S3pipeline

Scrapy pipeline to store chunked items into Amazon S3 or Google Cloud Storage bucket.

Stars: ✭ 57 (-51.28%)

Mutual labels: scrapy

restaurant-finder-featureReviews

Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).

Stars: ✭ 21 (-82.05%)

Mutual labels: scrapy

Icrawler

A multi-thread crawler framework with many builtin image crawlers provided.

Stars: ✭ 629 (+437.61%)

Mutual labels: scrapy

BOC FER Spider

Uses Scrapy to crawl foreign exchange rates from BOC (Bank of China)

Stars: ✭ 18 (-84.62%)

Mutual labels: scrapy

Wswp

Code for the second edition of the book Web Scraping with Python by Packt Publications

Stars: ✭ 112 (-4.27%)

Mutual labels: scrapy

JustDownlink

Distributed movie search built on Scrapy + Elasticsearch + Django

Stars: ✭ 28 (-76.07%)

Mutual labels: scrapy

Pythonspidernotes

The essentials of getting started with web crawlers in Python

Stars: ✭ 5,634 (+4715.38%)

Mutual labels: scrapy

python-fxxk-spider

A collection of various free Python crawler projects

Stars: ✭ 184 (+57.26%)

Mutual labels: scrapy

Reptile

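🏀 Python3 web crawlers in practice (some with detailed tutorials): Maoyan, Tencent Video, Douban, Yanzhao (postgraduate admissions site), Weibo, Biquge novels, Baidu hot topics, Bilibili, CSDN, NetEase Cloud Reading, Ali Literature, Baidu Stocks, Toutiao, WeChat official accounts, NetEase Cloud Music, Lagou, Youdao, unsplash, Shixiseng, Autohome, League of Legends box, Dianping, Lianjia, LPL schedule, typhoons, Fantasy Westward Journey and Onmyoji treasure pavilion (藏宝阁), weather, Nowcoder, Baidu Wenku, bedtime stories, Zhihu, Wish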

Stars: ✭ 1,048 (+795.73%)

Mutual labels: scrapy

Data-Engineering-Projects

Personal Data Engineering Projects

Stars: ✭ 167 (+42.74%)

Mutual labels: scrapy

Wechatsogou

Crawler interface for WeChat official accounts, based on Sogou WeChat search

Stars: ✭ 5,220 (+4361.54%)

Mutual labels: scrapy

scraping-ebay

Scraping Ebay's products using Scrapy Web Crawling Framework

Stars: ✭ 79 (-32.48%)

Mutual labels: scrapy

Scrapoxy

Scrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!

Stars: ✭ 1,322 (+1029.91%)

Mutual labels: scrapy

Scrapy Selenium

Scrapy middleware to handle JavaScript pages using Selenium

Stars: ✭ 550 (+370.09%)

Mutual labels: scrapy

Maria Quiteria

Backend for collecting and publishing the data 📜

Stars: ✭ 115 (-1.71%)

Mutual labels: scrapy

Weibo hot search

Weibo crawler: crawls the Weibo hot search list on a daily schedule, preserving the memories of internet users.