WebhubbotPython + Scrapy + MongoDB . 5 million data per day !!!💥 The world's largest website.
Stars: ✭ 5,427 (+4702.65%)
Reptile🏀 Python3 网络爬虫实战(部分含详细教程)猫眼 腾讯视频 豆瓣 研招网 微博 笔趣阁小说 百度热点 B站 CSDN 网易云阅读 阿里文学 百度股票 今日头条 微信公众号 网易云音乐 拉勾 有道 unsplash 实习僧 汽车之家 英雄联盟盒子 大众点评 链家 LPL赛程 台风 梦幻西游、阴阳师藏宝阁 天气 牛客网 百度文库 睡前故事 知乎 Wish
Stars: ✭ 1,048 (+827.43%)
ScrapyrtHTTP API for Scrapy spiders
Stars: ✭ 637 (+463.72%)
Wescraper依赖Scrapy和搜狗搜索微信公众号文章
Stars: ✭ 46 (-59.29%)
Email ExtractorThe main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url
Stars: ✭ 81 (-28.32%)
Yawf药方 Yet Another Weibo Filter 用户脚本,微博过滤和版面改造等 userscript, filter weibo and modify layout
Stars: ✭ 562 (+397.35%)
Xweibo💊 删除所有微博痕迹
Stars: ✭ 46 (-59.29%)
Hivelots of spider (很多爬虫)
Stars: ✭ 110 (-2.65%)
FbcrawlA Facebook crawler
Stars: ✭ 536 (+374.34%)
Haipproxy💖 High available distributed ip proxy pool, powerd by Scrapy and Redis
Stars: ✭ 4,993 (+4318.58%)
Capturercapture pictures from website like sina, lofter, huaban and so on
Stars: ✭ 76 (-32.74%)
Weibospider⚡ A distributed crawler for weibo, building with celery and requests.
Stars: ✭ 4,670 (+4032.74%)
CrawlabDistributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Stars: ✭ 8,392 (+7326.55%)
DotnetcrawlerDotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-11.5%)
Weibo AnalystSocial media (Weibo) comments analyzing toolbox in Chinese 微博评论分析工具, 实现功能: 1.微博评论数据爬取; 2.分词与关键词提取; 3.词云与词频统计; 4.情感分析; 5.主题聚类
Stars: ✭ 430 (+280.53%)
FilesDocs and files for ScrapydWeb, Scrapyd, Scrapy, and other projects
Stars: ✭ 390 (+245.13%)
ScrapymonSimple Web UI for Scrapy spider management via Scrapyd
Stars: ✭ 35 (-69.03%)
Place2liveAnalysis of the characteristics of different countries
Stars: ✭ 30 (-73.45%)
Xiaofendui薅羊毛小分队(第一时间尽知羊毛)- 微信机器人即时推送
Stars: ✭ 356 (+215.04%)
Image DownloaderDownload images from Google, Bing, Baidu. 谷歌、百度、必应图片下载.
Stars: ✭ 1,173 (+938.05%)
LinkedinLinkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy
Stars: ✭ 309 (+173.45%)
Proxy server crawleran awesome public proxy server crawler based on scrapy framework
Stars: ✭ 94 (-16.81%)
Laravel Socialite Social OAuth Authentication for Laravel 5. drivers: facebook, github, google, linkedin, weibo, qq, wechat and douban
Stars: ✭ 296 (+161.95%)
Lxspider爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、百度指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书》
Stars: ✭ 60 (-46.9%)
scrapyra simple & tiny scrapy clustering solution, considered a drop-in replacement for scrapyd
Stars: ✭ 50 (-55.75%)
Crawler爬虫, http代理, 模拟登陆!
Stars: ✭ 106 (-6.19%)
Pdf downloaderA Scrapy Spider for downloading PDF files from a webpage.
Stars: ✭ 18 (-84.07%)
WeiiWeibo client for Linux
Stars: ✭ 16 (-85.84%)
ARGUSARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
Stars: ✭ 68 (-39.82%)
ScrapoxyScrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!
Stars: ✭ 1,322 (+1069.91%)
Scrapy S3pipelineScrapy pipeline to store chunked items into Amazon S3 or Google Cloud Storage bucket.
Stars: ✭ 57 (-49.56%)
Lcactionsheet一款简约而不失强大的 ActionSheet,微博、微信和 QQ 都采用了极其类似的样式,完全支持 Swift。
Stars: ✭ 809 (+615.93%)
FunpyspidersearchengineWord2vec 千人千面 个性化搜索 + Scrapy2.3.0(爬取数据) + ElasticSearch7.9.1(存储数据并提供对外Restful API) + Django3.1.1 搜索
Stars: ✭ 782 (+592.04%)
ScralaUnmaintained 🐳 ☕️ 🕷 Scala crawler(spider) framework, inspired by scrapy, created by @gaocegege
Stars: ✭ 113 (+0%)
WswpCode for the second edition Web Scraping with Python book by Packt Publications
Stars: ✭ 112 (-0.88%)
Scrapyd Cluster On HerokuSet up free and scalable Scrapyd cluster for distributed web-crawling with just a few clicks. DEMO 👉
Stars: ✭ 106 (-6.19%)