python-spiderpython爬虫小项目【持续更新】【笔趣阁小说下载、Tweet数据抓取、天气查询、网易云音乐逆向、天天基金网查询、微博数据抓取(生成cookie)、有道翻译逆向、企查查免登陆爬虫、大众点评svg加密破解、B站用户爬虫、拉钩免登录爬虫、自如租房字体加密、知乎问答
QQSpider爬取QQ用户信息(qq号、昵称、生日、地址等基本信息)并做简要analysis。
SpydanA web spider for shodan.io without using the Developer API.
scrapy-distributedA series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy.
douban-movieGet movie info from douban(豆瓣) and display in your terminal
OpenScraperAn open source webapp for scraping: towards a public service for webscraping
ChineseStarsRelationship中国明星数据爬取。你甚至可以拿到互联网上所有的人之间的关系,接下来你可以自己发挥!基于这些数据,你可以完成更多有趣的事情。比如说社交网络分析,关系网络可视化,算法研究,和其他有意思的事情。Chinese star data crawling. You can even get all the people on the internet! Based on these data, you can do more interesting things. For example, social network analysis, relational network visualization, algorithm research, and other interesting things.
elves🎊 Design and implement of lightweight crawler framework.
spider🌟 powered by python3( simple learning of spider) 百度文库;网易云歌曲; 豆瓣电影; GitHub; 京东; QQ空间; 天气; vip解析助手; TED文本内容; wifi破解脚本; 必应图片设置为桌面等爬取
wget-luaWget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
qa😚 Q & A website based on Spring Boot.
Sina Spider新浪爬虫,基于Python+Selenium。模拟登陆后保存cookie,实现登录状态的保存。可以通过输入关键词来爬取到关键词相关的热门微博。
SpiderSpider项目将会不断更新本人学习使用过的爬虫方法!!!
weibo topic微博话题关键词,个人微博采集, 微博博文一键删除 selenium获取cookie,requests处理
devsearchA web search engine built with Python which uses TF-IDF and PageRank to sort search results.
Shadow计算机基础知识,数据结构,设计模式,Tomcat中间件的实现
NScrapyNScrapy is a .net core corss platform Distributed Spider Framework which provide an easy way to write your own Spider
ICP-CheckerICP备案查询,可查询企业或域名的ICP备案信息,自动完成滑动验证,保存结果到Excel表格,适用于2022年新版的工信部备案管理系统网站,告别频繁拖动验证,以及某站*工具要开通VIP才可查看备案信息的坑
robotstxtrobots.txt file parsing and checking for R
yutto🧊 一个可爱且任性的 B 站视频下载器(bilili V2)
get LibSeat利昂图书馆预约系统自动预约&签到程序。支持包括中国人民大学、北京师范大学、济南大学、哈尔滨工业大学等在内的38所高校的图书馆系统
fetchurlsA bash script to spider a site, follow links, and fetch urls (with built-in filtering) into a generated text file.
MoMo利用墨墨背单词的分享功能拿每日20个的单词上限奖励(多线程
goSpidersome small project and some articles
feaplat爬虫管理系统,支持集群,弹性伸缩。支持运行feapder、scrapy、selenium、playwright等各种框架及脚本
crawlerdetectGolang module to detect bots and crawlers via the user agent
scraper图片爬取下载工具,极速爬取下载 站酷https://www.zcool.com.cn/, CNU 视觉 http://www.cnu.cc/ 设计师/用户 上传的 图片/照片/插画。
seenreqGenerate an object for testing if a request is sent, request is Mikeal's request.
spiderpython 爬虫(amazon, confluence ...)