zhihu-crawler徒手实现定时爬取知乎,从中发掘有价值的信息,并可视化爬取的数据作网页展示。
Stars: ✭ 56 (-35.63%)
FunpyspidersearchengineWord2vec 千人千面 个性化搜索 + Scrapy2.3.0(爬取数据) + ElasticSearch7.9.1(存储数据并提供对外Restful API) + Django3.1.1 搜索
Stars: ✭ 782 (+798.85%)
zhihu搜索你的知乎收藏:可以直观地浏览你的所有收藏夹的内容,并进行全文搜索
Stars: ✭ 39 (-55.17%)
DecryptloginAPIs for loginning some websites by using requests.
Stars: ✭ 1,861 (+2039.08%)
Zhihu Login知乎模拟登录,支持提取验证码和保存 Cookies
Stars: ✭ 340 (+290.8%)
SpidersPython爬虫,返回一定格式的信息,下载,使用flask提供简易api。抖音无水印、皮皮虾、快手、网易云音乐、qq音乐、咪咕音乐、荔枝FM音频、知乎视频、最右语音、视频、微博......
Stars: ✭ 372 (+327.59%)
Zhihu Crawlerzhihu-crawler是一个基于Java的高性能、支持免费http代理池、支持横向扩展、分布式爬虫项目
Stars: ✭ 890 (+922.99%)
CrawlabDistributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Stars: ✭ 8,392 (+9545.98%)
Super Spider根据腾讯安全应急响应中心的架构编写的一款超强爬虫(广度优先搜索)
Stars: ✭ 48 (-44.83%)
Novel Plus小说精品屋-plus是一个多端(PC、WAP)阅读、功能完善的原创文学CMS系统,由前台门户系统、作家后台管理系统、平台后台管理系统、爬虫管理系统等多个子系统构成,支持多模版、会员充值、订阅模式、新闻发布和实时统计报表等功能,新书自动入库,老书自动更新。
Stars: ✭ 1,122 (+1189.66%)
PhotonIncredibly fast crawler designed for OSINT.
Stars: ✭ 8,332 (+9477.01%)
Spiderpython crawler spider
Stars: ✭ 70 (-19.54%)
Zhihu仿知乎网站
Stars: ✭ 60 (-31.03%)
MamanRust Web Crawler saving pages on Redis
Stars: ✭ 39 (-55.17%)
App comments spider爬取百度贴吧、TapTap、appstore、微博官方博主上的游戏评论(基于redis_scrapy),过滤器采用了bloomfilter。
Stars: ✭ 38 (-56.32%)
Xinahn Socket一个开源,高隐私,自架自用的聚合搜索引擎。 https://xinahn.com
Stars: ✭ 77 (-11.49%)
Alipayspider ScrapyAlipaySpider on Scrapy(use chrome driver); 支付宝爬虫(基于Scrapy)
Stars: ✭ 70 (-19.54%)
BeanbunBeanbun 是用 PHP 编写的多进程网络爬虫框架,具有良好的开放性、高可扩展性,基于 Workerman。
Stars: ✭ 1,096 (+1159.77%)
Nodespider[DEPRECATED] Simple, flexible, delightful web crawler/spider package
Stars: ✭ 33 (-62.07%)
JspiderJSpider会每周更新至少一个网站的JS解密方式,欢迎 Star,交流微信:13298307816
Stars: ✭ 914 (+950.57%)
Awesome Python Primer自学入门 Python 优质中文资源索引,包含 书籍 / 文档 / 视频,适用于 爬虫 / Web / 数据分析 / 机器学习 方向
Stars: ✭ 57 (-34.48%)
BlackwidowA Python based web application scanner to gather OSINT and fuzz for OWASP vulnerabilities on a target website.
Stars: ✭ 887 (+919.54%)
Reptile🏀 Python3 网络爬虫实战(部分含详细教程)猫眼 腾讯视频 豆瓣 研招网 微博 笔趣阁小说 百度热点 B站 CSDN 网易云阅读 阿里文学 百度股票 今日头条 微信公众号 网易云音乐 拉勾 有道 unsplash 实习僧 汽车之家 英雄联盟盒子 大众点评 链家 LPL赛程 台风 梦幻西游、阴阳师藏宝阁 天气 牛客网 百度文库 睡前故事 知乎 Wish
Stars: ✭ 1,048 (+1104.6%)
AbotxCross Platform C# Web crawler framework, headless browser, parallel crawler. Please star this project! +1.
Stars: ✭ 63 (-27.59%)
Python crawlerIt's designed to be a simple, tiny, pratical python crawler using json and sqlite instead of mysql or mongdb. The destination website is Zhihu.com.
Stars: ✭ 45 (-48.28%)
Image DownloaderDownload images from Google, Bing, Baidu. 谷歌、百度、必应图片下载.
Stars: ✭ 1,173 (+1248.28%)
FbiwarningNode.js seed downloader (Node.js 种子神器)
Stars: ✭ 44 (-49.43%)
T66y spiderPython多线程下载 草榴(t66y.com) 网站【新時代的我們】和【達蓋爾的旗幟】两个板块帖子内的图片
Stars: ✭ 62 (-28.74%)
AvbookAV 电影管理系统, avmoo , javbus , javlibrary 爬虫,线上 AV 影片图书馆,AV 磁力链接数据库,Japanese Adult Video Library,Adult Video Magnet Links - Japanese Adult Video Database
Stars: ✭ 8,133 (+9248.28%)
Lizard💐 Full Amazon Automatic Download
Stars: ✭ 41 (-52.87%)
Test demoTesting Using Python Demo. 使用Python测试脚本demo。
Stars: ✭ 60 (-31.03%)
TspiderYet Another Web Spider
Stars: ✭ 70 (-19.54%)
Zhihuapi PyUnofficial API for zhihu.
Stars: ✭ 39 (-55.17%)
GlyphhangerYour web font utility belt. It can subset web fonts. It can find unicode-ranges for you automatically. It makes julienne fries.
Stars: ✭ 1,099 (+1163.22%)
NetcloudNetCloud Web Spider
Stars: ✭ 37 (-57.47%)
GeziyorGeziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (+1332.18%)
SpiderA configurable web spider with a easy-to-use web console
Stars: ✭ 954 (+996.55%)
Car PricesGolang爬虫 爬取汽车之家 二手车产品库
Stars: ✭ 57 (-34.48%)
Zhihu ApiZhihu API for Humans
Stars: ✭ 911 (+947.13%)
ArachnidPowerful web scraping framework for Crystal
Stars: ✭ 68 (-21.84%)
Go DemoGo语言实例教程从入门到进阶,包括基础库使用、设计模式、面试易错点、工具类、对接第三方等
Stars: ✭ 881 (+912.64%)
PholcusPholcus is a distributed high-concurrency crawler software written in pure golang
Stars: ✭ 6,990 (+7934.48%)
Capturercapture pictures from website like sina, lofter, huaban and so on
Stars: ✭ 76 (-12.64%)
AntcolonyNodejs实现的一个磁力链接爬虫 http://findit.keenwon.com (原域名http://findit.so )
Stars: ✭ 1,151 (+1222.99%)
BtletSome toolkits implements part of BT Protocol, like DHT spider.
Stars: ✭ 54 (-37.93%)
EasyloginA python3 package for writing spider more easily.
Stars: ✭ 26 (-70.11%)
Go spiderA golang spider
Stars: ✭ 25 (-71.26%)
Gotoolscreate some tools use go lang.
Stars: ✭ 54 (-37.93%)
ScrapitScraping scripts for various websites.
Stars: ✭ 25 (-71.26%)
Paperplane📚 PaperPlane - An Android reading app, including articles from Zhihu Daily, Guokr Handpick and Douban Moment.
Stars: ✭ 1,147 (+1218.39%)