DecryptloginAPIs for loginning some websites by using requests.
Stars: ✭ 1,861 (+447.35%)
Nodespider[DEPRECATED] Simple, flexible, delightful web crawler/spider package
Stars: ✭ 33 (-90.29%)
Zhihu Crawlerzhihu-crawler是一个基于Java的高性能、支持免费http代理池、支持横向扩展、分布式爬虫项目
Stars: ✭ 890 (+161.76%)
FbcrawlA Facebook crawler
Stars: ✭ 536 (+57.65%)
Proxy poolPython爬虫代理IP池(proxy pool)
Stars: ✭ 13,964 (+4007.06%)
Grab SiteThe archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Stars: ✭ 680 (+100%)
Python3 SpiderPython爬虫实战 - 模拟登陆各大网站 包含但不限于:滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝,如果喜欢请start ❤️
Stars: ✭ 2,129 (+526.18%)
Marmot💐Marmot | Web Crawler/HTTP protocol Download Package 🐭
Stars: ✭ 186 (-45.29%)
Crawlertutorial爬蟲極簡教學(fetch, parse, search, multiprocessing, API)- PTT 為例
Stars: ✭ 282 (-17.06%)
Fooproxy稳健高效的评分制-针对性- IP代理池 + API服务,可以自己插入采集器进行代理IP的爬取,针对你的爬虫的一个或多个目标网站分别生成有效的IP代理数据库,支持MongoDB 4.0 使用 Python3.7(Scored IP proxy pool ,customise proxy data crawler can be added anytime)
Stars: ✭ 195 (-42.65%)
Goribot[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。
Stars: ✭ 190 (-44.12%)
ToapiEvery web site provides APIs.
Stars: ✭ 3,209 (+843.82%)
Laravel Crawler DetectA Laravel wrapper for CrawlerDetect - the web crawler detection library
Stars: ✭ 227 (-33.24%)
Magic googleGoogle search results crawler, get google search results that you need
Stars: ✭ 247 (-27.35%)
JssoupJavaScript + BeautifulSoup = JSSoup
Stars: ✭ 203 (-40.29%)
Ppspiderweb spider built by puppeteer, support task-queue and task-scheduling by decorators,support nedb / mongodb, support data visualization; 基于puppeteer的web爬虫框架,提供灵活的任务队列管理调度方案,提供便捷的数据保存方案(nedb/mongodb),提供数据可视化和用户交互的实现方案
Stars: ✭ 237 (-30.29%)
SpidersPython爬虫,返回一定格式的信息,下载,使用flask提供简易api。抖音无水印、皮皮虾、快手、网易云音乐、qq音乐、咪咕音乐、荔枝FM音频、知乎视频、最右语音、视频、微博......
Stars: ✭ 372 (+9.41%)
Fuck Login模拟登录一些知名的网站,为了方便爬取需要登录的网站
Stars: ✭ 5,729 (+1585%)
Zhihu fun基于 Selenium 的知乎关键词爬虫
Stars: ✭ 185 (-45.59%)
Awesome crawl腾讯新闻、知乎话题、微博粉丝,Tumblr爬虫、斗鱼弹幕、妹子图爬虫、分布式设计等
Stars: ✭ 246 (-27.65%)
gospider⚡ Light weight Golang spider framework | 轻量的 Golang 爬虫框架
Stars: ✭ 183 (-46.18%)
Weixin Spider微信公众号爬虫,公众号历史文章,文章评论,文章阅读及在看数据,可视化web页面,可部署于Windows服务器。基于Python3之flask/mysql/redis/mitmproxy/pywin32等实现,高效微信爬虫,微信公众号爬虫,历史文章,文章评论,数据更新。
Stars: ✭ 287 (-15.59%)
Spoon🥄 A package for building specific Proxy Pool for different Sites.
Stars: ✭ 173 (-49.12%)
Lianjia Beike Spider链家网和贝壳网房价爬虫,采集北京上海广州深圳等21个中国主要城市的房价数据(小区,二手房,出租房,新房),稳定可靠快速!支持csv,MySQL, MongoDB,Excel, json存储,支持Python2和3,图表展示数据,注释丰富 ,点星支持,仅供学习参考,请勿用于商业用途,后果自负。
Stars: ✭ 2,257 (+563.82%)
Linkedin Profile Scraper🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-49.71%)
Gospidergolang实现的爬虫框架,使用者只需关心页面规则,提供web管理界面。基于colly开发。
Stars: ✭ 285 (-16.18%)
Google Group CrawlerGet (almost) original messages from google group archives. Your data is yours.
Stars: ✭ 190 (-44.12%)
Querylist🕷️ The progressive PHP crawler framework! 优雅的渐进式PHP采集框架。
Stars: ✭ 2,392 (+603.53%)
fetchurlsA bash script to spider a site, follow links, and fetch urls (with built-in filtering) into a generated text file.
Stars: ✭ 97 (-71.47%)
Jd mask robot京东口罩库存监控爬虫(非selenium),扫码登录、查价、加购、下单、秒杀
Stars: ✭ 216 (-36.47%)
Hacker News Digest📰 A responsive interface of Hacker News with summaries and thumbnails.
Stars: ✭ 278 (-18.24%)
CollyElegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+4469.12%)
Gopa[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-18.53%)
GainWeb crawling framework based on asyncio.
Stars: ✭ 2,002 (+488.82%)
zhihu-crawler徒手实现定时爬取知乎,从中发掘有价值的信息,并可视化爬取的数据作网页展示。
Stars: ✭ 56 (-83.53%)
gathertoolgathertool是golang脚本化开发库,目的是提高对应场景程序开发的效率;轻量级爬虫库,接口测试&压力测试库,DB操作库等。
Stars: ✭ 36 (-89.41%)
wget-luaWget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-84.71%)
crawlerA simple and flexible web crawler framework for java.
Stars: ✭ 20 (-94.12%)
Geetestgeetest,滑动验证码
Stars: ✭ 293 (-13.82%)
Bt Btt磁力網站U3C3介紹以及域名更新
Stars: ✭ 261 (-23.24%)
arachnodHigh performance crawler for Nodejs
Stars: ✭ 17 (-95%)
CockyGrabberC# library for the collection of browser information such as cookies, logins, and more
Stars: ✭ 46 (-86.47%)
flink-crawlerContinuous scalable web crawler built on top of Flink and crawler-commons
Stars: ✭ 48 (-85.88%)
slime🍰 一个可视化的爬虫平台
Stars: ✭ 27 (-92.06%)
BitextorBitextor generates translation memories from multilingual websites.
Stars: ✭ 168 (-50.59%)
Fun crawlerCrawl some picture for fun
Stars: ✭ 169 (-50.29%)