Goose ParserUniversal scrapping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (+3.94%)
WebCrawler一个轻量级、快速、多线程、多管道、灵活配置的网络爬虫。
Stars: ✭ 39 (-80.79%)
slime🍰 一个可视化的爬虫平台
Stars: ✭ 27 (-86.7%)
galerA fast tool to fetch URLs from HTML attributes by crawl-in.
Stars: ✭ 138 (-32.02%)
Magic googleGoogle search results crawler, get google search results that you need
Stars: ✭ 247 (+21.67%)
Spoon🥄 A package for building specific Proxy Pool for different Sites.
Stars: ✭ 173 (-14.78%)
Crawlertutorial爬蟲極簡教學(fetch, parse, search, multiprocessing, API)- PTT 為例
Stars: ✭ 282 (+38.92%)
Hacker News Digest📰 A responsive interface of Hacker News with summaries and thumbnails.
Stars: ✭ 278 (+36.95%)
Zhihu Login知乎模拟登录,支持提取验证码和保存 Cookies
Stars: ✭ 340 (+67.49%)
ToapiEvery web site provides APIs.
Stars: ✭ 3,209 (+1480.79%)
Xcrawler快速、简洁且强大的PHP爬虫框架
Stars: ✭ 344 (+69.46%)
Signature algorithm各种App、小程序、网站的请求签名或加密算法。 现已有:自如、小红书、蛋壳公寓、luckin coffee(瑞幸咖啡)、bangkokair(曼谷航空)
Stars: ✭ 380 (+87.19%)
Spider Flow新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Stars: ✭ 365 (+79.8%)
Marmot💐Marmot | Web Crawler/HTTP protocol Download Package 🐭
Stars: ✭ 186 (-8.37%)
Webstera reliable high-level web crawling & scraping framework for Node.js.
Stars: ✭ 364 (+79.31%)
Fun crawlerCrawl some picture for fun
Stars: ✭ 169 (-16.75%)
ScrappleA framework for creating semi-automatic web content extractors
Stars: ✭ 464 (+128.57%)
Fictiondown小说下载|小说爬取|起点|笔趣阁|导出Markdown|导出txt|转换epub|广告过滤|自动校对
Stars: ✭ 362 (+78.33%)
FbcrawlA Facebook crawler
Stars: ✭ 536 (+164.04%)
XsrfprobeThe Prime Cross Site Request Forgery (CSRF) Audit and Exploitation Toolkit.
Stars: ✭ 532 (+162.07%)
DouyinAPI of DouYin for Humans used to Crawl Popular Videos and Musics
Stars: ✭ 580 (+185.71%)
Go jobs带你了解一下Golang的市场行情
Stars: ✭ 526 (+159.11%)
IcrawlerA multi-thread crawler framework with many builtin image crawlers provided.
Stars: ✭ 629 (+209.85%)
Creeper🐾 Creeper - The Next Generation Crawler Framework (Go)
Stars: ✭ 762 (+275.37%)
Ppspiderweb spider built by puppeteer, support task-queue and task-scheduling by decorators,support nedb / mongodb, support data visualization; 基于puppeteer的web爬虫框架,提供灵活的任务队列管理调度方案,提供便捷的数据保存方案(nedb/mongodb),提供数据可视化和用户交互的实现方案
Stars: ✭ 237 (+16.75%)
Lianjia Beike Spider链家网和贝壳网房价爬虫,采集北京上海广州深圳等21个中国主要城市的房价数据(小区,二手房,出租房,新房),稳定可靠快速!支持csv,MySQL, MongoDB,Excel, json存储,支持Python2和3,图表展示数据,注释丰富 ,点星支持,仅供学习参考,请勿用于商业用途,后果自负。
Stars: ✭ 2,257 (+1011.82%)
ScrapitScraping scripts for various websites.
Stars: ✭ 25 (-87.68%)
Zhihu Crawlerzhihu-crawler是一个基于Java的高性能、支持免费http代理池、支持横向扩展、分布式爬虫项目
Stars: ✭ 890 (+338.42%)
CrawlabDistributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Stars: ✭ 8,392 (+4033.99%)
Lizard💐 Full Amazon Automatic Download
Stars: ✭ 41 (-79.8%)
PhotonIncredibly fast crawler designed for OSINT.
Stars: ✭ 8,332 (+4004.43%)
TorbotDark Web OSINT Tool
Stars: ✭ 821 (+304.43%)
Spiderpython crawler spider
Stars: ✭ 70 (-65.52%)
ArachnidPowerful web scraping framework for Crystal
Stars: ✭ 68 (-66.5%)
BeanbunBeanbun 是用 PHP 编写的多进程网络爬虫框架,具有良好的开放性、高可扩展性,基于 Workerman。
Stars: ✭ 1,096 (+439.9%)
Gopa AbandonedGOPA, a spider written in Go.(NOTE: this project moved to https://github.com/infinitbyte/gopa )
Stars: ✭ 98 (-51.72%)
GeziyorGeziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (+513.79%)
Skycaiji蓝天采集器是一款免费的数据采集发布爬虫软件,采用php+mysql开发,可部署在云服务器,几乎能采集所有类型的网页,无缝对接各类CMS建站程序,免登录实时发布数据,全自动无需人工干预!是网页大数据采集软件中完全跨平台的云端爬虫系统
Stars: ✭ 1,514 (+645.81%)
GospiderGospider - Fast web spider written in Go
Stars: ✭ 785 (+286.7%)
Pkulaw spider爬取北大法宝网http://www.pkulaw.cn/Case/
Stars: ✭ 113 (-44.33%)
BaiduspiderBaiduSpider,一个爬取百度搜索结果的爬虫,目前支持百度网页搜索,百度图片搜索,百度知道搜索,百度视频搜索,百度资讯搜索,百度文库搜索,百度经验搜索和百度百科搜索。
Stars: ✭ 105 (-48.28%)
AbotCross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.
Stars: ✭ 1,961 (+866.01%)
Hivelots of spider (很多爬虫)
Stars: ✭ 110 (-45.81%)
Pspider简单易用的Python爬虫框架,QQ交流群:597510560
Stars: ✭ 1,611 (+693.6%)
Laravel Crawler DetectA Laravel wrapper for CrawlerDetect - the web crawler detection library
Stars: ✭ 227 (+11.82%)
CrawlerA high performance web crawler in Elixir.
Stars: ✭ 781 (+284.73%)
Crawler Detect🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent
Stars: ✭ 1,549 (+663.05%)
Mm131MM131网站图片爬取 🚨
Stars: ✭ 129 (-36.45%)
JlitespiderA lite distributed Java spider framework :-)
Stars: ✭ 151 (-25.62%)