crawler: A simple and flexible web crawler framework for Java.
Stars: ✭ 20 (-86.75%)
scrapy-distributed: A series of distributed components for Scrapy, including RabbitMQ-based, Kafka-based, and RedisBloom-based components.
Stars: ✭ 38 (-74.83%)
slime: 🍰 A visual web crawler platform
Stars: ✭ 27 (-82.12%)
core: Microservice abstract class
Stars: ✭ 37 (-75.5%)
NScrapy: NScrapy is a .NET Core cross-platform distributed spider framework that provides an easy way to write your own spider
Stars: ✭ 88 (-41.72%)
Oklog: A distributed and coördination-free log management system
Stars: ✭ 2,937 (+1845.03%)
Hacker News Digest: 📰 A responsive interface for Hacker News with summaries and thumbnails.
Stars: ✭ 278 (+84.11%)
Gospider: A crawler framework implemented in Go. Users only need to define page rules, and a web management interface is provided. Built on colly.
Stars: ✭ 285 (+88.74%)
Gopa: [WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+83.44%)
Dis Seckill: 👊 A distributed, high-concurrency flash-sale (seckill) system built with SpringBoot + Zookeeper + Dubbo
Stars: ✭ 315 (+108.61%)
Toapi: Every web site provides APIs.
Stars: ✭ 3,209 (+2025.17%)
Zhihu Login: Simulated Zhihu login, with support for extracting CAPTCHAs and saving cookies
Stars: ✭ 340 (+125.17%)
leek: Celery Tasks Monitoring Tool
Stars: ✭ 77 (-49.01%)
Fictiondown: Novel download | novel crawling | Qidian | Biquge | export to Markdown | export to txt | convert to epub | ad filtering | automatic proofreading
Stars: ✭ 362 (+139.74%)
Diplomat: An HTTP Ruby API for Consul
Stars: ✭ 358 (+137.09%)
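Signature algorithm: Request-signing or encryption algorithms for various apps, mini programs, and websites. Currently covers: Ziroom, Xiaohongshu, Danke Apartment, luckin coffee, and bangkokair (Bangkok Airways)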
Stars: ✭ 380 (+151.66%)
Freshonions Torscraper: Fresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Stars: ✭ 348 (+130.46%)
Gosint: OSINT Swiss Army Knife
Stars: ✭ 401 (+165.56%)
Libvineyard: an in-memory immutable data manager.
Stars: ✭ 392 (+159.6%)
Mm131: Image crawler for the MM131 website 🚨
Stars: ✭ 129 (-14.57%)
Xcrawler: A fast, concise, and powerful PHP crawler framework
Stars: ✭ 344 (+127.81%)
Awesome Crawler: A collection of awesome web crawlers and spiders in different languages
Stars: ✭ 4,793 (+3074.17%)
Hazelcast: Open-source distributed computation and storage platform
Stars: ✭ 4,662 (+2987.42%)
Learnpython: Basic Python practice code and various crawler code
Stars: ✭ 451 (+198.68%)
Fbcrawl: A Facebook crawler
Stars: ✭ 536 (+254.97%)
Scrapy Redis: Redis-based components for Scrapy.
Stars: ✭ 4,998 (+3209.93%)
moqui-hazelcast: Moqui Framework tool component for Hazelcast, used for distributed async services, entity distributed cache invalidation, web session replication, and distributed cache (javax.cache)
Stars: ✭ 12 (-92.05%)
Examples Of Web Crawlers: Some interesting examples of Python crawlers that are friendly to beginners, mainly crawling sites such as Taobao, Tmall, WeChat, Douban, and QQ.
Stars: ✭ 10,724 (+7001.99%)
Grab Site: The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Stars: ✭ 680 (+350.33%)
Spidr: A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+334.44%)
Crawler: A high performance web crawler in Elixir.
Stars: ✭ 781 (+417.22%)
Icrawler: A multi-threaded crawler framework with many built-in image crawlers.
Stars: ✭ 629 (+316.56%)
Zhihu Crawler: zhihu-crawler is a high-performance, Java-based distributed crawler project that supports a free HTTP proxy pool and horizontal scaling
Stars: ✭ 890 (+489.4%)
Torbot: Dark Web OSINT Tool
Stars: ✭ 821 (+443.71%)
Disec: Distributed Image Search Engine Crawler
Stars: ✭ 11 (-92.72%)
Decryptlogin: APIs for logging in to some websites using requests.
Stars: ✭ 1,861 (+1132.45%)
Maman: Rust Web Crawler saving pages on Redis
Stars: ✭ 39 (-74.17%)
Photon: Incredibly fast crawler designed for OSINT.
Stars: ✭ 8,332 (+5417.88%)
Nodespider: [DEPRECATED] Simple, flexible, delightful web crawler/spider package
Stars: ✭ 33 (-78.15%)
Beanbun: Beanbun is a multi-process web crawler framework written in PHP, with good openness and high extensibility, built on Workerman.
Stars: ✭ 1,096 (+625.83%)
Car Prices: A Golang crawler that scrapes the Autohome used-car product catalog
Stars: ✭ 57 (-62.25%)
Newcrawler: Free Web Scraping Tool with Java
Stars: ✭ 589 (+290.07%)
Gopa Abandoned: GOPA, a spider written in Go. (NOTE: this project moved to https://github.com/infinitbyte/gopa)
Stars: ✭ 98 (-35.1%)
Foundatio: Pluggable foundation blocks for building distributed apps.
Stars: ✭ 1,365 (+803.97%)
Storj: Ongoing Storj v3 development. Decentralized cloud object storage that is affordable, easy to use, private, and secure.
Stars: ✭ 1,278 (+746.36%)
Crawler Detect: 🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent
Stars: ✭ 1,549 (+925.83%)
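Baiduspider: BaiduSpider, a crawler for Baidu search results. It currently supports Baidu web search, image search, Zhidao (Q&A) search, video search, news search, Wenku (document library) search, Jingyan (how-to) search, and Baike (encyclopedia) search.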
Stars: ✭ 105 (-30.46%)
Geziyor: Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (+725.17%)
Sandglass: Sandglass is a distributed, horizontally scalable, persistent, time sorted message queue.
Stars: ✭ 1,531 (+913.91%)
Magic google: Google search results crawler that fetches the Google search results you need
Stars: ✭ 247 (+63.58%)
nebula: A distributed, fast open-source graph database featuring horizontal scalability and high availability
Stars: ✭ 8,196 (+5327.81%)
Douyin: API of DouYin for Humans, used to crawl popular videos and music
Stars: ✭ 580 (+284.11%)