All Projects → wx-chevalier → Sentinel Crawler

wx-chevalier / Sentinel Crawler

Licence: mit
Xenomorph Crawler, a Concise, Declarative and Observable Distributed Crawler(Node / Go / Java / Rust) For Web, RDB, OS, also can act as a Monitor(with Prometheus) or ETL for Infrastructure 💫 多语言执行器,分布式爬虫

Programming Languages

javascript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to Sentinel Crawler

Pkulaw spider
爬取北大法宝网http://www.pkulaw.cn/Case/
Stars: ✭ 113 (-4.24%)
Mutual labels:  crawler
Prerender Java
java framework for prerender
Stars: ✭ 115 (-2.54%)
Mutual labels:  crawler
Opcache Dashboard
Set up properly and monitor your Zend OPcache with this dashboard that will help you checking memory, hits and status, configuring for optimal performance (warning you when cache full, validation, etc.) and reseting one or all scripts with one click.
Stars: ✭ 117 (-0.85%)
Mutual labels:  monitor
Statusok
Monitor your Website and APIs from your Computer. Get Notified through Slack, E-mail when your server is down or response time is more than expected.
Stars: ✭ 1,522 (+1189.83%)
Mutual labels:  monitor
Bilibili member crawler
B站用户爬虫 好耶~是爬虫
Stars: ✭ 115 (-2.54%)
Mutual labels:  crawler
Baiducrawler
Sample of using proxies to crawl baidu search results.
Stars: ✭ 116 (-1.69%)
Mutual labels:  crawler
Tl Wn722n V2
Drivers for TP-LINK TL-WN722N version 2 .Clean ported for kernel 4.4, 4.8, 4.10 ,4.13 ,4.14 & 4.15 from source which was at 4.3
Stars: ✭ 112 (-5.08%)
Mutual labels:  monitor
Eaglet
最简單直接的 Koa2 的脚手架
Stars: ✭ 118 (+0%)
Mutual labels:  koa2
Memex Explorer
Viewers for statistics and dashboarding of Domain Search Engine data
Stars: ✭ 115 (-2.54%)
Mutual labels:  crawler
Decryptlogin
APIs for loginning some websites by using requests.
Stars: ✭ 1,861 (+1477.12%)
Mutual labels:  crawler
Douban Movie
Golang爬虫 爬取豆瓣电影Top250
Stars: ✭ 114 (-3.39%)
Mutual labels:  crawler
Jianso movie
🎬 电影资源爬虫,电影图片抓取脚本,Flask|Nginx|wsgi
Stars: ✭ 114 (-3.39%)
Mutual labels:  crawler
Examples Of Web Crawlers
一些非常有趣的python爬虫例子,对新手比较友好,主要爬取淘宝、天猫、微信、豆瓣、QQ等网站。(Some interesting examples of python crawlers that are friendly to beginners. )
Stars: ✭ 10,724 (+8988.14%)
Mutual labels:  crawler
Marklogic Data Hub
The MarkLogic Data Hub: documentation ==>
Stars: ✭ 113 (-4.24%)
Mutual labels:  etl
Moodle Downloader 2
A Moodle downloader that downloads course content fast from Moodle (eg. lecture pdfs)
Stars: ✭ 118 (+0%)
Mutual labels:  crawler
Lcrawl
一只优雅的正方教务系统爬虫。
Stars: ✭ 112 (-5.08%)
Mutual labels:  crawler
Datax
DataX is an open source universal ETL tool that support Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server
Stars: ✭ 116 (-1.69%)
Mutual labels:  etl
Koalerplate
Simple Koa Boilerplate for APIs
Stars: ✭ 118 (+0%)
Mutual labels:  koa2
Docs
《数据采集从入门到放弃》源码。内容简介:爬虫介绍、就业情况、爬虫工程师面试题 ;HTTP协议介绍; Requests使用 ;解析器Xpath介绍; MongoDB与MySQL; 多线程爬虫; Scrapy介绍 ;Scrapy-redis介绍; 使用docker部署; 使用nomad管理docker集群; 使用EFK查询docker日志
Stars: ✭ 118 (+0%)
Mutual labels:  crawler
Postgraphile
GraphQL is a new way of communicating with your server. It eliminates the problems of over- and under-fetching, incorporates strong data types, has built-in introspection, documentation and deprecation capabilities, and is implemented in many programming languages. This all leads to gloriously low-latency user experiences, better developer experiences, and much increased productivity. Because of all this, GraphQL is typically used as a replacement for (or companion to) RESTful API services.
Stars: ✭ 10,967 (+9194.07%)
Mutual labels:  koa2

xe-crawler

xe-crawler 是遵循声明式、可监测理念的分布式爬虫,其计划提供 Node.js、Go、Python 多种实现,能够对于静态 Web 页面、动态 Web 页面、关系型数据库、操作系统等异构多源数据进行抓取。xe-crawler 希望让使用者专注于领域逻辑而不用考虑调度、监控等问题,并且稍加改造就能用于系统监控、ETL 数据迁移等领域。更多的 xe-crawler 设计思想、设计规范参考爬虫实战 https://url.wx-coder.cn/3gyS2

Usage & Development

Standalone Crawler Framework | 单个爬虫框架的独立使用

Deployment with Supervisor | 带调度节点的集群化部署

Cases | 使用案例

Home & More | 延伸阅读

您可以通过以下导航来在 Gitbook 中阅读笔者的系列文章,涵盖了技术资料归纳、编程语言与理论、Web 与大前端、服务端开发与基础架构、云计算与大数据、数据科学与人工智能、产品设计等多个领域:

此外,前往 xCompass 交互式地检索、查找需要的文章/链接/书籍/课程;或者在在 MATRIX 文章与代码索引矩阵中查看文章与项目源代码等更详细的目录导航信息。最后,你也可以关注微信公众号:『某熊的技术之路』以获取最新资讯。

About

Motivation & Credits

  • annie: A fast, simple and clean video downloader

Golang

  • 2015-go_spider #Project#: An awesome Go concurrent Crawler(spider) framework. The crawler is flexible and modular. It can be expanded to an Individualized crawler easily or you can use the default crawl components only.

  • 2018-Muffet #Project#: Muffet is a website link checker which scrapes and inspects all pages in a website recursively.

  • 2018-ferret #Project#: ferret is a web scraping system aiming to simplify data extraction from the web for such things like UI testing, machine learning and analytics.

  • 2019-TopList #Project#: 今日热榜,一个获取各大热门网站热门头条的聚合网站,使用Go语言编写,多协程异步快速抓取信息

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].