HowtheysreA curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
Stars: ✭ 6,962 (-9.43%)
availability-calculatorCalculate how much downtime should be permitted in your Service Level Agreement or Objective
Stars: ✭ 60 (-99.22%)
Awesome Sre ToolsA curated list of Site Reliability and Production Engineering Tools
Stars: ✭ 186 (-97.58%)
KapoWrap any command in a status socket
Stars: ✭ 45 (-99.41%)
Gatus⛑ Gatus - Automated service health dashboard
Stars: ✭ 1,203 (-84.35%)
NetdataReal-time performance monitoring, done right! https://www.netdata.cloud
Stars: ✭ 57,056 (+642.24%)
Prom2teamsprom2teams is an HTTP server built with Python that receives alert notifications from a previously configured Prometheus Alertmanager instance and forwards it to Microsoft Teams using defined connectors
Stars: ✭ 122 (-98.41%)
CloudproberAn active monitoring software to detect failures before your customers do.
Stars: ✭ 1,269 (-83.49%)
CabotSelf-hosted, easily-deployable monitoring and alerts service - like a lightweight PagerDuty
Stars: ✭ 5,209 (-32.24%)
TcpprobeModern TCP tool and service for network performance observability.
Stars: ✭ 207 (-97.31%)
Minicron🕰️ Monitor your cron jobs
Stars: ✭ 2,351 (-69.42%)
MoiraRealtime Alerting for Graphite
Stars: ✭ 222 (-97.11%)
Slacknimate👯 Realtime text animation for Slack chatops
Stars: ✭ 250 (-96.75%)
Awesome Linuxaudio[mirror] A list of software and resources for professional audio/video/live events production on Linux.
Stars: ✭ 756 (-90.17%)
cliReliably CLI - Optimise your operations
Stars: ✭ 2 (-99.97%)
gansoi👽 Awesome Infrastructure Monitoring and Alerting
Stars: ✭ 31 (-99.6%)
krakenChaos and resiliency testing tool for Kubernetes and OpenShift
Stars: ✭ 161 (-97.91%)
Awesome Ssh💻 A curated list of SSH resources.
Stars: ✭ 1,742 (-77.34%)
awesome-game-designA comprehensive list of Game Design related learning materials, examples and tools.
Stars: ✭ 43 (-99.44%)
Hastic ServerHastic data management server for analyzing patterns and anomalies from Grafana
Stars: ✭ 292 (-96.2%)
Sematext Agent DockerSematext Docker Agent - host + container metrics, logs & event collector
Stars: ✭ 194 (-97.48%)
Wazuh DockerWazuh - Docker containers
Stars: ✭ 213 (-97.23%)
Wgcloudlinux运维监控工具,支持系统信息,内存,cpu,温度,磁盘空间及IO,硬盘smart,系统负载,网络流量等监控,API接口,大屏展示,拓扑图,进程监控,端口监控,docker监控,文件防篡改,日志监控,数据可视化,web ssh,堡垒机,指令下发批量执行,linux面板,探针,故障告警
Stars: ✭ 2,669 (-65.28%)
DogoMonitoring changes in the source file and automatically compile and run (restart).
Stars: ✭ 237 (-96.92%)
Hawkular MetricsTime Series Metrics Engine based on Cassandra
Stars: ✭ 225 (-97.07%)
WazuhWazuh - The Open Source Security Platform
Stars: ✭ 3,154 (-58.97%)
Bookmarks🔖 +4.3K awesome resources for geeks and software crafters 🍺
Stars: ✭ 210 (-97.27%)
My LinksKnowledge seeks no man
Stars: ✭ 311 (-95.95%)
Performance-Engineers-DevOpsThis repository helps performance testers and engineers who wants to dive into DevOps and SRE world.
Stars: ✭ 35 (-99.54%)
AtlantisTerraform Pull Request Automation
Stars: ✭ 4,236 (-44.89%)
HealthchecksA cron monitoring tool written in Python & Django
Stars: ✭ 4,297 (-44.1%)
Gauntlet🔖 Guides, Articles, Podcasts, Videos and Notes to Build Reliable Large-Scale Distributed Systems.
Stars: ✭ 336 (-95.63%)
xk6-chaosxk6 extension for running chaos experiments with k6 💣
Stars: ✭ 18 (-99.77%)
bots[DEPRECATED] Tradle bot framework, allows to drive user interactions in Tradle mobile and web apps and (soon) using smart contracts for critical functions
Stars: ✭ 20 (-99.74%)
Dockbix Agent Xxl🐳 Dockerized Zabbix agent with Docker metrics and host metrics support for CoreOS, RHEL, CentOS, Ubuntu, Debian, Fedora, Boot2docker, Photon OS, Amazon Linux, ...
Stars: ✭ 177 (-97.7%)
BosunTime Series Alerting Framework
Stars: ✭ 3,226 (-58.03%)
Elm Companies🌲 A list of companies using Elm in production.
Stars: ✭ 365 (-95.25%)
AutomatronInfrastructure monitoring framework turning DevOps runbooks into automated actions
Stars: ✭ 381 (-95.04%)
UnseeAlert dashboard for Prometheus Alertmanager
Stars: ✭ 700 (-90.89%)
Nightingale💡 A Distributed and High-Performance Monitoring System. Prometheus enterprise edition
Stars: ✭ 4,003 (-47.93%)
Microsoft365dscManages, configures, extracts and monitors Microsoft 365 tenant configurations
Stars: ✭ 374 (-95.13%)
HowtheyawsA curated collection of publicly available resources on how technology and tech-savvy organizations around the world use Amazon Web Services (AWS)
Stars: ✭ 389 (-94.94%)
Cachet MonitorDistributed monitoring plugin for CachetHQ
Stars: ✭ 427 (-94.45%)
Urlookerenterprise-level websites monitoring system
Stars: ✭ 469 (-93.9%)
RunbookA framework for gradual system automation
Stars: ✭ 531 (-93.09%)
Swagger StatsAPI Observability. Trace API calls and Monitor API performance, health and usage statistics in Node.js Microservices.
Stars: ✭ 559 (-92.73%)
RundeckEnable Self-Service Operations: Give specific users access to your existing tools, services, and scripts
Stars: ✭ 4,426 (-42.42%)
CyphonOpen source incident management and response platform.
Stars: ✭ 543 (-92.94%)
OpennmsEnterprise-Grade Open-Source Network Management Platform
Stars: ✭ 568 (-92.61%)