Awesome Sre: A curated list of Site Reliability and Production Engineering resources.
Howtheysre: A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE).
Sentinel: A powerful flow control component enabling reliability, resilience and monitoring for microservices. (A high-availability flow control and protection component for cloud-native microservices.)
healthz: Easily add health checks to your Go services.
mingine: A module to get the minimum usable engine(s).
pylife: A general library for fatigue and reliability.
xk6-chaos: xk6 extension for running chaos experiments with k6 💣
airbud: Retrieving stuff from the web is unreliable. Airbud adds retries for production, and fixture support for test.
optimize-ubuntu: Optimize Ubuntu for usability, security, privacy and stability.
mapi-action: 🤖 Run a Mayhem for API scan in GitHub Actions.
vigor: Main repository of the Vigor NF verification project.
opsani-ignite: Evaluate and improve the reliability, performance and efficiency of your Kubernetes applications.
kraken: Chaos and resiliency testing tool for Kubernetes and OpenShift.
OpenCossan: An open and free toolbox for uncertainty quantification and management.