Awesome Sre: A curated list of Site Reliability and Production Engineering resources.
Stars: ✭ 7,687 (+23.64%)
Howtheysre: A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
Stars: ✭ 6,962 (+11.98%)
xk6-chaos: xk6 extension for running chaos experiments with k6 💣
Stars: ✭ 18 (-99.71%)
healthz: Easily add health checks to your Go services
Stars: ✭ 21 (-99.66%)
aws-chaos-scripts: (DEPRECATED) Collection of Python scripts to run failure injection on AWS infrastructure
Stars: ✭ 91 (-98.54%)
vigor: Main repository of the Vigor NF verification project.
Stars: ✭ 40 (-99.36%)
aws-fis-templates-cdk: Collection of AWS Fault Injection Simulator (FIS) experiment templates deployable via the AWS CDK
Stars: ✭ 43 (-99.31%)
Atlantis: Terraform Pull Request Automation
Stars: ✭ 4,236 (-31.86%)
Gauntlet: 🔖 Guides, Articles, Podcasts, Videos and Notes to Build Reliable Large-Scale Distributed Systems.
Stars: ✭ 336 (-94.6%)
homebrew-devops: 🍺 DevOps / SRE formulae for the @Homebrew package manager.
Stars: ✭ 36 (-99.42%)
newrelic-quickstarts: New Relic One quickstarts help accelerate your New Relic journey by providing immediate value for your specific use cases.
Stars: ✭ 46 (-99.26%)
Howtheyaws: A curated collection of publicly available resources on how technology and tech-savvy organizations around the world use Amazon Web Services (AWS)
Stars: ✭ 389 (-93.74%)
opsani-ignite: Evaluate and improve the reliability, performance and efficiency of your Kubernetes applications.
Stars: ✭ 17 (-99.73%)
terraform-aws-account: 🌳 A sustainable Terraform package which creates Account & IAM resources on AWS
Stars: ✭ 18 (-99.71%)
Performance-Engineers-DevOps: This repository helps performance testers and engineers who want to dive into the DevOps and SRE world.
Stars: ✭ 35 (-99.44%)
School Of Sre: At LinkedIn, we are using this curriculum for onboarding our entry-level talent into the SRE role.
Stars: ✭ 5,141 (-17.31%)
kaldi-timit-sre-ivector: Develop a speaker recognition model based on i-vectors using the TIMIT database
Stars: ✭ 17 (-99.73%)
terraform-onboarding: A Terraform workshop for junior IT infrastructure engineers, DevOps engineers, and SREs.
Stars: ✭ 26 (-99.58%)
availability-calculator: Calculate how much downtime should be permitted in your Service Level Agreement or Objective
Stars: ✭ 60 (-99.03%)
cli: Reliably CLI - Optimise your operations
Stars: ✭ 2 (-99.97%)
mingine: A module to get the minimum usable engine(s)
Stars: ✭ 17 (-99.73%)
sre.surmon.me: 💻 SRE service for Surmon.me blog.
Stars: ✭ 34 (-99.45%)
Chaos Ssm Documents: Collection of AWS SSM Documents to perform Chaos Engineering experiments
Stars: ✭ 225 (-96.38%)
airbud: Retrieving stuff from the web is unreliable. Airbud adds retries for production and fixture support for tests.
Stars: ✭ 15 (-99.76%)
optimize-ubuntu: Optimize Ubuntu for usability, security, privacy and stability
Stars: ✭ 15 (-99.76%)
sre-playground: 🎯 A set of Site Reliability Engineering notes & challenges
Stars: ✭ 24 (-99.61%)
mapi-action: 🤖 Run a Mayhem for API scan in GitHub Actions
Stars: ✭ 16 (-99.74%)
Runbook: A framework for gradual system automation
Stars: ✭ 531 (-91.46%)
kraken: Chaos and resiliency testing tool for Kubernetes and OpenShift
Stars: ✭ 161 (-97.41%)
Version Checker: Kubernetes utility that exposes, as metrics, the image versions in use compared to the latest available upstream.
Stars: ✭ 371 (-94.03%)
PowerShell-FeatureFlags: PowerShell module containing a Feature Flags implementation based on a local config file.
Stars: ✭ 15 (-99.76%)
Tcpprobe: Modern TCP tool and service for network performance observability.
Stars: ✭ 207 (-96.67%)
OpenCossan: OpenCossan is an open and free toolbox for uncertainty quantification and management.
Stars: ✭ 40 (-99.36%)
Jaeger Ui: Web UI for Jaeger
Stars: ✭ 639 (-89.72%)
knowledge: Everything I know (DevOps & CloudNative, Music, Homelab, Blockchain, AI, etc.)
Stars: ✭ 84 (-98.65%)
command-line-cheat-sheet: 📝 A place to quickly look up commands (bash, vim, git, AWS, Docker, Terraform, Ansible, kubectl)
Stars: ✭ 30 (-99.52%)
gansoi: 👽 Awesome Infrastructure Monitoring and Alerting
Stars: ✭ 31 (-99.5%)
My Links: Knowledge seeks no man
Stars: ✭ 311 (-95%)
tiket: TIKET is a ticketing/helpdesk system that helps you deal with issues and incidents in your organization or from customers.
Stars: ✭ 59 (-99.05%)
Provision: Digital Rebar Provision is a simple and powerful Golang executable that provides a complete API-driven DHCP/PXE/TFTP provisioning system.
Stars: ✭ 252 (-95.95%)
pylife: A general library for fatigue and reliability
Stars: ✭ 45 (-99.28%)
Sre Interviews: Curated list of good SRE interview questions.
Stars: ✭ 210 (-96.62%)
Sentinel: A powerful flow control component enabling reliability, resilience and monitoring for microservices. (A high-availability flow control and protection component for cloud-native microservices)
Stars: ✭ 18,071 (+190.67%)
devops-notes: My technical documentation in the SRE / DevOps paradigm.
Stars: ✭ 19 (-99.69%)
Cloud Ops Sandbox: Cloud Operations Sandbox is an open source tool that helps practitioners learn Service Reliability Engineering practices from Google and apply them to their cloud services using the Cloud Operations suite of tools.
Stars: ✭ 191 (-96.93%)
Jnitrace: A Frida-based tool that traces usage of the JNI API in Android apps.
Stars: ✭ 534 (-91.41%)
Rundeck: Enable Self-Service Operations. Give specific users access to your existing tools, services, and scripts
Stars: ✭ 4,426 (-28.81%)
awesome-game-design: A comprehensive list of game-design-related learning materials, examples and tools.
Stars: ✭ 43 (-99.31%)