ProvisionDigital Rebar Provision is a simple and powerful Golang executable that provides a complete API-driven DHCP/PXE/TFTP provisioning system.
TcpprobeModern TCP tool and service for network performance observability.
Cloud Ops SandboxCloud Operations Sandbox is an open source tool that helps practitioners to learn Service Reliability Engineering practices from Google and apply them on their cloud services using Cloud Operations suite of tools.
MarmotMarmot workflow execution engine
S3 Streaming Uploads3-streaming-upload is node.js library that listens to your stream and upload its data to Amazon S3 using ManagedUpload API.
Slo GeneratorEasy setup a service level objective using prometheus
CloudproberAn active monitoring software to detect failures before your customers do.
SkinnyThe Skinny Distributed Lock Service
Devops ExercisesLinux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
KapoWrap any command in a status socket
Dialectid e2eEnd to End Dialect Identification using Convolutional Neural Network
Turbine Ec2Turbine Instance Discovery based on EC2 tags
Black BeltInternal toolbelt on steroids (idle since September 2018)
Awesome SreA curated list of Site Reliability and Production Engineering resources.
JnitraceA Frida based tool that traces usage of the JNI API in Android apps.
RunbookA framework for gradual system automation
School Of SreAt LinkedIn, we are using this curriculum for onboarding our entry-level talents into the SRE role.
HowtheysreA curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
RundeckEnable Self-Service Operations: Give specific users access to your existing tools, services, and scripts
HowtheyawsA curated collection of publicly available resources on how technology and tech-savvy organizations around the world use Amazon Web Services (AWS)
Version CheckerKubernetes utility for exposing image versions in use, compared to latest available upstream, as metrics.
AtlantisTerraform Pull Request Automation
newrelic-quickstartsNew Relic One quickstarts help accelerate your New Relic journey by providing immediate value for your specific use cases.
sre-playground🎯 A set of Site Reliability Engineering notes & challenges
Gauntlet🔖 Guides, Articles, Podcasts, Videos and Notes to Build Reliable Large-Scale Distributed Systems.
devops-notesMy technical documentation in the SRE / DevOps paradigm.
xk6-chaosxk6 extension for running chaos experiments with k6 💣
aws-chaos-scriptsDEPRECATED Collection of python scripts to run failure injection on AWS infrastructure
aws-fis-templates-cdkCollection of AWS Fault Injection Simulator (FIS) experiment templates deploy-able via the AWS CDK
knowledgeEverything I know: DevOps & CloudNative, Music, Homelab, Blockchain, AI, etc...