vorozhko / Site Reliability Engineer Guide
Stars: ✭ 112
Projects that are alternatives of or similar to Site Reliability Engineer Guide
Cloud Ops Sandbox
Cloud Operations Sandbox is an open source tool that helps practitioners to learn Service Reliability Engineering practices from Google and apply them on their cloud services using Cloud Operations suite of tools.
Stars: ✭ 191 (+70.54%)
Mutual labels: cloud, sre
Performance-Engineers-DevOps
This repository helps performance testers and engineers who wants to dive into DevOps and SRE world.
Stars: ✭ 35 (-68.75%)
Mutual labels: engineering, sre
Howtheyaws
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world use Amazon Web Services (AWS)
Stars: ✭ 389 (+247.32%)
Mutual labels: cloud, sre
Devops Readme.md
What to Read to Learn More About DevOps
Stars: ✭ 398 (+255.36%)
Mutual labels: cloud, sre
Cloudprober
An active monitoring software to detect failures before your customers do.
Stars: ✭ 1,269 (+1033.04%)
Mutual labels: cloud, sre
Bedwarsrel
Bedwars Reloaded - The Minecraft Bedwars Plugin!
Stars: ✭ 108 (-3.57%)
Mutual labels: paper
Cas Webapp Docker
Apereo CAS Server web application running inside a docker container.
Stars: ✭ 107 (-4.46%)
Mutual labels: cloud
Nativescript App Templates
Monorepo for NativeScript app templates
Stars: ✭ 108 (-3.57%)
Mutual labels: cloud
Papers Notebook
📄 🇨🇳 📃 论文阅读笔记(分布式系统、虚拟化、机器学习)Papers Notebook (Distributed System, Virtualization, Machine Learning), created by @gaocegege
Stars: ✭ 1,678 (+1398.21%)
Mutual labels: paper
Singleviewreconstruction
Official Code: 3D Scene Reconstruction from a Single Viewport
Stars: ✭ 110 (-1.79%)
Mutual labels: paper
Tradingview Trainer
A lightweight app for practicing your trading on Tradingview
Stars: ✭ 106 (-5.36%)
Mutual labels: practice
Unlitclouds
A unity cloud shader, using vertex colors and tessellation for a simple stylized look.
Stars: ✭ 110 (-1.79%)
Mutual labels: cloud
Paperlib
Plugin Library for interfacing with Paper Specific API's with graceful fallback that maintains Spigot Compatibility, such as Async Chunk Loading.
Stars: ✭ 108 (-3.57%)
Mutual labels: paper
Site Reliability Engineer guide
Collection of books, research papers, videos and articles for mastering Site Reliability Engineer proficiency.
Books
- [ ] Modern Operating Systems Tanenbaum, Andrew S.
- [x] UNIX and Linux System Administration Handbook Nemeth, Evi
- [ ] TCP/IP Illustrated, Volume 3: TCP for Transactions, HTTP, NNTP, and the Unix (R) Domain Protocols Stevens, W. Richard
- [ ] Systems Performance: Enterprise and the Cloud
- [x] Site Reliability Engineering: How Google Runs Production Systems - Free to read online(https://landing.google.com/sre/book/index.html)
- [x] The Site Reliability Workbook
- [ ] The datacenter as a computer: an introduction to the design of warehouse-scale machines
- [ ] The Practice of System and Network Administration
- [ ] The Practice of Cloud System Administration: Designing and Operating Large Distributed Systems
- [ ] Time Management for System Administrators
- [ ] The Go Programming Language Donovan, Alan A. A.
- [x] Think Python Downey, Allen B.
- [ ] The Linux Command Line Jr., William E. Shotts
- [ ] Linux Server Hacks: 100 Industrial-Strength Tips and Tools Flickenger, Rob
- [ ] Programming Pearls Bentley, Jon L.
- [ ] Web Operations - Keeping the Data On Time
- [ ] Microservices in Production
- [ ] Docker up and running
- [x] Kubernetes Up and Running By Brendan Burns, Kelsey Hightower, Joe Beda
Research papers
- [x] Large-scale cluster management at Google with Borg
- [ ] MapReduce: simplified data processing on large clusters
- [ ] Bigtable: A Distributed Storage System for Structured Data
- [x] On designing and deploying internet-scale services
- [ ] Mesos: a platform for fine-grained resource sharing in the data center
- [x] Google: Reliable Cron across the Planet
Technologies
- [ ] Aurora
- [x] Docker
- [ ] Fluentd
- [ ] ElasticSearch
- [ ] GCE
- [ ] Hadoop
- [x] Kubernetes
- [ ] Mesos
- [ ] Kernel Based Virtual Machine
- [ ] Protocol Buffers
- [ ] Spark
- [ ] VMWare
Networking
Monitoring and alerting
- [x] Prometheus
- [x] PromCon 2016
SRE best practice
- [x] Software engineering at Google
- [x] Keys to SRE by Ben Treynor
- [x] How Container Clusters Like Kubernetes Change Operations
- [x] 10 Years of Crashing Google
- [x] Release Engineering Best Practices at Google
- [x] From Zero to Hero: Recommended Practices for Training your Ever-Evolving SRE Teams
- [x] Transactional System Administration Is Killing Us and Must be Stopped
- [x] Lessons Learned From Scaling Uber To 2000 Engineers, 1000 Services, And 8000 Git Repositories
- [x] Netflix: 190 Countries and 5 CORE SREs
- [x] Performance Checklists for SREs
- [x] Notes on SRE book
- [x] SYSADMIN (Un)Reliability Budgets
Trainings
More
- [ ] Google SRE resources
- [ ] USENIX SRE conferences
- [x] SREcon 2016
- [ ] awesome-sre
- [ ] Google Interview University
- [x] Thenewstack ebook series
- [ ] Web development with Go
- [ ] SREcon 2018
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].