This presentation will introduce Site Reliability Engineering, a practice that was pioneered at Google. SRE covers a lot of ground, but it's really about one thing: how does a small team of engineers support a huge infrastructure? This is the problem Google had to solve to operate at Google-scale.
You may think, "We don't operate at Google scale, what does SRE have to do with us?" It's a fair question! However, it's not just about scale. It's about ratios.
How many Ops engineers do you have?
How many servers/containers/networking devices do you support?
That ratio will only increase as your business grows. SRE is a recipe for success in supporting any scale, without having to double the size of the Ops team every six months. We'll cover the essentials:
- The philosophy behind SRE
- Service Levels: Indicators, Objectives, and Agreements
- Reliability Engineering
Attendees will gain an understanding of the concepts behind SRE, and how they can be adapted for an enterprise of any size.
Senior Staff Engineer