This presentation will introduce Site Reliability Engineering, a practice that was pioneered at Google. SRE covers a lot of ground, but it's really about one thing: how does a small team of engineers support a huge infrastructure? This is the problem Google had to solve to operate at Google-scale. You may think, "We don't operate at Google scale, what does SRE have to do with us?" It's a fair question! However, it's not just about scale. It's about ratios. How many Ops engineers do you have? How many servers/containers/networking devices do you support? That ratio will only increase as your business grows. SRE is a recipe for success in supporting any scale, without having to double the size of the Ops team every six months. We'll cover the essentials: - The philosophy behind SRE - Service Levels: Indicators, Objectives, and Agreements - Reliability Engineering Attendees will gain an understanding of the concepts behind SRE, and how they can be adapted for an enterprise of any size.