Hey everyone!
This time we are meeting at the SysEleven office on June 12th.
AGENDA:
18:30 - 19:00 - Arrival & networking with Bagels and Drinks
19:00 - 19:05 - Welcome and introduction
19:05 - 19:15 - Talk #0 - Introduction to SLOs and why we need them
19:15 - 19:45 - Talk #1 - Alerting on SLOs like Pros
19:45 - 20:00 - BREAK
20:00 - 20:45 - Talk #2 - Announcing the SLO-libsonnet
TALKS:
Introduction to SLOs and why we need them
Abstract: In this introduction to SLOs we want to give everyone in the audience a chance to learn about the very basics of this topic, so we are all well prepared for the second talk.
Biography: Paweł Krupa is a Software Engineer with the Red Hat OpenShift Monitoring Team and a maintainer of the Cloudalchemy Ansible roles, which automate Prometheus deployments in non-Kubernetes environments.
---
Alerting on SLOs like Pros
Abstract: The SRE Workbook (also known as “the second SRE book”) has a very intriguing chapter, “Alerting on SLOs” [1]. It is already very hands-on, but to level it up even more, let's look at how the ideas of this chapter were put into practice at SoundCloud. You can read more about it in a recent post on the SoundCloud backstage blog [2]. Expect something like an interactive version of the blog post, performed live on stage and with questions from the audience. A bonus topic is the subtle and not-so-subtle differences between the duration you calculate a `rate` over and the duration in the `for` clause of an alert.
Biography: Björn is an engineer at Grafana and a Prometheus developer.
Previously, he was a Production Engineer at SoundCloud, a Site Reliability Engineer at Google, and a number cruncher for science.
Links:
[1] https://landing.google.com/sre/workbook/chapters/alerting-on-slos/
[2] https://developers.soundcloud.com/blog/alerting-on-slos
---
Announcing the SLO-libsonnet
Abstract: Over the last year, jsonnet has gained popularity in the Prometheus ecosystem for writing configuration. We have extracted the essential steps for writing high-quality SLOs into a library that will improve the way you write alerting rules, recording rules, and dashboards for error rate and request latency.
Biography: Aditya Konarde is a Site Reliability Engineer at Red Hat working on Prometheus and Kubernetes. Matthias Loibl is a Software Engineer with the Red Hat OpenShift Monitoring Team.
---
Follow us on Twitter!
https://twitter.com/PrometheusBLN
Hope to see you all there!
Matthias & the Berlin Prometheus team
Wednesday, June 12, 2019
5:00 PM – 7:00 PM (UTC)
Polar Signals
Lead Organizer