Conversations about Software Engineering

Conversations about Software Engineering (CaSE) is an interview podcast for software developers and architects about Software Engineering and related topics. We release a new episode every three weeks.

Alex Bramley on The Art of SLO, Part 1

Alex Bramley talks to Sven Johann about the basics of service level objectives. They begin with terminology (SLI, SLO, SLA, error budget), look at the cost of outages, and discuss what reliability has to do with customer happiness. They go on to explain why 100% reliability is the wrong target and what the right target might be. Alex then explains how to get started with collecting data about your system’s behaviour. They close the first part of this series by looking into latency SLIs.

Show Notes

Chapters:

  • 00:00:15 Welcome and intro
  • 00:02:14 Terminology: SLI, SLO, SLA
  • 00:09:05 Cost of a (cloud provider) outage
  • 00:11:22 Reliability and customer happiness
  • 00:20:19 Error Budgets
  • 00:26:31 100% reliability is the wrong target
  • 00:37:44 Collecting data
  • 00:54:31 Latency SLIs
  • 01:09:53 Outro
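
For listeners new to the terms in the chapter list, here is a minimal sketch of how an SLI, an SLO, and an error budget commonly relate. It is not taken from the episode: the function names, the 30-day window, and the example numbers are all illustrative assumptions.

```python
def sli(good_events: int, total_events: int) -> float:
    """Service level indicator: the fraction of events that were 'good'."""
    if total_events <= 0:
        raise ValueError("total_events must be positive")
    return good_events / total_events


def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Downtime (in minutes) an availability SLO tolerates over the window.

    The 30-day default window is an assumption made for this sketch.
    """
    if not 0.0 < slo_target <= 1.0:
        raise ValueError("slo_target must be in (0, 1]")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(good_events: int, total_events: int, slo_target: float) -> float:
    """Fraction of the error budget still unspent (negative once overspent)."""
    if not 0.0 < slo_target < 1.0:
        # A 100% target leaves no budget at all, so the ratio is undefined.
        raise ValueError("slo_target must be strictly between 0 and 1")
    allowed_bad = (1.0 - slo_target) * total_events
    actual_bad = total_events - good_events
    return 1.0 - actual_bad / allowed_bad


if __name__ == "__main__":
    # Hypothetical numbers: a 99.9% SLO and 5 failed requests out of 10,000.
    print(f"SLI: {sli(9_995, 10_000):.4f}")
    print(f"Budget over 30 days: {error_budget_minutes(0.999):.1f} minutes")
    print(f"Budget remaining: {budget_remaining(9_995, 10_000, 0.999):.0%}")
```

Under these assumptions, a 99.9% target over 30 days allows roughly 43 minutes of unavailability, while a 100% target allows none, which is one way to see why the chapter at 00:26:31 calls it the wrong target.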
