Pitfalls in Measuring SLOs

Add to wishlistAdded to wishlistRemoved from wishlist 0

Add to compare+

Duration	28m
level	Intermediate
Course Creator	Gremlin
Last Updated	14-Dec-22

Pluralsight

Category: DevOps

In this talk, we will discuss how we brought the theory of SLOs to practice, and what we learned that we hadn’t expected in the process.

Add your review

Description
Reviews (0)

We built support for SLOs (Service Level Objectives) against our event store so we could monitor our own complex distributed system. In the process of doing so, we learned that there were a number of important aspects that we didn’t expect from carefully reading the SRE workbook. This talk is the story of the missing pieces, unexpected pitfalls, and how we solved those problems. We’d like to share what we learned and how we iterated on our SLO adventure. As an SLO advocate and a design researcher, we collected user feedback through iterative deployments to learn what challenges users were running into. This conversation will discuss how we iterated our design, based on user feedback; how we deployed, what we learned, and re-deployed; and how we collected information from our users and from the alerts our system fired. In this talk, we will discuss how we brought the theory of SLOs to practice, and what we learned that we hadn’t expected in the process. We’ll discuss implementing the SLO feature and burn alerts; and our experiences from working with the SRE team who started using the alerts. Our hope is that when you buy or build your SLO tools, you’ll know what to look for, and how to get started. implementors will be able to start with a more solid ground, and that we will be able to advance the state of SLO support for all teams that wish to implement them. The major design points will be broken into a discussion of what we actually built; a number of unexpected technical features; and ways that we had to educate users beyond the standard SLO guidelines. The talk is largely conceptual: no live code will be shown, although some innocent servers may well die in the process of being visualized.
Author Name: Gremlin
Author Description:
Gremlin is a Chaos Engineering service on a mission to help build a more reliable internet. Their solutions turn failure into resilience by offering engineers a fully hosted SaaS platform to safely experiment on complex systems, in order to identify weaknesses before they impact customers and cause revenue loss. Founded by CEO Kolton Andrus and CTO Matthew Fornaciari in 2016, the company has since raised $26.8Million in funding from Redpoint Ventures, Index Ventures, and Amplify Partners. Existi… more

Pitfalls in Measuring SLOs
28mins

User Reviews

0.0 out of 5

★★★★★

Write a review

There are no reviews yet.

Be the first to review “Pitfalls in Measuring SLOs” Cancel reply

Pitfalls in Measuring SLOs

Description
Reviews (0)

Start Course

All Categories

Pitfalls in Measuring SLOs

Table of Contents

User Reviews

Be the first to review “Pitfalls in Measuring SLOs” Cancel reply

COURSE PROVIDERS

CATEGORIES

Quick Links

Contact Us

Compare items

All Categories

Pitfalls in Measuring SLOs

Table of Contents

User Reviews

Be the first to review “Pitfalls in Measuring SLOs” Cancel reply

Related Products

Microservices Architecture for Absolute Beginners

Diploma in Amazon Web Services 2019

Scaling a Web App with Docker on AWS

Introduction to Kubernetes

DevOps – Application Lifecycle Management

Communicate effectively on GitHub using Markdown

COURSE PROVIDERS

CATEGORIES

Quick Links

Contact Us

Compare items