Establishing a Culture of Reliability
Understand the importance of fostering a culture of reliability within organizations to enhance operational performance.
This course is all about how to foster a culture that is based on reliability. We will learn how to utilize best practices for several key areas of being a Site Reliability Engineer (SRE) and how they contribute to a culture of reliability. We will cover how to have balanced and effective on-call rotations as well as how to handle incidents. Next, we will discuss how to review your system throughout its lifecycle to find and mitigate any potential risk factors. Managing system capacity at all phases of a system’s lifecycle is another major component to ensuring that everything is operating at maximum reliability. We will round out this course by discussing a thorn in every SRE’s side: toil. We will discuss how to identify and reduce toil to maximize time spent performing operational work.
There are no reviews yet.