Incorporating Site Reliability Engineering (SRE) in Your System Design
SRE is the hot way to manage apps in production – but are you and your systems ready for it? This course teaches you how to design systems for maximum reliability, find the gaps in your current system design and adopt SRE smoothly and effectively.
Before you adopt SRE you need to be sure that your systems are designed to work well with SRE practices. In this course, Incorporating Site Reliability Engineering (SRE) in Your System Design, you’ll learn how to design systems with SRE in mind and assess what’s missing in your existing systems. First, you’ll discover how to architect apps for reliability, so temporary problems are automatically managed and bigger issues are quickly alerted. Next, you’ll explore how observability design supports SRE and helps you get your apps back online. Finally, you’ll delve into how to effectively measure and report on service levels. When you’re finished with this course, you’ll have the skills and knowledge of system design needed to bring your own apps into SRE.
Author Name: Elton Stoneman
Author Description:
Elton is a 10-time Microsoft MVP, author, trainer and speaker. He spent most of his career as a consultant working in Microsoft technologies, architecting and delivering complex solutions for industry leaders. He has delivered APIs on Azure serving millions of clients daily, Big Data solutions processing billions of events weekly, and cutting-edge solutions powered by containers. Elton’s experience with .NET goes from .NET 1.0 running on Windows Server, right up to .NET Core running on Linux. Wh… more
Table of Contents
- Course Overview
2mins - Architecting Systems for Reliability
34mins - Designing Observability for Fault Diagnosis
34mins - Driving Continuous Improvement with Service Levels
26mins
There are no reviews yet.