Site Reliability Engineering Observability
Observability plays an important role in systems engineering because it enables real-time detection and diagnosis of potential issues, allowing for proactive problem-solving and enhanced performance.
In this course, you will take a deep dive into site reliability engineering (SRE) observability, including the three pillars of observability: logs, metrics, and traces. Then you will explore the tools and technologies used for achieving observability and the methods for performing observability in distributed systems. Next, you will discover strategies for log management and analysis, methods for collecting and analyzing metrics, and effective trace analysis methods. You will examine observability tool use cases and methods for setting up observability-related alerts and for performing root cause analysis using observability data.
Finally, you will learn how to set up a logging framework for a small application, create and configure alerts, and perform a network trace analysis using Microsoft Network Analyzer.
Site Reliability Engineering Observability
it_dofsredj_04_enus