25.9.5
This website uses cookies to ensure you get the best experience on our website. Learn more

Site Reliability Engineering Observability

Manoel Marcelino de Sá Junior

Skillsoft issued completion badges are earned based on viewing the percentage required or receiving a passing score when assessment is required. Observability plays an important role in systems engineering because it enables real-time detection and diagnosis of potential issues, allowing for proactive problem-solving and enhanced performance. In this course, you will take a deep dive into site reliability engineering (SRE) observability, including the three pillars of observability: logs, metrics, and traces. Then you will explore the tools and technologies used for achieving observability and the methods for performing observability in distributed systems. Next, you will discover strategies for log management and analysis, methods for collecting and analyzing metrics, and effective trace analysis methods. You will examine observability tool use cases and methods for setting up observability-related alerts and for performing root cause analysis using observability data. Finally, you will learn how to set up a logging framework for a small application, create and configure alerts, and perform a network trace analysis using Microsoft Network Analyzer.

Issued on

September 25, 2024

Expires on

Does not expire