Alerting on Metrics Abnormalities

Understand how to define alerting rules and configure Alertmanager with Prometheus to notify your team of abnormal application behaviors. This lesson covers setting up alerting in your observability stack to help you respond quickly to deviations in metrics such as latency, enhancing your ability to maintain reliable applications.

We'll cover the following...

Adding and configuring Alertmanager

Alerting on metrics enables us to define behavioral norms and specify how we should be notified when our applications behave abnormally. For example, if we expect our application to return HTTP responses in under 100 milliseconds and we observe a 5-minute span in which responses take longer than 100 milliseconds, we would want to be notified of the deviation from the expected behavior.

In this lesson, we will learn how to extend our current configuration of services to include an Alertmanager service (Alertmanager is a Prometheus component that handles alerts by triggering the appropriate actions) to provide alerts when observed behavior deviates from expected norms. We'll learn how to define alerting rules and specify where to send those notifications when our application experiences ...
