Using Internal Metrics to Debug Potential Issues

Explore how to debug potential issues in Kubernetes applications using internal metrics and alerts. Learn to analyze generic and instrumented metrics, use labels to pinpoint slow responses by method and path, and apply thresholds to filter noise. This lesson helps you identify problem areas quickly and understand the benefits of combining detailed and generic metrics for effective troubleshooting.

We'll cover the following...

- Issue with nginx_ingress_controller_request_duration_seconds
- Switch to http_server_resp_time metric
- - Include labels into expressions
  - Adding a threshold
- Benefit of Instrumentation
- Combine generic metrics with detailed metrics

We’ll resend requests with slow responses again so that we get to the same point where we started this chapter.

for i in {1..20}; do
    DELAY=$[ $RANDOM % 10000 ]
    curl "http://$GD5_ADDR/demo/hello?delay=$DELAY"
done

open "http://$PROM_ADDR/alerts"

We sent twenty requests that will result in responses with random duration (up to ten seconds). Later on, we opened the Prometheus' alerts screen.

A while later, the AppTooSlow alert should fire (remember to refresh your screen), and we have a (simulated) problem that needs to be solved. Before we start panicking and do something hasty, we’ll try to find the cause of the issue.

Please click the expression of the AppTooSlow alert.

Issue with `nginx_ingress_controller_request_duration_seconds`

We are redirected to the graph screen with the pre-populated expression from the alert. Feel free to click the Expression button, even though it will not provide any additional info, apart from the fact that the application was fast, and then it slowed down for some inexplicable reason. You will not be able to gather more details from that expression. You will not ...

1.Before Getting Started

2.Autoscaling Deployments and StatefulSets

3.Auto-Scaling Nodes Of A Kubernetes Cluster

4.Collecting and Querying Metrics and Sending Alerts

5.Debugging Issues Discovered Through Metrics and Alerts

6.Extending HorizontalPodAutoscaler With Custom Metrics

7.Visualizing Metrics And Alerts

8.Collecting And Querying Logs

9.Conclusion

Using Internal Metrics to Debug Potential Issues

Issue with `nginx_ingress_controller_request_duration_seconds`

Using Internal Metrics to Debug Potential Issues

Issue with nginx_ingress_controller_request_duration_seconds

Issue with `nginx_ingress_controller_request_duration_seconds`