Scale down the Cluster

Explore how to scale down nodes in a Kubernetes cluster to optimize resource usage and control expenses. Understand the conditions for node removal, how the cluster autoscaler identifies and drains candidates, and how pods are rescheduled to maintain cluster health.

We'll cover the following...

- Scale down the nodes
  - Candidate for removal
- The rules governing nodes scale down

Scale down the nodes

Scaling up the cluster to meet demand is essential because it lets us host all the replicas we need to fulfill (some of) our SLAs. When demand drops and our nodes become underutilized, we should scale down. Scaling down is not essential in the same way, since our users will not experience problems caused by having too much hardware in the cluster. Nevertheless, we shouldn’t keep underutilized nodes if we want to reduce expenses. Unused nodes are wasted money. That is true in all situations, and especially when running in the cloud and paying only for the resources we use. Even on-prem, where the hardware is already purchased, it still pays to scale down and release resources so that other clusters can use them.

We’ll simulate a decrease in demand by applying a new definition that resets the HPAs’ replica limits to 2 (min) and 5 (max). ...
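To see why lowering those limits behaves like a drop in demand, here is a minimal Python sketch of how an HPA picks a replica count. It uses the core formula from the Kubernetes documentation, `ceil(currentReplicas * currentMetric / targetMetric)`, and then clamps the result to the configured minimum and maximum. The function name, the starting replica count, the metric values, and the previous limits of 15 and 30 are all hypothetical and chosen only for illustration. The real controller also applies a tolerance and stabilization windows, which this sketch ignores.

```python
import math


def desired_replicas(current_replicas: int,
                     current_metric: float,
                     target_metric: float,
                     min_replicas: int,
                     max_replicas: int) -> int:
    """Simplified HPA calculation: scale by the metric ratio, then clamp."""
    # Core formula: ceil(currentReplicas * currentMetric / targetMetric)
    raw = math.ceil(current_replicas * current_metric / target_metric)
    # The result can never leave the [min_replicas, max_replicas] range.
    return max(min_replicas, min(max_replicas, raw))


# Hypothetical state: 15 replicas running exactly at their target utilization.
before = desired_replicas(15, 80.0, 80.0, min_replicas=15, max_replicas=30)

# Same load, but with the limits redefined to 2 (min) and 5 (max).
after = desired_replicas(15, 80.0, 80.0, min_replicas=2, max_replicas=5)

print(before, after)
```

Under these assumed numbers the load itself is unchanged, yet the new maximum forces the replica count down to 5. The removed Pods free up capacity, and that leaves some nodes underutilized.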
