Exploring High Availability and Fault Tolerance of a Cluster

Explore how to validate high availability and fault tolerance in a Kubernetes cluster by simulating node failures. Learn to manage worker node instances with AWS EC2 and kOps, observe automatic recovery through Auto Scaling Groups, and understand the processes that restore a cluster to its desired state after an instance termination.

A cluster is not reliable unless it is fault-tolerant. kOps is designed to provide fault tolerance, but we're going to validate that ourselves anyway.

Terminating a worker node

Let’s retrieve the list of worker node instances.

Shell
aws ec2 describe-instances | jq -r \
    ".Reservations[].Instances[] \
    | select(.SecurityGroups[].GroupName==\"nodes.$NAME\").InstanceId"

We use aws ec2 describe-instances to retrieve all the instances (five in total). The output is piped to jq, which keeps only the instances that belong to the security group dedicated to worker nodes (nodes.$NAME) and prints their IDs.
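
If jq is not available, the same result can likely be obtained with the AWS CLI alone. This is our own sketch rather than part of the lesson; it relies on the documented instance.group-name filter and a --query expression:

Shell
# A sketch equivalent to the jq pipeline above (not from the lesson).
# instance.group-name filters instances by security group name.
aws ec2 describe-instances \
    --filters "Name=instance.group-name,Values=nodes.$NAME" \
    --query "Reservations[].Instances[].InstanceId" \
    --output text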

The output is as follows:

Shell
i-063fabc7ad5935db5
i-04d32c91cfc084369

We’ll terminate one of the worker nodes. To do that, we’ll pick a random one and retrieve its ID. ...
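One plausible way to do this, sketched here under our own assumptions (the INSTANCE_ID variable name is ours, and shuf is the GNU coreutils tool for random selection), is to reuse the query from before and pick one line at random:

Shell
# A sketch of the step described above: grab the worker node IDs
# and pick one at random (INSTANCE_ID is a name we chose).
INSTANCE_ID=$(aws ec2 describe-instances | jq -r \
    ".Reservations[].Instances[] \
    | select(.SecurityGroups[].GroupName==\"nodes.$NAME\").InstanceId" \
    | shuf -n 1)

# Terminate the chosen instance; the Auto Scaling Group should
# eventually replace it to restore the cluster's desired state.
aws ec2 terminate-instances --instance-ids $INSTANCE_ID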