Uncordoning Worker Nodes

Understand how to uncordon worker nodes after cordoning and failed draining in Kubernetes. This lesson shows how to roll back node scheduling disablement, restoring cluster ability to schedule new pods, and prepares you to fix node draining issues for cluster maintenance.

We'll cover the following...

- The issue we just created
- Inspecting the definition of node-uncordon.yaml and comparing it with node-drain.yaml
- Running chaos experiment and inspecting the output
- Checking the nodes to confirm the status

The output, in my case, is as follows (yours will be different).

NAME          STATUS                   ROLES  AGE VERSION
gke-chaos-... Ready,SchedulingDisabled <none> 13m v1.15.9-gke.22

You can see that the status of our single node is Ready,SchedulingDisabled. We run the experiment that failed to drain a node; this is a two-step process. First, the system disables scheduling on that node so that no new Pods are deployed. Then, it drains that node by removing everything. The experiment managed ...

1.Introduction To Kubernetes Chaos Engineering

2.Defining Requirements

3.Destroying Application Instances

4.Experimenting with Application Availability

5.Obstructing and Destroying Network

6.Draining and Deleting Nodes

7.Creating Chaos Experiment Reports

8.Running Chaos Experiments Inside a Kubernetes Cluster

9.Executing Random Chaos

10.What’s Next?

Uncordoning Worker Nodes

The issue we just created