How to use volumes in Kubernetes
Overview
When working with K8s, any data stored inside a container is lost when that container shuts down or restarts. Volumes aim to solve this problem.
In an earlier shot, we ran a simple Node.js app on our cluster. In another shot, which was part of my series on Docker, I also talked about volumes.
Here we will combine those two concepts and look at how we can use volumes when working with K8s.
I recommend that you go through those two shots before reading this one.
When working with Docker, we use:
- Anonymous volumes
- Named volumes
- Bind mounts.
K8s, in contrast to Docker, supports a large number of volume types. Apart from regular volumes, it also has something called Persistent Volumes, which we will look at in more detail.
Understanding volumes in K8s
Let’s understand the above-mentioned types by going through three common problems that we might face. We’ll solve each one of them using a different type of volume each time.
We’ll be using the deployment.yaml file discussed in the previous shots and building on top of it.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: node-app-deployment
spec:
  replicas: 1
  selector:
    matchLabels:
      anything: node-app
  template:
    metadata:
      labels:
        anything: node-app
    spec:
      containers:
        - name: node-app-container
          image: YourDockerHubName/node-image
The first problem
- Let’s say that we have a container running inside a pod, and the application stores some user data.
- Now, if something were to happen to this container and it has to restart, all of the data the application stored would be lost. This is not at all desirable and is something we will fix.
- The simplest volume type we can use to fix this is the emptyDir type.
Solution to the first problem
- In our deployment.yaml file, in the pod specification, next to containers, we add the volumes key, where we list all the volumes within this pod.
- This will set up our volume, but we also need to make it accessible inside the container.
- So, we will add the volumeMounts key in the configuration of the containers.
The nice part is that using the emptyDir type is fairly simple.
deployment.yaml of first problem
Let’s see how the deployment.yaml file will look.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: node-app-deployment
spec:
  replicas: 1
  selector:
    matchLabels:
      anything: node-app
  template:
    metadata:
      labels:
        anything: node-app
    spec:
      containers:
        - name: node-app-container
          image: YourDockerHubName/node-image
          volumeMounts:
            - mountPath: /app/userData
              name: userdata-volume
      volumes:
        - name: userdata-volume
          emptyDir: {}
Explanation of the first problem
- We first listed the volumes we wanted to use under the volumeMounts key. Here we specify the mountPath, which is the location in our container where the files will be stored, and the name of the volume.
- We configure each volume under the volumes key by first specifying its name and then its config.
- The config is based on the type, which we have to specify first.
- Here the type is emptyDir, and we didn’t specify any special config for it, implying that we want to use the emptyDir type of volume with its default settings (a variant with extra options is sketched below).
- Doing so solves our problem. If for some reason our container now shuts down (assuming only one pod is present), then when it restarts (something K8s handles for us by default), it will have access to the data that was created earlier.
- Since the data isn’t being stored in the container, a new empty directory is created on the pod whenever the pod starts.
- Containers can then write to this directory, and if they restart or are removed, the data survives.
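By default, an emptyDir volume uses the node’s disk. As a hedged sketch (not part of the original example), the Kubernetes API also lets you back the directory with memory and cap its size; the values below are purely illustrative:

volumes:
  - name: userdata-volume
    emptyDir:
      medium: Memory      # illustrative: back the volume with RAM (tmpfs) instead of disk
      sizeLimit: 256Mi    # illustrative: cap how large the volume may grow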
The second problem
To solve the first problem, we stored the data on the pod so that even if the container restarts, the data is still present.
But what if the pod restarts?
- Suppose there is a single pod and it restarts. Then our data would be lost, and the app won’t work while the pod is down.
- However, if we have multiple pods and one of them shuts down for some reason, then the data stored in its volume will be lost.
- Our app would still function, because the other pods are running and K8s will automatically redirect incoming traffic to them.
- We would still lose the data the shut-down pod had. In short, our app would still work, but some user data would be missing.
Solution to the second problem
- If you remember the K8s architecture, you might have guessed it already: we could store the data on the node running these pods, assuming it is a single-node cluster.
- The hostPath type allows us to set a path on the host machine (the node running the pods), and the data from that path will then be exposed to the different pods.
- Multiple pods can share the same path on the host machine.
deployment.yaml of second problem
Once again let’s first have a look at our deployment.yaml file.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: node-app-deployment
spec:
  replicas: 1
  selector:
    matchLabels:
      anything: node-app
  template:
    metadata:
      labels:
        anything: node-app
    spec:
      containers:
        - name: node-app-container
          image: YourDockerHubName/node-image
          volumeMounts:
            - mountPath: /app/userData
              name: userdata-volume
      volumes:
        - name: userdata-volume
          hostPath:
            path: /data
            type: DirectoryOrCreate
Explanation of the second problem
While configuring the volume, we have now used hostPath instead of emptyDir and provided some configuration.
- First is path, which refers to the folder on our host machine where we want to save the data.
- The second is type, where we provide the value DirectoryOrCreate. This means that if the folder we specified above exists, it will be used, and if not, it will be created on the host machine.
What is a hostPath?
The hostPath type is similar to the bind mounts I talked about in the Docker series. Using this type should solve the problem of our data being lost when pods shut down.
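Besides DirectoryOrCreate, the hostPath volume accepts a few other type values for stricter validation. A minimal sketch, assuming the same /data path as above:

volumes:
  - name: userdata-volume
    hostPath:
      path: /data
      type: Directory   # the directory must already exist on the node; otherwise the mount fails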
The third problem
The third problem stems from our solution for the second one.
What if the pods are not present on the same node?
Then multiple pods running on different nodes will not all have access to the same user data our app stores. Here, persistent volumes come to the rescue.
Solution to the third problem
Persistent Volumes (PVs) are pod and node independent volumes. The idea is that instead of storing the data in the pod or a node, we have entirely separate entities in our K8s cluster that are detached from our nodes.
Each pod will then have a Persistent Volume Claim (PVC), and it will use this claim to access the standalone entities we created.
Description of Persistent volumes
Like regular volumes, Persistent Volumes also come in different types.
The hostPath type we just used is common to both regular and persistent volumes, and it is perfect for experimenting with persistent volumes when working locally.
This is because the cluster minikube provides us with is a single-node cluster. While you would not normally use a single-node cluster when working with persistent volumes, the workflow I’ll be explaining stays more or less the same.
If it’s confusing, remember that we can use the hostPath type of Persistent Volume because we are working with the single-node cluster that minikube set up for us.
Setting up the PV
- The first step is to set up the persistent volume. We are doing so using the hostPath type.
- For this, create a host-pv.yaml file, which should look something like this:
apiVersion: v1
kind: PersistentVolume
metadata:
  name: host-pv
spec:
  capacity:
    storage: 1Gi
  volumeMode: Filesystem
  storageClassName: standard
  accessModes:
    - ReadWriteOnce
  hostPath:
    path: /data
    type: DirectoryOrCreate
Explanation of the third problem
In the specification of this PV, we first mention the capacity.
The goal here is to control how much capacity can be used by the different pods that later get executed in our cluster.
Here we mention the total capacity we want to make available.
Pods, when they claim this persistent volume, can define how much of that capacity they require.
1Gi stands for one gibibyte (often loosely called a gigabyte).
We also have to specify the volumeMode key. There are two modes to choose from:
- Filesystem
- Block
These are the two types of storage available to us. Since we will have a folder in the filesystem of our virtual machine, we chose the Filesystem mode.
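As a hedged illustration (not part of our setup), a raw block volume would set volumeMode: Block on the PV, and the container would then attach it through a volumeDevices entry instead of volumeMounts. The device path below is purely illustrative:

# Fragment of a PV spec using raw block storage (illustrative only)
spec:
  volumeMode: Block

# Fragment of the pod spec: the container attaches the raw device directly
containers:
  - name: node-app-container
    image: YourDockerHubName/node-image
    volumeDevices:
      - name: userdata-volume
        devicePath: /dev/xvda   # illustrative path where the device appears inside the container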
Storage classes in K8s
K8s has a concept called storage classes. There is a default storage class, which we can see using the kubectl get sc command.
In the output, you’ll see that the name of the default storage class is standard, which is what we have specified here.
Storage classes give administrators fine control over how storage is managed.
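For reference, a storage class is itself a K8s object. Here is a minimal sketch of what the default one might look like; the provisioner shown is the one minikube commonly uses, so treat it as an assumption about your cluster:

apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: standard
provisioner: k8s.io/minikube-hostpath   # assumed minikube provisioner; varies by cluster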
accessModes tells how the PV can be accessed. Here we list all the modes we want to support.
The ReadWriteOnce mode allows the volume to be mounted as a read-write volume by only a single node, which is perfectly fine here since our cluster is a single-node cluster.
You might want to look into other modes, like ReadOnlyMany and ReadWriteMany, for a multi-node cluster.
After the accessModes, we mention the type of persistent volume (hostPath here) and its configuration, like we did earlier.
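If the underlying storage supported it, we could list several modes at once; a small illustrative fragment of the PV spec:

# Fragment of a PV spec (illustrative): advertising more than one access mode
accessModes:
  - ReadWriteOnce
  - ReadOnlyMany   # which modes actually work depends on the underlying volume type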
Setting up the PVC
Simply defining the PV is not enough. We also need to define the PV Claim which the pods will use later. For this create a host-pvc.yaml file which looks something like this:
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: host-pvc
spec:
  volumeName: host-pv
  accessModes:
    - ReadWriteOnce
  storageClassName: standard
  resources:
    requests:
      storage: 1Gi
- In the specification for this PV Claim, we first mention the name of the PV that this claim is for.
- Then we choose the accessModes from the ones we listed in the host-pv.yaml file. Since we listed only one, we have no choice but to go with that one here.
- After that, we again mention the storage class we want to use, like before.
- The resources key can be thought of as the counterpart of the capacity we mentioned in the host-pv.yaml file. Here we choose how much storage we want to request.
- We would generally not request the entire amount of storage available to us, though here it doesn’t matter since we are just testing things out (see the sketch below).
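As a hedged sketch of that last point, the claim could ask for only part of the PV’s 1Gi; the 500Mi value below is purely illustrative:

# Fragment of host-pvc.yaml (illustrative): requesting only part of the PV's capacity
resources:
  requests:
    storage: 500Mi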
Final configuration
Now that we have our PV and PVC set up, all that needs to be done is make changes in the deployment.yaml file so that we use this PV instead of the hostPath we were using earlier.
Our deployment.yaml should look like this now:
apiVersion: apps/v1
kind: Deployment
metadata:
  name: node-app-deployment
spec:
  replicas: 1
  selector:
    matchLabels:
      anything: node-app
  template:
    metadata:
      labels:
        anything: node-app
    spec:
      containers:
        - name: node-app-container
          image: YourDockerHubName/node-image
          volumeMounts:
            - mountPath: /app/userData
              name: userdata-volume
      volumes:
        - name: userdata-volume
          persistentVolumeClaim:
            claimName: host-pvc
Conclusion
The entire file is like it was before, except that we have replaced the hostPath key with persistentVolumeClaim and specified the claim we want to use.
With this, we’re good to go. To see our persistent volume in action, apply the files using this command as we did in the previous post.
kubectl apply -f=host-pv.yaml -f=host-pvc.yaml -f=deployment.yaml -f=service.yaml
Make sure you have your minikube cluster up before running these commands.
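Once the files are applied, an optional sanity check (not from the original post) is to confirm that the claim bound to the volume:
kubectl get pv
kubectl get pvc
The STATUS column for both should show Bound once the claim has been matched to the volume.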
Now we’ve finally set up a persistent volume. This PV is both node and pod independent.
Even though we used the hostPath type while setting up this PV (because the cluster minikube gives us is a single-node cluster), the overall process would be similar for other types of PVs.