
Kubernetes - step by step

1 Our showcase application: FAIRDOM Seek

Seek is an extensive Data Management platform aimed mostly at the Life Sciences, but usable in any domain. It is a good example for going from bare metal to a full cloud setup because:

  • it is already available both as a container and as a bare-metal installation, with a complete Docker Compose file that remains easy to understand,
  • it generally relies on several containers (four with the default Docker Compose), several volumes, workers (i.e. side processes that run regularly to do batch work), a comprehensive search using Solr, and object storage,
  • it can be run as a single Seek container (using an SQLite database inside the container), so it ranges from really simple (for discovery or tutorials) to rather large (with load balancing, an external DB, backups). This will let us go from a deliberately naive case (in a Kubernetes context) to full setups covering most of the situations suitable for production.

1.1 Seek and its components

FAIRDOM-SEEK is a Ruby on Rails web application. In its standard Docker Compose deployment it is composed of four services that must all be running for the platform to be fully operational:

  • seek — the Rails application itself, served by Puma on port 3000. It handles all HTTP traffic, the JSON:API, authentication, and asset management.
  • seek-mysql — a MySQL 8.4 database that stores all structured metadata: users, projects, assets, permissions, job queues, etc.
  • seek-solr — an Apache Solr 8.11 instance (custom image fairdom/seek-solr:8.11) that powers full-text and faceted search across all platform assets.
  • seek-workers — a dedicated worker container running the same image as the main app but started with a different entrypoint. It processes background jobs (file content extraction, email delivery, Solr re-indexing, remote data fetching…) via Delayed Job, and runs scheduled tasks (feed refresh, sitemap generation…) via Supercronic.

Uploaded binary content (data files, model files, SOPs…) is stored outside the database in a filestore volume (seek-filestore), which in a basic setup is simply a local directory but can be swapped for any network-attached or S3-compatible object store in larger deployments.

An optional fifth service, OpenLink Virtuoso, can be added to expose a SPARQL endpoint over the RDF knowledge graph that SEEK builds from its metadata, enabling Linked Data queries for FAIR-compliance scenarios.

flowchart TD
    Browser -->|HTTP| seek["seek (Rails / Puma :3000)"]
    seek --> db["seek-mysql (MySQL 8.4)"]
    seek --> solr["seek-solr (Solr 8.11 · :8983)"]
    seek --> fs["seek-filestore (volume · binary assets)"]
    workers["seek-workers (Delayed Job + Supercronic)"] --> db
    workers --> solr
    workers --> fs
    seek -.->|optional| virtuoso["seek-virtuoso (Virtuoso 7 · :8890 SPARQL)"]

The table below lists all named Docker volumes and their role:

Volume               Mounted into               Contents
seek-filestore       seek, workers              Uploaded research assets
seek-mysql-db        seek-mysql                 MySQL data directory
seek-solr-data       seek-solr                  Solr index data
seek-cache           seek                       Rails fragment and asset cache
seek-virtuoso-data   seek-virtuoso (optional)   Virtuoso triple store

1.2 The default Docker Compose

The default Seek docker compose file looks a bit complex, but is straightforward when examined element by element.

x-shared:
  seek_base: &seek_base
    # build: .
    image: fairdom/seek:main

    restart: always
    environment: &seek_base_env
      RAILS_ENV: production
      SOLR_PORT: 8983
      SOLR_HOST: solr
      RAILS_LOG_LEVEL: info # debug, info, warn, error or fatal
    env_file:
      - docker/db.env
    volumes:
      - seek-filestore:/seek/filestore
      - seek-cache:/seek/tmp/cache
    depends_on:
      db:
        condition: service_healthy
      solr:
        condition: service_started


services:
  db:
    # Database implementation, in this case MySQL
    image: mysql:8.4
    container_name: seek-mysql
    command: --character-set-server=utf8mb4 --collation-server=utf8mb4_unicode_ci --log-error-verbosity=1
    restart: always
    stop_grace_period: 1m30s
    env_file:
      - docker/db.env
    volumes:
      - seek-mysql-db:/var/lib/mysql
    healthcheck:
      test: [ "CMD-SHELL", "mysqladmin ping -h localhost -u $$MYSQL_USER -p$$MYSQL_PASSWORD --silent" ]
      interval: 10s
      timeout: 5s
      retries: 5
      start_period: 20s


  seek:
    # The SEEK application
    <<: *seek_base
    container_name: seek
    command: docker/entrypoint.sh
    ports:
      - "3000:3000"
    environment:
      <<: *seek_base_env
      NO_ENTRYPOINT_WORKERS: 1
    healthcheck:
      test: [ "CMD", "curl", "-f", "http://localhost:3000/up" ]
      interval: 30s
      timeout: 5s
      retries: 5
      start_period: 20s

  seek_workers:
    # The SEEK delayed job workers
    <<: *seek_base
    container_name: seek-workers
    environment:
      <<: *seek_base_env
      QUIET_SUPERCRONIC: 1
    command: docker/start_workers.sh
    healthcheck:
      test: [ "CMD", "bash", "script/check_worker_pids.sh" ]
      retries: 5

  solr:
    image: fairdom/seek-solr:8.11
    container_name: seek-solr
    restart: always
    environment:
      SOLR_JAVA_MEM: -Xms512m -Xmx1024m
    volumes:
      - seek-solr-data:/var/solr/
    entrypoint:
      - docker-entrypoint.sh
      - solr-precreate
      - seek
      - /opt/solr/server/solr/configsets/seek_config

volumes:
  seek-filestore:
    external: true
  seek-mysql-db:
    external: true
  seek-solr-data:
    external: true
  seek-cache:
    external: true

1.2.1 Shared Base Configuration

x-shared:
  seek_base: &seek_base
    image: fairdom/seek:main
    restart: always
    environment: &seek_base_env
      RAILS_ENV: production
      SOLR_PORT: 8983
      SOLR_HOST: solr
    env_file:
      - docker/db.env
    volumes:
      - seek-filestore:/seek/filestore
      - seek-cache:/seek/tmp/cache
    depends_on:
      db:
        condition: service_healthy
      solr:
        condition: service_started

This is a YAML anchor: not a service, but a reusable template (the x- prefix marks it as a Compose extension field). Both seek and seek_workers inherit from it via <<: *seek_base. It sets the shared image, the always-restart policy, environment variables pointing to Solr, and the two shared volumes for uploaded files and cache. Crucially, it enforces startup order: the app won’t start until the database passes its health check and Solr has at least started.

1.2.2 The Database (db)

db:
  image: mysql:8.4
  command: --character-set-server=utf8mb4 --collation-server=utf8mb4_unicode_ci
  restart: always
  stop_grace_period: 1m30s
  healthcheck:
    test: ["CMD-SHELL", "mysqladmin ping -h localhost -u $$MYSQL_USER -p$$MYSQL_PASSWORD --silent"]
    interval: 10s
    retries: 5
    start_period: 20s

A standard MySQL 8.4 container, forced into utf8mb4 encoding (required for full Unicode support). The stop_grace_period: 1m30s gives MySQL time to flush and close connections cleanly on shutdown. The health check pings the server every 10 seconds, with a 20-second warm-up window — other services won’t start until this passes.

1.2.3 The Web App (seek)

seek:
  <<: *seek_base
  command: docker/entrypoint.sh
  ports:
    - "3000:3000"
  environment:
    <<: *seek_base_env
    NO_ENTRYPOINT_WORKERS: 1

This is the main Rails web process, inheriting the shared base and exposing port 3000 to the host. The NO_ENTRYPOINT_WORKERS: 1 flag tells the entrypoint script not to start background workers here — those are intentionally split off into the next service.

1.2.4 The Background Workers (seek_workers)

seek_workers:
  <<: *seek_base
  command: docker/start_workers.sh
  environment:
    <<: *seek_base_env
    QUIET_SUPERCRONIC: 1
  healthcheck:
    test: ["CMD", "bash", "script/check_worker_pids.sh"]
    retries: 5

A separate container running SEEK’s delayed job workers, using the same image but a different entrypoint. QUIET_SUPERCRONIC: 1 suppresses verbose cron-job logging. Splitting workers from the web process is good practice — it lets each scale and restart independently.

1.2.5 The Search Engine (solr)

solr:
  image: fairdom/seek-solr:8.11
  environment:
    SOLR_JAVA_MEM: -Xms512m -Xmx1024m
  volumes:
    - seek-solr-data:/var/solr/
  entrypoint:
    - docker-entrypoint.sh
    - solr-precreate
    - seek
    - /opt/solr/server/solr/configsets/seek_config

A pre-configured Solr image maintained by the SEEK team. The entrypoint uses solr-precreate to initialise a core named seek from a bundled config set on first run. Java heap is bounded between 512 MB and 1 GB — a sensible default for a small-to-medium deployment.

1.2.6 Volumes

volumes:
  seek-filestore:
    external: true
  seek-mysql-db:
    external: true
  seek-solr-data:
    external: true
  seek-cache:
    external: true

All four volumes are marked external: true, meaning Compose expects them to already exist on the host — it will not create or delete them automatically. This is a deliberate safety measure: running docker compose down -v won’t wipe your data.
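As a consequence, the volumes must be created once before the first start. A sketch of that preparation step, written as comments since the names must match the volumes section above:

# Compose expects these volumes to already exist; create them once with:
#   docker volume create seek-filestore
#   docker volume create seek-mysql-db
#   docker volume create seek-solr-data
#   docker volume create seek-cache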

1.3 What could benefit from a cloud setup

The benefits that can be expected from a cloud setup depend mostly on the scale of the platform and the expected traffic. As such, they are listed here from the most general benefits to those more relevant for larger platforms or more demanding requirements:

  • High availability: Cloud providers often have multiple data centers and availability zones, which can help ensure that your application remains available even if one data center experiences an outage. This can be achieved through load balancing and failover mechanisms. It is especially beneficial if the Kubernetes cluster is managed by an IT team (on site or not), even if they are not directly involved in the platform management: if one node of the cluster goes down, Kubernetes will automatically move the application to another node, keeping it available without any manual intervention. If the crash has to be investigated, it can be done later on without any rush, and without the need to restore the platform from a backup.
  • Relatively simple load balancing: Kubernetes has built-in support for load balancing, which can help distribute traffic across multiple instances of your application. This can help improve performance and ensure that your application can handle increased traffic. Note that load balancing can also be handled by a reverse proxy such as Nginx, but it is generally easier to set up and manage with Kubernetes, especially if you are using a managed Kubernetes service that includes load balancing as part of the offering.
  • Direct data access via S3 storage: Using S3 storage provides direct access to the data, with better performance and security than storing it on a local volume. This is especially beneficial for large files, as uploads and downloads will be faster, and the storage will be more secure and scalable. Access can be temporary (granted by token) and limited to a specific file, which is useful for security and for sharing data with external collaborators.
  • Scalability: Cloud platforms can automatically scale resources up or down based on demand. This means that if your application experiences a sudden increase in traffic, the cloud can allocate more resources to handle the load, and then scale back down when traffic decreases.

1.4 A pragmatic approach of Cloud setup

The main question of which setup to choose is a matter of benefit versus cost:

  • How much time will be needed for the training, the setup, and the maintenance,
  • How secure is the setup, and what solutions are available for backup,
  • How much complexity can be managed, as high complexity can itself become a security risk.

As such, it is often not beneficial to use the whole capability of a cloud setup, but to pick what will be a real benefit and ignore the rest in order to keep things manageable. It is also important to consider which level of availability, security, and scalability is really needed. Going from 99% uptime (the platform needs to be available 99% of the time) to 99.9% might involve a very high cost: several instances behind a load balancer, a distributed database with automatic data replication, a redundant identity server… Going from 99.9% to 99.99% will have an even greater cost.

2 A quick overview of Kubernetes

2.1 Good free introductory resources

2.1.1 Tutorials and sandboxes

2.1.2 Simple Kubernetes setup

On Windows, the simplest way to run Kubernetes is probably using Docker Desktop, which includes a simple Kubernetes setup among its options.

On Linux, there are lightweight setups using various approaches. The main ones are:

Warning

It is important to consider that, even if some of these setups can be used in production (such as Talos or k3s), Kubernetes is a complex system and should probably be managed by sysadmins, to ensure fault tolerance, backups, and security.

2.2 Limits of Kubernetes, complexity and many moving parts

Kubernetes is an ensemble of small, decoupled components. Each one is relatively simple to understand in isolation, and the overall architecture is logically consistent. However, a functional setup requires a large number of these components working together, which introduces significant operational complexity.

A typical deployment involves multiple layers: workloads (Pods, Deployments, StatefulSets), networking (Services, Ingress, DNS), configuration management (ConfigMaps, Secrets), storage (PersistentVolumes and PersistentVolumeClaims), and access control (RBAC). On top of that, there are cluster-level components such as the scheduler, controller manager, API server, and etcd, which are usually abstracted away but still influence behavior.

The “many moving parts” become especially apparent when troubleshooting or evolving a system. A single application request might traverse an Ingress controller, a Service, kube-proxy rules, and finally reach a Pod that depends on mounted volumes and injected configuration. If something breaks, the root cause could lie in any of these layers—or in the interactions between them.

This complexity is amplified by the ecosystem around Kubernetes. Most real-world setups rely on additional tools: Helm charts for packaging, operators for managing stateful applications, monitoring stacks (e.g., Prometheus), logging pipelines, and CI/CD integrations. Each tool introduces its own abstractions and configuration formats.

Some tools such as K9s can help to get a better overview of the cluster and its elements, but it is important to clearly define and probably document your setup, to ensure that you can understand it later on and that other people can understand it as well.

Note

That will mostly define the cost of using Kubernetes.

If you have access to a fully managed cluster, with a good documentation and a good support, the complexity will be limited to the parts you are responsible for (probably at least the application deployment, but maybe also a database, a log manager, …).

If you need to manage your cluster yourself, the cost might exceed the benefits, and you might prefer a simpler solution, such as Docker, Docker Swarm, or Proxmox (see the main step-by-step guide for more information).

2.3 Terminology & main elements

A first glance at Kubernetes can be very confusing. A big part of this confusion is due to the many “moving” parts with confusing names.

A full list of terms can be found on the official documentation.

The main terms/elements you should know are:

  • Cluster: the whole Kubernetes setup, composed of several worker machines called nodes. Some managed Kubernetes offerings support several clusters, for instance giving one cluster per user.
  • Node: one worker machine in Kubernetes, like a server. It can be a physical machine or a virtual machine.
  • Pod: the smallest object in Kubernetes, which runs one or several containers. The containers can be run by Docker or other container engines.
  • Service: a fixed network endpoint to a container in Kubernetes. It allows connecting to an application in a Pod without having this connection defined in the Pod itself. Kubernetes often has one extra layer of access (compared to Docker Compose, for instance) allowing a full decoupling of all elements. That makes it easier to change the configuration later on (automatically when needed, or following a user update).
  • Ingress: HTTP/HTTPS entry point that routes external traffic to Services (usually via hostnames/paths).
  • Gateway: newer, more flexible API to control traffic entering the cluster (successor-style model to Ingress).
  • Label: a key/value pair attached to objects; labels are what Kubernetes uses to find and group the different elements.
  • Kubectl: the command-line tool to communicate with the Kubernetes control plane. With it you can create objects, update them, list them…
  • Manifest: the JSON or YAML file that defines a Kubernetes object. Objects are all the elements of Kubernetes: Deployments, Services, ReplicaSets, DaemonSets, …

And generally, the applications are deployed as:

  • ReplicaSet: asks for a set number of Pods. Kubernetes will try to keep that number of Pods running, and will replace those which become unhealthy. The Pods can be on any node, and it is not guaranteed that Kubernetes will manage to reach the requested number of Pods (typically if there aren’t enough resources). It is possible to influence on which node the Pods will run using affinities, see Section 2.16
  • Deployment: manages a ReplicaSet and provides rolling updates. It allows you to update an application without downtime by gradually replacing old Pods with new ones. If something goes wrong, you can roll back to a previous version.
  • DaemonSet: ensures that one Pod runs on each (or some) node. It is typically used for system-level services such as monitoring agents, log collectors, or networking tools that need to run everywhere.
  • StatefulSet: manages Pods that need stable, unique identities and persistent storage. It is used for stateful applications such as databases or distributed systems that need predictable network names or data persistence.
  • Job: runs Pods to completion. It is used for tasks that need to run once or a fixed number of times, such as batch processing or data import. When the Job finishes successfully, the Pods are not restarted.
  • ConfigMap: stores non-confidential configuration data as key-value pairs. Applications can use ConfigMaps to configure themselves without having to rebuild their containers.
  • Secret: similar to ConfigMap, but used to store sensitive information such as passwords, tokens, or SSH keys. Secrets are stored in base64-encoded form and can be mounted into Pods or used as environment variables. Be aware that they are not encrypted, only encoded, so the security relies on keeping the Secret in a private place. As such, it is only moderately secure.

The main elements are represented below, without the network elements (Service and Ingress/Gateway):

Pod controllers (manage Pods)
├─ Deployment
│  └─ ReplicaSet
│     └─ Pod(s) - Container(s), generally only one
├─ DaemonSet
│  └─ Pod (one per Node) - Container(s), generally only one
├─ StatefulSet
│  └─ Pod(s) with identity + storage - Container(s), generally only one
└─ Job
   └─ Pod(s) until completion - Container(s), generally only one

Or with a graph:

graph TD
  Cluster --> Node
  Node --> Pod
  Pod --> Container

  Deployment --> ReplicaSet --> Pod
  DaemonSet --> Pod
  StatefulSet --> Pod
  Job --> Pod

  Service --> Pod
  ConfigMap --> Pod
  Secret --> Pod

A very simple setup will be an application with 3 replicas that will be distributed on 3 nodes:

flowchart LR
subgraph ControlPlane[Control Plane]
RS1[ReplicaSet Controller]
end
subgraph NodeA[Node A]
P1[Pod v1]
end
subgraph NodeB[Node B]
P2[Pod v1]
end
subgraph NodeC[Node C]
P3[Pod v1]
end


RS1[ReplicaSet v1 replicas=3] --> P1
RS1 --> P2
RS1 --> P3

But Kubernetes does not guarantee where your replicas will be placed, so several Pods might end up on the same node. If a Deployment does not specify a replica count, only one Pod is created (the default is one replica).

flowchart LR
subgraph ControlPlane[Control Plane]
RS1[ReplicaSet Controller]
D1[Deployment replicas not set]
end
subgraph NodeA[Node A]
P1[Pod v1]
end
subgraph NodeB[Node B]
P2[Pod v1]
P3[Pod v1]
end
subgraph NodeC[Node C]
P4[Pod 2 v1]
end


RS1[ReplicaSet v1 replicas=3] --> P1
RS1 --> P2
RS1 --> P3

D1 --> P4

StatefulSets are used when it is necessary to know which Pod is where. For instance, a replicated database might want to know which is the main Pod (generally the first one) and all the others, so it can manage them and select the next valid Pod if the main one is down:

flowchart LR
subgraph ControlPlane[Control Plane]
RS1[StatefulSet Controller]
end
subgraph NodeA[Node A]
P1[Pod v1 Example-1]
end
subgraph NodeB[Node B]
P2[Pod v1 Example-2]
end
subgraph NodeC[Node C]
P3[Pod v1 Example-3]
end

RS1[StatefulSet v1 replicas=3 Name=Example] --> P1
RS1 --> P2
RS1 --> P3

DaemonSets define a Pod that must run once on each Node. A common use is centralised log monitoring, i.e. reading the local logs to populate a central log database and get a clear overview: we want to monitor each node but do not want several agents (which would compete to read the local log files):

flowchart TB
    User --> Load_Balancer --> Service

    subgraph K8s_Cluster[Kubernetes Cluster]

        %% Application layer
        Service --> Deployment
        Deployment --> ReplicaSet

        ReplicaSet --> AppPod1
        ReplicaSet --> AppPod2

        LogAgent1_def["DaemonSet Manifest"]

        %% Nodes
        subgraph Node1["Worker Node 1"]
            AppPod1
            LogAgent1
        end

        subgraph Node2["Worker Node 2"]
            AppPod2
            LogAgent2
        end

        LogAgent1_def --> LogAgent1
        LogAgent1_def --> LogAgent2

        %% Log collection
        LogAgent1 --> AppPod1
        LogAgent2 --> AppPod2
    end

    %% Centralized logging
    subgraph Logging_Backend[Logging Backend]
        Graylog["Graylog"]
    end

    %% Log shipping
    LogAgent1 --> Graylog
    LogAgent2 --> Graylog

2.4 Everything decoupled

Kubernetes manages everything through its API and connects elements through labels. This is why there are so many extra layers compared to Docker Compose: full decoupling. Each object (Pod, Service, volume, configuration) refers to the others only through labels and names, so any of them can be replaced or reconfigured without touching the rest.

2.5 Most configuration is declarative

Kubernetes follows a declarative model: instead of describing how to perform actions, you define the desired state of your system (for example, how many replicas of a Pod should run, which image to use, or how services are exposed). The control plane continuously reconciles the actual state with this desired state and makes adjustments automatically.

This approach makes deployments more predictable, easier to version, and resilient to failures, since Kubernetes will always try to restore the declared configuration.

Here is a minimal Kubernetes Deployment that runs an NGINX container:

apiVersion: apps/v1
kind: Deployment
metadata:
  name: nginx-deployment        # Name of the Deployment
spec:
  replicas: 2                   # Desired number of Pods
  selector:
    matchLabels:
      app: nginx                # How the Deployment finds its Pods
  template:
    metadata:
      labels:
        app: nginx              # Labels applied to Pods
    spec:
      containers:
        - name: nginx           # Container name
          image: nginx:1.25     # Container image
          ports:
            - containerPort: 80 # Port exposed by the container

Explanations:

  • apiVersion / kind: Defines that this object is a Deployment using the stable apps/v1 API.
  • metadata.name: Unique name of the Deployment in the namespace.
  • spec.replicas: Kubernetes will ensure that exactly 2 Pods are running at all times.
  • spec.selector: Tells the Deployment which Pods it manages, based on labels.
  • spec.template: Template used to create Pods.
  • metadata.labels: Labels assigned to Pods (must match the selector).
  • spec.containers: Defines the container(s) running in each Pod.
  • image: The container image to run.
  • ports: Informational field describing exposed ports.

Once applied (kubectl apply -f deployment.yaml), Kubernetes will continuously ensure that:

  • 2 Pods are running,
  • they match this configuration,
  • and if a Pod crashes, it is automatically recreated.

This illustrates the declarative model: you describe the desired state, and Kubernetes handles the rest. At the beginning of using Kubernetes, it can be confusing to stop an application: if you delete a Pod declared in a deployment (for instance), it will be recreated automatically, as requested. To effectively stop the application, you need to modify or delete the top-level object (for example, a Deployment) that defines the desired state.

2.6 The various objects for your application

While the Pod is the smallest element in Kubernetes, it is directly started only for testing. If you define a pod or create it using kubectl, it will only run once and will not be restarted if something fails. As such, it is like docker run if started manually, or docker compose if started using a manifest, and offers no benefit — just the added complexity of the Kubernetes setup.

A Pod uses a container engine, which does not have to be Docker.

A ReplicaSet asks for a set number of pods, and Kubernetes will ensure that these pods are always running. If one pod becomes unhealthy or crashes, Kubernetes automatically removes it and creates a new one.

A Deployment is what you usually use for applications. It manages ReplicaSets and provides updates and rollbacks. You can think of it as “the normal way” to run an app on Kubernetes — one that should always be available and easy to update.

A Job is used for tasks that need to run once and then stop, for example, a script that imports some data or cleans up files. Kubernetes ensures that the job runs successfully to completion.

A DaemonSet runs one pod per machine (node). This is often used for background services such as system monitoring or log collection that must run everywhere.

A StatefulSet is like a Deployment but for applications that need to keep their identity and data across restarts — such as databases. Each pod in a StatefulSet has a fixed name and can have its own storage attached.

A CronJob defines Jobs that will run periodically, following the cron schedule format. They are well explained in the official documentation.
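As an illustration, here is a minimal CronJob manifest (the name and task are hypothetical) that runs a cleanup container every night at 02:00:

apiVersion: batch/v1
kind: CronJob
metadata:
  name: nightly-cleanup            # hypothetical name
spec:
  schedule: "0 2 * * *"            # cron format: every day at 02:00
  jobTemplate:
    spec:
      template:
        spec:
          restartPolicy: OnFailure # re-run the container if it fails
          containers:
            - name: cleanup
              image: busybox:1.36
              command: ["sh", "-c", "echo cleaning up"]  # placeholder task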

2.7 Visual overview

Here’s a simple way to picture how Kubernetes manages your applications:

+-------------------------------------------------------------------------------------------------+
|                                            Deployment                                           |
|                        (defines the app, version, and update strategy)                          |
|                                                                                                 |
|   +-----------------------------------------------------------------------------------------+   |
|   |                                        ReplicaSet                                       |   |
|   |                       (keeps the right number of Pods running)                          |   |
|   |                                                                                         |   |
|   |   +------------------------+   +------------------------+   +-------------------------+ |   |
|   |   |          Pod 1         |   |          Pod 2         |   |         Pod 3           | |   |
|   |   | (runs app in container)|   | (runs app in container)|   | (runs app in container) | |   |
|   |   +------------------------+   +------------------------+   +-------------------------+ |   |
|   +-----------------------------------------------------------------------------------------+   |
+-------------------------------------------------------------------------------------------------+
  • Deployment: Defines what you want to run and how to update it.
  • ReplicaSet: Ensures the right number of pods are running.
  • Pods: Run the actual container (your app).

If a pod fails, the ReplicaSet creates a new one. If you update your app, the Deployment creates a new ReplicaSet with the new version and removes the old one once everything runs correctly.

2.8 Updating an application

Applications are still running in containers, so updating means updating the container image. If Deployments are used, the update is managed automatically by Kubernetes: it follows an update strategy to start new containers while stopping and removing the old ones. This allows smooth, gradual updates with no downtime.

You can also roll back easily if something goes wrong, as Kubernetes keeps the previous version ready.
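For example, the update strategy can be tuned in the Deployment manifest; a sketch with illustrative values:

spec:
  replicas: 3
  strategy:
    type: RollingUpdate
    rollingUpdate:
      maxSurge: 1          # at most one extra Pod during the update
      maxUnavailable: 0    # never go below the desired replica count
# Roll back a failed update with: kubectl rollout undo deployment/<name>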

2.9 Where do you store the data

Kubernetes itself also stores data — but not application data.

The cluster state (what pods exist, their configuration, secrets, etc.) is stored in etcd, a distributed key-value database used internally by Kubernetes. This data is critical for the cluster to function, but it is not meant for your application data.

Containers are temporary — when they stop, everything inside disappears. For most applications, you need to keep data somewhere safe and separate from the containers. The common way to store data in Kubernetes is in volumes.

A Volume is a piece of storage that can be attached to a pod. It can be on the same machine, on a network drive, or in the cloud. Kubernetes makes sure that your pods can access this storage whenever they run.

If you need data that must stay linked to a specific pod (for example, for a database), you can use a PersistentVolume and a PersistentVolumeClaim. These ensure that even if your pods are restarted or moved to another machine, your data stays available and safe.

In short:

  • Pods are temporary.
  • Data should live in Volumes or PersistentVolumes.
  • Kubernetes handles the connection between the app and its data storage.

But you can also store your data in a database or an object storage (such as an S3 implementation). These might be part of the Kubernetes cluster, in another Kubernetes cluster, or fully external; their access and usage will generally be the same in all cases. When running within a Kubernetes cluster, they use volumes of their own, and their setup may involve setting up those volumes (see the note below). Some data storage (such as PostgreSQL) might be managed by an operator. Operators are software extensions of Kubernetes that can define, as their name implies, operations on the cluster. In that case, you generally set up the database and the operator as instructed by the operator documentation, and the database is then managed by the operator. Operators can take care of data replication, failure management, and more, and are generally the preferred way to set up data storage on Kubernetes when something more than a simple volume is needed.

2.9.1 A note about automatic storage (StorageClass)

Kubernetes can automatically create persistent storage for you — but only if the cluster has been set up with a storage backend.

When you create a PersistentVolumeClaim (PVC), Kubernetes checks if there is a StorageClass available. The StorageClass defines how to create the actual storage — for example, on cloud disks, local drives, or network storage. If one is available, Kubernetes provisions the storage automatically and attaches it to your pod.

If no StorageClass or backend is configured, the claim will just wait forever (in Pending state), and your pod won’t start. So persistence only works if the cluster administrator configured it in advance.

In short:

  • Cloud clusters (AWS, Google, Azure) usually have a StorageClass ready by default.
  • Local or on-premise clusters might need to install one manually (for example, a local-path-provisioner or NFS).
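A minimal PersistentVolumeClaim, matching the 10 GB request pictured in the overview below (the name and the commented StorageClass are illustrative):

apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: data-claim               # hypothetical name
spec:
  accessModes:
    - ReadWriteOnce              # mountable read-write by a single node
  resources:
    requests:
      storage: 10Gi
  # storageClassName: standard   # omit to use the cluster's default StorageClass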

2.9.2 Visual overview of automatic storage provisioning

+-----------------------------------------------+
|                 Your Pod                      |
|        (requests storage via PVC)             |
+-----------------------------------------------+
                      |
                      v
+-----------------------------------------------+
| PersistentVolumeClaim (PVC)                   |
| "I need 10 GB of storage, please."            |
+-----------------------------------------------+
                      |
                      v
+-----------------------------------------------+
| StorageClass (defines how to create storage)  |
| Example: AWS EBS, Google PD, NFS, Local Path  |
+-----------------------------------------------+
                      |
                      v
+-----------------------------------------------+
| PersistentVolume (PV)                         |
| Actual disk or network storage created        |
| automatically by the provisioner              |
+-----------------------------------------------+

If a suitable StorageClass is present, the system automatically creates a PersistentVolume and binds it to your claim. Otherwise, the claim just stays pending until an administrator provides a volume manually.

2.9.3 Visual overview of storage

+----------------------------+
|          Pod               |
|  (runs your container app) |
|          |                 |
|          | uses            |
|          v                 |
|      +--------+            |
|      | Volume |------------+--------------+
|      +--------+            |              |
+----------------------------+              |
                                            |
                           +--------------------------------+
                           | PersistentVolumeClaim (PVC)    |
                           |  (requests storage from pool)  |
                           +--------------------------------+
                                            |
                           +--------------------------------+
                           | PersistentVolume (PV)          |
                           | (actual disk / network storage)|
                           +--------------------------------+
  • Pod: The running container that needs to read or write data.
  • Volume: The connection point between the app and the storage.
  • PersistentVolumeClaim (PVC): A request for storage.
  • PersistentVolume (PV): The real storage space (disk, cloud volume, etc.).

This structure lets you move or restart pods without losing any data — Kubernetes will reconnect the storage automatically.

2.10 How Kubernetes keeps things running

Kubernetes is designed to make sure your applications stay up and running, even if something fails. It constantly watches all your pods and checks if they are healthy.

If a pod crashes, Kubernetes automatically restarts it or creates a new one. If a whole machine (called a node) goes down, Kubernetes moves your pods to another available machine.

You don’t need to manually restart or move anything — Kubernetes takes care of it. This is one of its biggest advantages compared to running containers manually: you describe what you want to run, and Kubernetes makes sure it stays that way.

2.10.1 Configuration values

ConfigMaps and Secrets

Regular values can be stored in a ConfigMap, while values that should be hidden are stored in Secrets. The main idea of Secrets is to keep your confidential values out of the regular configuration, in a separate, access-controlled resource. Secret values are encoded in Base64 to avoid issues with special characters, but this encoding does not provide real security, as it can easily be decoded. For actual protection, Secrets should be combined with additional measures such as RBAC (see below), encryption at rest, and external secret management systems.

ConfigMaps and Secrets can both be consumed by Pods as environment variables or mounted as files inside containers, making them flexible mechanisms for injecting configuration into applications at runtime.
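A minimal sketch of both objects, plus a Pod consuming them as environment variables (all names and values are hypothetical):

apiVersion: v1
kind: ConfigMap
metadata:
  name: app-config
data:
  LOG_LEVEL: info                # regular, non-confidential value
---
apiVersion: v1
kind: Secret
metadata:
  name: app-secret
type: Opaque
stringData:                      # written as plain text, stored Base64-encoded
  DB_PASSWORD: change-me
---
apiVersion: v1
kind: Pod
metadata:
  name: app
spec:
  containers:
    - name: app
      image: nginx:1.25
      envFrom:
        - configMapRef:
            name: app-config     # injects LOG_LEVEL
        - secretRef:
            name: app-secret     # injects DB_PASSWORD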

For more details, see the official Kubernetes documentation.

Role-Based Access Control (RBAC)

Kubernetes RBAC (Role-Based Access Control) is used to regulate who can access or modify resources such as ConfigMaps and Secrets. RBAC works by defining Roles (or ClusterRoles) that specify allowed actions (e.g., get, list, create) on specific resources, and RoleBindings (or ClusterRoleBindings) that assign these permissions to users, groups, or service accounts. Proper RBAC configuration is essential to ensure that only authorized components or users can read or modify sensitive data like Secrets, thereby strengthening the overall security model of the cluster.
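A sketch of a Role allowing read access to Secrets in one namespace, bound to a service account (all names and the namespace are hypothetical):

apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: secret-reader
  namespace: myapp
rules:
  - apiGroups: [""]              # "" selects the core API group
    resources: ["secrets"]
    verbs: ["get", "list"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: read-secrets
  namespace: myapp
subjects:
  - kind: ServiceAccount
    name: app-sa                 # the account granted the permissions
    namespace: myapp
roleRef:
  kind: Role
  name: secret-reader
  apiGroup: rbac.authorization.k8s.io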

For more details, see the official documentation.

2.11 Namespace

Using namespaces allows you to have several “virtual” clusters within the same physical cluster. It is a way to separate different environments (for instance, development and production) or different users (for instance, one namespace per user). It can also be used to separate different applications, but it is generally better to use labels for that purpose.

Namespaces are also useful for managing access and permissions, as you can define different roles and policies for each namespace.
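Creating a namespace is a one-object manifest; a sketch (the name is illustrative):

apiVersion: v1
kind: Namespace
metadata:
  name: development
# Objects are then placed in it via metadata.namespace, or with:
#   kubectl apply -f app.yaml -n development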

2.12 Network access

The network access to the application can be managed through Services, Ingress, or Gateway. Services are used for internal access, while Ingress and Gateway are used for external access. The main difference between Ingress and Gateway is that Ingress is a simpler and more limited solution, while Gateway is more flexible and powerful, but also more complex to set up and manage. Gateway is the newer solution and is generally recommended for new setups, but Ingress can still be used for simple setups or for compatibility with older Kubernetes versions.
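A sketch of a Service exposing Pods internally, plus an Ingress routing an external hostname to it (the hostname, labels, and ports are hypothetical):

apiVersion: v1
kind: Service
metadata:
  name: myapp
spec:
  selector:
    app: myapp                   # matches the Pods' labels
  ports:
    - port: 80                   # Service port inside the cluster
      targetPort: 3000           # container port
---
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: myapp
spec:
  rules:
    - host: myapp.example.org
      http:
        paths:
          - path: /
            pathType: Prefix
            backend:
              service:
                name: myapp
                port:
                  number: 80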

2.13 LoadBalancing

On the application side, load balancing in Kubernetes means running several replicas of the application, managed by a Deployment or a StatefulSet. On the network side, it is generally handled by a Service of type LoadBalancer, which creates an external load balancer and routes the traffic to the different replicas of the application. It is also possible to use an Ingress or a Gateway for load balancing, but a Service of type LoadBalancer is generally easier for this purpose.
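A sketch of such a Service (the selector must match the Pods’ labels; on a managed cloud cluster this typically provisions an external load balancer):

apiVersion: v1
kind: Service
metadata:
  name: myapp-lb
spec:
  type: LoadBalancer
  selector:
    app: myapp                   # traffic is spread across all matching Pods
  ports:
    - port: 80
      targetPort: 3000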

2.14 Several Deployments

To manage several kinds of deployments for an application, two complementary solutions can be used (alongside many external tools):

  • kustomize is a tool for managing Kubernetes configurations. It lets you keep a base configuration and create different overlays for different environments (for instance, development, staging, and production). It is a good solution for managing different configurations of the same application, as the common configuration stays in a single place and only the differences are defined in the overlays. A simple example of kustomize configuration can be found in the official documentation; a sketch follows this list.

  • helm is both a template engine and a package manager for Kubernetes. It lets you define a chart for an application and then create different releases for different environments, keeping the common configuration in a single place and defining only the differences per release. A simple example of helm configuration can be found in the official documentation. Helm charts can also describe the deployment of a whole application: the different components (for instance, the database, the web server, the cache, …) are defined in a single chart, from which different releases are created for different environments. They are recommended for complex applications with many components, as they keep the configuration organized and make it easy to manage the different versions of the application. Unfortunately, they can be a bit complex to set up and manage, especially for simple applications, and they can also create some confusion in configuration management, as it is not always clear which values are defined in the chart and which are defined in the release.
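A sketch of the kustomize layout mentioned above (file paths and values are illustrative): a production overlay reuses a base and only changes the replica count.

# base/kustomization.yaml
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization
resources:
  - deployment.yaml
  - service.yaml

# overlays/production/kustomization.yaml
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization
resources:
  - ../../base
replicas:
  - name: nginx-deployment      # must match the Deployment name in the base
    count: 4

# applied with: kubectl apply -k overlays/production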

2.15 Artifact Hub & Local repository

Helm charts can be stored on Artifact Hub, a centralized registry of charts, or on any compatible registry, which can also be set up locally.

To use a chart from a registry, the helm command is used:

helm repo add bitnami https://charts.bitnami.com/bitnami
helm install mylocalwordpress bitnami/wordpress

The helm command is well explained in the official documentation.

2.16 Affinities

Affinity and anti-affinity rules in Kubernetes allow you to control how Pods are scheduled onto nodes, based on labels. They can be used to ensure Pods are placed close to each other (affinity) or kept apart (anti-affinity), which is useful for performance, availability, or fault tolerance.

There are two main types: node affinity (rules about which nodes a Pod can run on) and pod affinity/anti-affinity (rules about how Pods relate to other Pods).

Affinities are defined in the Pod specification and give more expressive control than simple node selectors.
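For instance, a Pod template fragment using anti-affinity to spread replicas across nodes (the app label is illustrative):

spec:
  affinity:
    podAntiAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        - labelSelector:
            matchLabels:
              app: myapp          # do not co-locate with Pods carrying this label
          topologyKey: kubernetes.io/hostname   # at most one such Pod per node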

The concept is well explained in the official documentation: https://kubernetes.io/docs/concepts/scheduling-eviction/assign-pod-node/