- Step 1 Flask-App
- Step 2.1 Kubernetes Cluster
- Step 2.2 Jenkins
- Step 2.3 Prometheus-Alertmanager-Grafana Stack
- Step 2.4.a Log Index
- Step 2.4.b Security
- Step 2.4.c Log Rotate, Elasticsearch
- Step 3.1 Kubernetes Manifests
- Step 3.2 Nginx-Ingress
- Step 3.2.a Soft Pod Affinity
- Step 3.2.b Probes
- Step 3.3 and 3.4 Jenkins CI&CD
- Step 3.5 Webhook
- Step 4.1 PDP, Priority, Preemption
- Step 4.2 Canary, ArgoCD, Istio
- Step 4.3.a HPA
- Step 4.3.b Karpenter AutoScaling
I don't know much about Java, when I looked at the repository on github, I realized that someone else had fork here and solved the case. I chose to develop from scratch using Python so that the things we do are not the same.
All the steps are as requested, I have written the Dockerfile in multistage structure and returned the parameters of the query to the endpoint to stdout.
I created Kubernetes with vagrantfile on my local computer using Libvirt(KVM/QEMU).
Master and worker nodes each have 2 CPUs.
Master node has 4096MB ram, worker nodes have 5192MB ram.
Kubernetes version 1.26.0 was installed using kubeadm and shell scripts.
Containerd is selected as the container runtime, calico is selected as the Container network interface plugin.
I looked at the file descriptor and pid limit values with ulimit -n and ulimit -u and increased it by editing the /etc/security/limits.conf file on the nodes
I need to set ulimits on worker nodes since containers will inherit the ulimit of worker nodes.
For an application that dynamically uses different port numbers and connection numbers (eg game servers) I set the kernel parameters for TCP connections:
I expanded the parameter net.ipv4.ip_local_port_range and increased the parameter net.core.somaxconn This will allow more connections.
I installed Jenkins by customizing helm chart values.yaml and giving nodeSelector-label to run on NODE 3 only.
I used localPath for persistentVolume (jenkinsData).
I made customizations with the kube-prometheus-stack community chart and installed it with values.yaml.
Here, I created the application's grafana dashboards with configmap and made it permanent.
By setting metrics addresses and configurations in components such as kubeproxy, coredns, controller-manager, etcd, scheduler,
I ensured that prometheus collects metrics properly over the entire cluster.
Using elastic/elasticsearch, elastic/kibana and fluent/fluen-bit helm charts, I have set up an EFK stack with values.yamls separately.
I installed Elasticsearch as 1 node with master-ingest-data roles. (due to lack of resources)
When I tried to install with default values, my computer froze and most of what I did was wasted.
I created the "fluent-bit" index written by fluent-bit via kibana. I created and linked a lifecycle policy for this index
Here I made settings such as xpack security and certificates and made it permanent. (to see if it works as a result of restarts.)
I would like to suggest two different approaches to the problem here.
First, it is possible to write log files to filesystem instead of stdout in any language.
Required libraries and sample codes can be found. I am giving an example in python.
import logging
from flask import Flask
app = Flask(__name__)
log_file = "/app/logs/app.log"
logger = logging.getLogger("flask-app")
logger.setLevel(logging.DEBUG)
file_handler = logging.FileHandler(log_file)
file_handler.setLevel(logging.DEBUG)
formatter = logging.Formatter("%(asctime)s - %(levelname)s - %(message)s")
file_handler.setFormatter(formatter)
logger.addHandler(file_handler)
@app.route("/")
def hello():
logger.info("Hello, World!")
return "Hello, World!"
if __name__ == "__main__":
app.run()
Filebeat Sidecar Architecture
I am using Python's logging module. I create the logger object and define the file_handler that will write to the log file.
I specify the location of the log file and the log level (setLevel) in file_handler. Next, I create a formatter that determines how the log messages will be formatted and pass this formatter to the file_handler.
With logger.addHandler(file_handler) I add file_handler to the logger object so that log messages are written to the log file.
In this example, I'm having log messages written to /app/logs/app.log instead of stdout. Thus, we can work integrated with Filebeat, which controls the log file size and performs log rotation.
Filebeat must be in the same pod as the application container (sidecar)
It can read these files over a same mountPath volume.
Of course, technologies such as nfs or cephfs should not be used, as it will make network calls again, which will lead to a great loss of performance.
(This is why we encountered an incident in Türk Telekom.) here I suggest to use localpath or block volume as persistentVolume
spec:
containers:
- name: flask-app
image: flask-app/hello
ports:
- containerPort: 80
volumeMounts:
- name: app-logs
mountPath: /app/logs
- name: filebeat-sidecar
image: docker.elastic.co/beats/filebeat:7.5.0
volumeMounts:
- name: app-logs
mountPath: /app/logs
- name: filebeat-config
mountPath: /usr/share/filebeat/filebeat.yml
subPath: filebeat.yml
volumes:
- name: app-logs
- name: filebeat-config
configMap:
name: filebeat-configmap
items:
- key: filebeat.yml
path: filebeat.yml
filebeat.inputs:
- type: log
paths:
- /app/logs/*.log
fields:
app_name: flask-app
close_inactive: 5m
clean_inactive: 24h
clean_removed: true
harvester_buffer_size: 16mb
max_bytes: 1gb
rotate_every_bytes: 1gb
output.elasticsearch:
hosts: ["elasticsearch:9200"]
index: "flask-app_logs-%{+yyyy.MM.dd}"
logging.level: info
- Logstash S3 Architecture
In this example, it uploads the log file to S3 by checking the size of the log file and rotating it if it exceeds the maximum 1GB.
A function named rotate_log_file is defined. This function performs the rotation of the log file. It rotates the original log file with a new name and uploads the rotated file to S3. If the rotate operation is successful, the rotated file is deleted.
import os
import logging
import boto3
from botocore.exceptions import ClientError
from flask import Flask
app = Flask(__name__)
log_file = "/app/logs/app.log"
s3_bucket = "flask-app-logs"
s3_key_prefix = "log-files/"
# 1GB
max_log_file_size = 1024 * 1024 * 1024
logger = logging.getLogger("flask-app")
logger.setLevel(logging.DEBUG)
file_handler = logging.FileHandler(log_file)
file_handler.setLevel(logging.DEBUG)
formatter = logging.Formatter("%(asctime)s - %(levelname)s - %(message)s")
file_handler.setFormatter(formatter)
logger.addHandler(file_handler)
def upload_log_file_to_s3(log_file_path, s3_bucket, s3_key):
s3_client = boto3.client("s3")
try:
s3_client.upload_file(log_file_path, s3_bucket, s3_key)
return True
except ClientError as e:
logger.error("Failed to upload log file to S3: %s", str(e))
return False
def rotate_log_file(log_file_path):
rotated_file_path = log_file_path + ".old"
os.rename(log_file_path, rotated_file_path)
s3_key = s3_key_prefix + os.path.basename(rotated_file_path)
if upload_log_file_to_s3(rotated_file_path, s3_bucket, s3_key):
os.remove(rotated_file_path)
@app.route("/")
def hello():
logger.info("Hello, World!")
if os.path.getsize(log_file) > max_log_file_size:
rotate_log_file(log_file)
return "Hello, World!"
if __name__ == "__main__":
app.run()
For a longer storage or analysis afterwards; With logstash, we can get log files in the format and size we want. An example can be given as input s3 output as elasticsearch.
input {
s3 {
bucket => "flask-app-logs"
access_key_id => ""
secret_access_key => ""
region => "eu-west2"
prefix => "log-files/"
type => "logs"
}
}
output {
elasticsearch {
hosts => ["elasticsearch:9200"]
index => "flask-app-logs-%{+YYYY.MM.dd}"
}
}
I created the deployment, hpa, ingress, service and service-monitor files.
To collect application specific (flask) metrics, I used the flask-prometheus-metrics library and had metrics written to prometheus with service-monitor.
I installed nginx-controller with k8s manifest files. Here I preferred to open the service as nodePort because I do not have any loadbalancer.(maybe metalLB)
After adding my service to ingress here, I verified that nginx-ingress was working fine by entering FQDN in my own /etc/hosts file.
I used pod anti affinity to distribute pods with the same label to different nodes.
Here, I preferred the soft method (prefferred, required) because if I wrote required, 1 each of the 4 pods would be placed on 2 nodes and the rest would wait in a pending state.
When we make soft, we distribute it to the nodes as homogeneously as possible. The replica count does not matter, if there are 4 nodes 8 pods, 2 each, if there are 3 nodes 9 pods, 3 will be distributed.
I added liveness probe and readiness probe in control of it. While one is checking the port (tcp), the other is checking whether the /api endpoint is receiving a request.
Jenkins, kaniko
Honestly, the step where I had the most difficulty was jenkins.
I haven't written a lot of pipelines from scratch in 2 years.
The first point is that I am using containerd and not docker (of course not docker.sock). and containerd has no ability to build an image, just runtime!
So it delayed my pipeline writing steps a bit as Jenkins is not able to run agent on kubernetes and connect to this socket(/var/run/docker.sock).
For this reason, I decided to choose kaniko.
Kaniko is an image build and push tool developed by Google. The most important thing is that it works in a serverless structure and does not need to be connected to a daemon.(like docker)
Then, I created a pod agent with a builder.yaml that contains both kaniko and kubectl containers.
Ansible, kubectl
I wrote an ansible deploy.yaml but installing ansible on jenkins is close to impossible.
Since we are running Jenkins on a container, we cannot put a binary in it, we cannot install pip packages. No matter how much I tried and tried to make it permanent, I realized that it did not work properly after restarting the pod.
Likewise, I started looking for a plugin that works in a container, and even though I installed the ansible-plugin, I could not write ansible-playbook in the pipeline because the PATH and bin variables from the source image did not work properly and I was stuck with user privileges.
however, I wrote a playbook similar to the one we use in Türk Telekom to do the task.
If we were using jenkins on a virtual machine or physical server, I could simply install and run ansible like i did before, the ansible binary would run on the os jenkins is on. So I used a deploy step in jenkins pipeline using kubectl image as in builder.yaml.
And, about 1 day later, it came to my mind, I realized why I couldn't run an ansible container agent and deploy from there while I could use podTemplate.
I wrote deployer.yaml and a dockerfile image with ansible, kubernetes pip packages in it. Then I successfully performed the deployment step with ansible-playbook by using this and mounting kubeconfig to the container as secret.
If it was possible here, I think it would be healthier to progress with GitLab. It is easier to write and run the pipeline, and it saves time thanks to the features such as the helm-chart repository and the image registry. In my humble opinion, ArgoCD can be used in the CD step.
I used python and flask, created a docker image and created files such as webhook-config, certs, deployment, service and tested both true and false condition.
I wrote bash scripts with curl for this that make testing easier.
The important thing here is to create the priorityClasses in both applications. We have to make our application X use the higher priorityclass. This allows application X to have a higher priority schedule than application Y at the time of load.
With nodeAffinity we can identify higher capacity nodes specific to X.
With pod distribution budget If we want other applications to use the resources on the nodes more equally, we can limit the number of replicas that the application will run on certain nodes.
#create for X
apiVersion: scheduling.k8s.io/v1
kind: PriorityClass
metadata:
name: high-priority
value: 1000000
globalDefault: false
description: "This priority class gives high priority to App X"
---
# create for Y
apiVersion: scheduling.k8s.io/v1
kind: PriorityClass
metadata:
name: low-priority
value: 500000
globalDefault: false
description: "This priority class gives low priority to App Y"
---
# use in pod
apiVersion: v1
kind: Pod
metadata:
name: app-x
spec:
priorityClassName: high-priority
containers:
- name: app-x
image: app-x-image:latest
---
#Limiting the Number of Pods in a Specific Node
apiVersion: policy/v1beta1
kind: PodDisruptionBudget
metadata:
name: my-deployment-pdb
spec:
maxUnavailable: 1
selector:
matchLabels:
app: my-deployment
topologyKey: kubernetes.io/hostname
---
#Equal Distribution, up to 2 pods per node
apiVersion: policy/v1beta1
kind: PodDisruptionBudget
metadata:
name: my-app-pdb
spec:
maxUnavailable: 0
selector:
matchLabels:
app: my-app
minAvailable: 2
If I were to implement canary deployments, I would use Istio for managing traffic splitting and routing between different versions of my application.
I would define the application manifests, including the deployment, service, and Istio VirtualService resources, to specify the canary deployment strategy such as traffic splitting rules.
In my pipeline, I would integrate ArgoCD and configure it to work with Argo Rollouts. Argo Rollouts is an extension to ArgoCD that provides advanced deployment strategies, including canary deployments.
By enabling Argo Rollouts integration, I would have the ability to manage canary deployments seamlessly.
Once the canary deployment is in place, I would monitor its performance and analyze metrics and user feedback. Based on the analysis results, I would decide whether to promote the canary deployment to a wider audience or roll back if any issues are detected.
By following this approach, I can ensure smooth canary deployments using Istio for traffic management and ArgoCD with Argo Rollouts for pipeline integration and management of the deployment process.
To ensure that the application scales out/in appropriately before/after a certain period of time, I would use Horizontal Pod Autoscaling (HPA).
HPA automatically adjusts the number of replicas for a Deployment based on the observed CPU utilization or other custom metrics.
By defining appropriate metrics and thresholds, HPA can automatically scale the application horizontally to handle increased traffic during peak times.
This ensures that the application can dynamically scale up or down based on demand, preventing slowness and ensuring optimal performance.
To address the situation where there are not enough resources on the cluster during scaling at peak times,
I would use Karpenter. Karpenter is a Kubernetes-based solution that provides automated cluster autoscaling.
By defining scaling profiles and scheduling rules, Karpenter can automatically scale the cluster resources, such as nodes, based on predetermined conditions.
For example, I can configure Karpenter to increase the number of nodes before certain times to ensure sufficient resources are available during peak traffic.
This proactive scaling approach allows for seamless handling of increased load without slowing down the system.