- Understanding the fundamental components and how they work together
- Core resources for running applications (Pods, Deployments, etc.)
- How components communicate and expose services
- Understanding how to persist and manage data
- Basic configuration concepts needed for all resources
- RBAC, SecurityContext, and other security concepts
- Advanced control over where and how pods run
- Extending Kubernetes functionality
- Developer Tools & Troubleshooting: day-to-day development and debugging tools
Control Plane (Master Components)
- api-server: API entry point for all operations
- etcd: Distributed key-value store for cluster state
- scheduler: Assigns pods to nodes
- controller-manager: Maintains desired state
- cloud-controller-manager: Interfaces with cloud providers
Node Components
- kubelet: Node agent ensuring containers run in pods
- kube-proxy: Network proxy, handles service networking
- container runtime: (Docker/containerd) Runs containers
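To see these components in a running cluster, a couple of read-only commands help (in managed clusters the control-plane pods may not be visible):
# List nodes and the kubelet/runtime versions they report
kubectl get nodes -o wide
# Control-plane and system components typically run as pods in kube-system
kubectl get pods -n kube-system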
apiVersion: v1
kind: Pod
metadata:
name: myapp
spec:
containers:
- name: myapp
image: nginx:latest
ports:
- containerPort: 80
- Smallest deployable unit
- Can contain multiple containers
- Share network namespace and storage
apiVersion: apps/v1
kind: Deployment
metadata:
name: myapp
spec:
replicas: 3
selector:
matchLabels:
app: myapp
template:
metadata:
labels:
app: myapp
spec:
securityContext: # Pod-level security settings
runAsNonRoot: true # Ensures no container runs as root
fsGroup: 2000 # Sets filesystem group for all containers
containers:
- name: myapp
image: nginx:latest
readinessProbe:
httpGet:
path: /healthz
port: 80
initialDelaySeconds: 5
periodSeconds: 10
failureThreshold: 3
livenessProbe:
httpGet:
path: /healthz
port: 80
initialDelaySeconds: 15
periodSeconds: 20
failureThreshold: 3
startupProbe:
httpGet:
path: /healthz
port: 80
initialDelaySeconds: 10
periodSeconds: 10
failureThreshold: 3
securityContext:
allowPrivilegeEscalation: false
readOnlyRootFilesystem: true
runAsUser: 1000 # Run as UID 1000
runAsGroup: 3000 # Run as GID 3000
capabilities:
drop: ["ALL"] # Drop all capabilities
add: ["NET_BIND_SERVICE"] # Only add required capabilities
resources:
requests:
memory: "64Mi"
cpu: "250m"
limits:
memory: "128Mi"
cpu: "500m"
volumeMounts: # Required when using readOnlyRootFilesystem
- name: tmp-volume # For temporary files
mountPath: /tmp
- name: var-run
mountPath: /var/run
volumes: # Define writable volumes
- name: tmp-volume
emptyDir: {}
- name: var-run
emptyDir: {}
- Manages ReplicaSets
- Handles rolling updates
- Maintains desired state
Probes:
- Readiness: Ensures pod is ready to receive traffic
- Liveness: Restarts container if it fails
- Startup: Holds off liveness/readiness checks until a slow-starting container has finished starting
Resource Properties Explained:
- requests: Minimum resources guaranteed to the container
  - Used by the scheduler to find suitable nodes
  - Important for proper pod placement
  - Should reflect actual application needs
- limits: Maximum resources the container can use
  - Container is throttled if it exceeds its CPU limit
  - Container is OOM-killed if it exceeds its memory limit
  - Should be set to prevent resource hogging
- CPU Units:
  - Expressed in cores or millicores
  - "1" = 1 CPU core
  - "500m" = 500 millicores = 0.5 CPU core
  - "250m" = 250 millicores = 0.25 CPU core
- Memory Units:
  - Expressed in bytes: Ki, Mi, Gi (binary)
  - "64Mi" = 64 mebibytes
  - "1Gi" = 1 gibibyte
  - Can also use K, M, G (decimal) units
Security Context Properties Explained:
- Pod-level security (applies to all containers):
  - runAsNonRoot: Prevents containers from running as root
  - fsGroup: Sets the group ID for mounted volumes
- Container-level security:
  - readOnlyRootFilesystem: Makes the root filesystem read-only
  - runAsUser: Specifies the UID to run the container
  - runAsGroup: Specifies the GID to run the container
  - allowPrivilegeEscalation: Controls whether the process can gain more privileges
  - capabilities: Fine-grained Linux capabilities control
    - drop: ["ALL"] removes all capabilities for security
    - add: [] only adds specific required capabilities
Note: When using readOnlyRootFilesystem: true, you need to:
- Mount emptyDir volumes for directories requiring write access (e.g., /tmp)
- Configure your application to use these writable mounts
- Ensure all required directories are either read-only or mounted as volumes
apiVersion: apps/v1
kind: StatefulSet
metadata:
name: mydb
spec:
serviceName: mydb # Headless service for network identity
replicas: 3
selector: # Must match template labels, like in Deployments
matchLabels:
app: mydb
template:
metadata:
labels: # Must match selector.matchLabels
app: mydb
spec:
containers:
- name: mydb
image: mysql:5.7
volumeMounts: # References volumes defined below
- name: data # Must match volume name in volumeClaimTemplates
mountPath: /var/lib/mysql
volumeClaimTemplates: # Creates PVC for each pod
- metadata:
name: data # Referenced by volumeMounts.name above
spec:
accessModes: ["ReadWriteOnce"]
resources:
requests:
storage: 1Gi
- For stateful applications
- Stable network identities
- Ordered scaling/updates
- Persistent storage per pod
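The StatefulSet above sets serviceName: mydb, which expects a matching headless Service to exist. A minimal sketch of that Service (name and labels assumed to match the example):
apiVersion: v1
kind: Service
metadata:
  name: mydb            # Must match spec.serviceName in the StatefulSet
spec:
  clusterIP: None       # Headless: gives each pod a stable DNS record
  selector:
    app: mydb           # Matches the StatefulSet's pod labels
  ports:
  - port: 3306          # MySQL port, assumed for this example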
apiVersion: apps/v1
kind: DaemonSet
metadata:
name: monitoring-agent
spec:
selector:
matchLabels:
app: monitoring-agent
template:
metadata:
labels:
app: monitoring-agent
spec:
containers:
- name: agent
image: monitoring-agent:latest
- Runs one pod per node
- Used for node-level operations
- Common for logging/monitoring
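DaemonSet pods are often expected on every node, including tainted control-plane nodes. A hedged sketch of tolerations that could be added under the pod template's spec above (the taint key varies by cluster setup and version):
spec:
  tolerations:
  - key: node-role.kubernetes.io/control-plane  # Common control-plane taint; older clusters use node-role.kubernetes.io/master
    operator: Exists
    effect: NoSchedule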
Services use label selectors to find and route traffic to pods. The relationship works as follows:
- Pods are labeled with key-value pairs
- Services use selectors to match these labels
- The Service automatically discovers and routes traffic to matching pods
# Example Pod with labels
apiVersion: v1
kind: Pod
metadata:
name: myapp-pod
labels: # Labels that can be selected by Services
app: myapp
tier: frontend
spec:
containers:
- name: myapp
image: nginx:latest
ports:
- containerPort: 80
---
# Service selecting the pod
apiVersion: v1
kind: Service
metadata:
name: myapp-service
spec:
type: ClusterIP # or LoadBalancer/NodePort
selector: # Selects pods with matching labels
app: myapp # Must match pod labels exactly
tier: frontend
ports:
- port: 80 # Port the service listens on
targetPort: 80 # Port on the pod to forward to
For Deployments/ReplicaSets, the label relationship is:
apiVersion: apps/v1
kind: Deployment
metadata:
name: myapp-deployment
spec:
replicas: 3
selector: # Deployment uses this to find its pods
matchLabels: # Must match template labels
app: myapp
tier: frontend
template:
metadata:
labels: # Labels applied to pods created by this deployment
app: myapp
tier: frontend
spec:
containers:
- name: myapp
image: nginx:latest
ports:
- containerPort: 80
---
# Service will automatically find all pods created by the deployment
apiVersion: v1
kind: Service
metadata:
name: myapp-service
spec:
type: ClusterIP
selector: # Matches labels in deployment's pod template
app: myapp
tier: frontend
ports:
- port: 80
targetPort: 80
Key Points about Service-to-Pod Mapping:
- Label Selection:
  - Services use label selectors to dynamically discover pods
  - Any pod with matching labels is automatically added to the service's endpoint list
  - Multiple pods with matching labels receive traffic according to the service's load-balancing rules
- Why Selectors in Deployments:
  - Deployments use selectors to track which pods they own
  - The selector must match the pod template labels
  - This creates a three-way relationship: Service → Pod Labels ← Deployment Selector
- Dynamic Updates:
  - When pods are added or removed, services automatically update their endpoints
  - No manual configuration is needed for service discovery
  - Makes scaling and updates seamless
- Endpoint Verification:
# Check if the service is finding pods
kubectl get endpoints myapp-service
# Detailed view of the service
kubectl describe service myapp-service
apiVersion: v1
kind: Service
metadata:
name: myapp-service
spec:
type: ClusterIP # Determines service exposure level
selector: # Finds pods to send traffic to
app: myapp # Must match pod labels
ports:
- port: 80 # Port exposed by the service
targetPort: 8080 # Port on the pod to forward to
- Types: ClusterIP, NodePort, LoadBalancer
- Service discovery and load balancing
- Stable network endpoint
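For comparison with the ClusterIP example above, a NodePort variant exposes the same pods on a static port of every node's IP. A sketch (the nodePort value is an assumption within the default 30000-32767 range):
apiVersion: v1
kind: Service
metadata:
  name: myapp-nodeport
spec:
  type: NodePort
  selector:
    app: myapp
  ports:
  - port: 80          # Cluster-internal port
    targetPort: 8080  # Pod port
    nodePort: 30080   # Exposed on every node; assumed value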
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
name: myapp-ingress
annotations:
nginx.ingress.kubernetes.io/rewrite-target: / # Nginx-specific annotation
nginx.ingress.kubernetes.io/ssl-redirect: "false" # Disable SSL redirect
spec:
rules:
- host: myapp.example.com # External hostname
http:
paths:
- path: / # URL path to match
pathType: Prefix
backend:
service:
name: myapp-service # Must match existing Service name
port:
number: 80 # Must match Service port
- L7 load balancing
- TLS termination
- Name-based virtual hosting
Installing Ingress Controller:
- Minikube Option (Easiest for Local Development):
# Enable the Ingress addon
minikube addons enable ingress
# Verify it's running
minikube addons list | grep ingress
kubectl get pods -n ingress-nginx
# Get Minikube IP for testing
minikube ip
# Add to /etc/hosts for local testing
echo "$(minikube ip) myapp.example.com" | sudo tee -a /etc/hosts
- Helm Option (Production):
helm repo add ingress-nginx https://kubernetes.github.io/ingress-nginx
helm repo update
helm install nginx-ingress ingress-nginx/ingress-nginx
- kubectl Option:
kubectl apply -f https://raw.githubusercontent.com/kubernetes/ingress-nginx/controller-v1.8.2/deploy/static/provider/cloud/deploy.yaml
Key Points:
- Ingress requires an Ingress Controller to work
- For local development, Minikube's addon is the simplest option
- Controller creates an actual load balancer (or uses NodePort in Minikube)
- Annotations control Nginx-specific behavior
- Verify controller is running:
kubectl get pods -n ingress-nginx
kubectl get services -n ingress-nginx
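Once the controller is up and the hosts entry from the Minikube option is in place, the Ingress can be exercised directly. A quick sketch (hostname and path taken from the example above):
# Through the /etc/hosts entry
curl http://myapp.example.com/
# Or without editing /etc/hosts, by overriding the Host header
curl -H "Host: myapp.example.com" http://$(minikube ip)/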
apiVersion: v1
kind: Service
metadata:
name: myapp-headless
spec:
clusterIP: None # This makes it a headless service
selector:
app: myapp
ports:
- port: 80
targetPort: 80
Key Points about Headless Services:
- DNS Resolution:
  - Returns Pod IPs directly instead of a service IP
  - Each Pod gets a DNS record: <pod-name>.<service-name>.<namespace>.svc.cluster.local
  - Useful for stateful applications needing direct Pod addressing
- Common Use Cases:
  - StatefulSets requiring stable network identities
  - Client-side service discovery
  - Database clusters needing direct pod-to-pod communication
- Example with StatefulSet (a DNS lookup sketch follows it):
apiVersion: apps/v1
kind: StatefulSet
metadata:
name: mysql
spec:
serviceName: mysql-headless # References headless service
replicas: 3
# ... rest of StatefulSet spec ...
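To see the per-pod records a headless service creates, a throwaway pod can run an nslookup against it. A sketch (the service name comes from the example above; the default namespace is assumed):
# Resolves to the individual pod IPs rather than a single ClusterIP
kubectl run -it --rm dns-test --image=busybox:1.36 --restart=Never -- \
  nslookup myapp-headless.default.svc.cluster.local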
apiVersion: v1
kind: Pod
metadata:
name: myapp
spec:
containers:
- name: myapp
image: nginx
volumeMounts: # References volumes defined below
- name: config-volume # Must match volume name
mountPath: /etc/config
- name: data-volume # Must match volume name
mountPath: /data
volumes: # Volume definitions
- name: config-volume # Referenced by volumeMounts.name
configMap:
name: app-config # Must match existing ConfigMap name
- name: data-volume # Referenced by volumeMounts.name
persistentVolumeClaim:
claimName: myapp-pvc # Must match existing PVC name
- Types: emptyDir, hostPath, configMap, secret, persistentVolumeClaim
- Lifecycle tied to pod
- Support for multiple volume types
apiVersion: v1
kind: PersistentVolume
metadata:
name: myapp-pv
spec:
capacity:
storage: 10Gi
accessModes:
- ReadWriteOnce
persistentVolumeReclaimPolicy: Retain
storageClassName: standard
hostPath: # Example backend, use cloud provider's storage in production
path: /mnt/data
- Cluster-level storage resource
- Independent of pod lifecycle
- Provisioned by admin or dynamically
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
name: myapp-pvc
spec:
accessModes:
- ReadWriteOnce
resources:
requests:
storage: 5Gi
storageClassName: standard
- Request for storage by pod
- Can specify size and access mode
- Binds to available PV
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
name: fast
provisioner: kubernetes.io/aws-ebs # Example for AWS
parameters:
type: gp2
encrypted: "true"
reclaimPolicy: Delete
volumeBindingMode: WaitForFirstConsumer
- Defines storage profiles
- Enables dynamic provisioning
- Provider-specific parameters
- Dynamic Provisioning
# StorageClass
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
name: fast
provisioner: kubernetes.io/aws-ebs
parameters:
type: gp2
# PVC
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
name: dynamic-pvc
spec:
storageClassName: fast
accessModes:
- ReadWriteOnce
resources:
requests:
storage: 10Gi
- Shared Storage (ReadWriteMany)
# PVC
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
name: shared-pvc
spec:
accessModes:
- ReadWriteMany
resources:
requests:
storage: 5Gi
storageClassName: nfs # Example using NFS
# Pod using shared storage
apiVersion: v1
kind: Pod
metadata:
name: shared-pod
spec:
containers:
- name: app
image: nginx
volumeMounts:
- name: shared-data
mountPath: /data
volumes:
- name: shared-data
persistentVolumeClaim:
claimName: shared-pvc
- Common Issues
- PVC stuck in Pending
- Volume mount permission issues
- Storage capacity issues
- Volume not detaching
- Debug Commands
# Check PV/PVC status
kubectl get pv,pvc
kubectl describe pv <pv-name>
kubectl describe pvc <pvc-name>
# Check StorageClass
kubectl get sc
kubectl describe sc <storage-class-name>
# Debug pod volume issues
kubectl describe pod <pod-name>
kubectl get events --field-selector involvedObject.name=<pvc-name>
- Best Practices
- Use dynamic provisioning when possible
- Set appropriate storage requests
- Consider volume expansion requirements (see the sketch below)
- Use appropriate access modes
- Implement proper backup strategies
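For the volume-expansion point above, expansion has to be enabled on the StorageClass before a PVC's requested size can be increased. A minimal sketch (the provisioner and parameters are assumptions; adjust per cloud):
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: expandable
provisioner: kubernetes.io/aws-ebs   # Example provisioner
allowVolumeExpansion: true           # Lets bound PVCs request a larger size
parameters:
  type: gp2
With that in place, raising a PVC's spec.resources.requests.storage triggers the resize; actual support depends on the storage driver.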
apiVersion: v1
kind: ConfigMap
metadata:
name: app-config
data:
# Simple key-value pairs
DATABASE_URL: "mysql://db:3306/myapp"
API_KEY: "development-key"
# File-like keys
config.json: |
{
"environment": "development",
"features": {
"flag1": true,
"flag2": false
}
}
nginx.conf: |
server {
listen 80;
server_name example.com;
}
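The same ConfigMap can also be created imperatively. A sketch (the local nginx.conf file path is an assumption):
kubectl create configmap app-config \
  --from-literal=DATABASE_URL="mysql://db:3306/myapp" \
  --from-literal=API_KEY="development-key" \
  --from-file=nginx.conf=./nginx.conf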
Using ConfigMaps:
apiVersion: v1
kind: Pod
metadata:
name: myapp
spec:
containers:
- name: myapp
image: myapp:1.0
# As environment variables
envFrom:
- configMapRef:
name: app-config
# Individual values
env:
- name: DB_URL
valueFrom:
configMapKeyRef:
name: app-config
key: DATABASE_URL
# As files
volumeMounts:
- name: config-volume
mountPath: /etc/config
volumes:
- name: config-volume
configMap:
name: app-config
apiVersion: v1
kind: Secret
metadata:
name: app-secrets
type: Opaque
data:
# Values must be base64 encoded
username: YWRtaW4= # 'admin'
password: cGFzc3dvcmQxMjM= # 'password123'
stringData:
# Values in stringData are automatically encoded
API_KEY: "my-actual-api-key"
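The base64 values above can be produced on the command line, or the Secret can be created imperatively and encoded for you. A quick sketch:
# Encode a value for the data: section (-n avoids a trailing newline)
echo -n 'admin' | base64
# Or let kubectl do the encoding
kubectl create secret generic app-secrets \
  --from-literal=username=admin \
  --from-literal=password=password123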
Using Secrets:
apiVersion: v1
kind: Pod
metadata:
name: myapp
spec:
containers:
- name: myapp
image: myapp:1.0
# As environment variables
envFrom:
- secretRef:
name: app-secrets
# Individual values
env:
- name: DB_PASSWORD
valueFrom:
secretKeyRef:
name: app-secrets
key: password
# As files
volumeMounts:
- name: secrets-volume
mountPath: /etc/secrets
readOnly: true
volumes:
- name: secrets-volume
secret:
secretName: app-secrets
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
name: pod-reader
rules:
- apiGroups: [""]
resources: ["pods"]
verbs: ["get", "watch", "list"]
- apiGroups: ["apps"]
resources: ["deployments"]
verbs: ["get", "list"]
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
namespace: default
name: pod-reader
rules:
- apiGroups: [""]
resources: ["pods"]
verbs: ["get", "watch", "list"]
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
name: read-pods-global
subjects:
- kind: User
name: jane
apiGroup: rbac.authorization.k8s.io
- kind: ServiceAccount
name: default
namespace: kube-system
roleRef:
kind: ClusterRole
name: pod-reader
apiGroup: rbac.authorization.k8s.io
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
name: read-pods
namespace: default
subjects:
- kind: User
name: jane
apiGroup: rbac.authorization.k8s.io
roleRef:
kind: Role
name: pod-reader
apiGroup: rbac.authorization.k8s.io
Common RBAC Verbs:
- get: Read a resource
- list: List resources
- watch: Watch for changes
- create: Create resources
- update: Modify existing resources
- patch: Partially modify resources
- delete: Delete resources
- deletecollection: Delete a collection of resources
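To check what a binding actually grants, kubectl auth can-i evaluates a verb/resource pair for the current or an impersonated user. A sketch using the jane examples above:
# As yourself
kubectl auth can-i list pods -n default
# Impersonating the user bound above
kubectl auth can-i get pods --as=jane -n default
kubectl auth can-i delete pods --as=jane -n default   # Expect "no" with the read-only role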
metadata:
labels:
app: myapp
environment: production
tier: frontend
- Used for object selection/grouping
- Used by controllers and services
- Should be kept minimal and relevant for identification
- Limited to 63 characters (alphanumerics plus "-", "_", ".")
metadata:
annotations:
kubernetes.io/ingress-bandwidth: "1M"
prometheus.io/scrape: "true"
prometheus.io/port: "9090"
deployment.kubernetes.io/revision: "2"
builder: "john@example.com"
gitCommit: "abc123"
buildDate: "2024-03-20"
- Store non-identifying auxiliary data
- Can contain arbitrary metadata
- Not used for selection
- Can contain large data blocks
- Common uses:
- Build/release information
- Logging/monitoring configurations
- Tool configurations
- Update timestamps
- Client library/tool information
- Contact information
Key Differences:
- Purpose
  - Labels: Identifying and selecting objects
  - Annotations: Non-identifying metadata and tool configurations
- Usage
  - Labels: Used by Kubernetes core functionality
  - Annotations: Used by tools and libraries
- Constraints
  - Labels: Strict character set and size limits
  - Annotations: Can contain any metadata
- Example use cases
  - Labels (see the selector commands below):
    - Service selector matching
    - Resource grouping
    - Batch operations
  - Annotations:
    - Ingress configurations
    - Prometheus scraping settings
    - Deployment rollout information
    - CI/CD metadata
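Label-based selection is what makes the grouping and batch operations above work. A few kubectl examples using the labels from this section:
# List pods carrying both labels
kubectl get pods -l app=myapp,tier=frontend
# Set-based selection
kubectl get pods -l 'environment in (production, staging)'
# Batch operation against everything matching a label
kubectl delete pods -l app=myapp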
spec:
nodeSelector:
disk: ssd
- Simple node selection
- Based on node labels
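For the nodeSelector above to match anything, a node needs to carry that label. A sketch (the node name is a placeholder):
# Label a node so pods with nodeSelector disk: ssd can land on it
kubectl label nodes <node-name> disk=ssd
# Inspect node labels
kubectl get nodes --show-labels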
spec:
affinity:
nodeAffinity:
requiredDuringSchedulingIgnoredDuringExecution:
nodeSelectorTerms:
- matchExpressions:
- key: zone
operator: In
values:
- zone1
- More expressive than nodeSelector
- Supports complex matching
- Can be required or preferred
spec:
affinity:
podAffinity:
requiredDuringSchedulingIgnoredDuringExecution:
- labelSelector:
matchExpressions:
- key: app
operator: In
values:
- cache
topologyKey: kubernetes.io/hostname
- Schedule pods relative to other pods
- Based on topology domains
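The inverse, podAntiAffinity, is commonly used to spread replicas of the same app across nodes. A hedged sketch in the same style as the example above (label values are assumptions):
spec:
  affinity:
    podAntiAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
      - weight: 100
        podAffinityTerm:
          labelSelector:
            matchExpressions:
            - key: app
              operator: In
              values:
              - myapp
          topologyKey: kubernetes.io/hostname   # Prefer a different node per replica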
# Node Taint
kubectl taint nodes node1 key=value:NoSchedule
# Pod Toleration
spec:
tolerations:
- key: "key"
operator: "Equal"
value: "value"
effect: "NoSchedule"
- Taints prevent pod scheduling on nodes
- Tolerations allow pods on tainted nodes
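A taint is removed with the same command plus a trailing dash:
# Remove the taint added above (note the trailing "-")
kubectl taint nodes node1 key=value:NoSchedule-
A toleration with operator: "Exists" (and no value) tolerates any value for that key.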
apiVersion: apiextensions.k8s.io/v1
kind: CustomResourceDefinition
metadata:
name: backups.myapp.com
spec:
group: myapp.com
versions:
- name: v1
  served: true
  storage: true
  schema:                    # Required in apiextensions.k8s.io/v1
    openAPIV3Schema:
      type: object
      x-kubernetes-preserve-unknown-fields: true
scope: Namespaced
names:
plural: backups
singular: backup
kind: Backup
- Extend Kubernetes API
- Define custom resources
- Automate application management
- Built using CRDs and controllers
- Handle application-specific logic
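With the backups.myapp.com CRD above registered, instances of the new kind can be created like any other resource. A sketch (the spec fields are assumptions; the permissive schema above accepts arbitrary fields):
apiVersion: myapp.com/v1
kind: Backup
metadata:
  name: nightly-backup
spec:
  schedule: "0 2 * * *"    # Hypothetical field an operator's controller might act on
  target: mydb
kubectl get backups would then list these objects; an operator's controller is what gives the spec fields meaning.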
# Logs
kubectl logs pod-name -c container-name
kubectl logs -f pod-name # Follow logs
# Debug
kubectl describe pod pod-name
kubectl get events --sort-by=.metadata.creationTimestamp
# Exec into container
kubectl exec -it pod-name -- /bin/bash
# Port forwarding
kubectl port-forward pod-name 8080:80
- Pod Stuck in Pending
  - Check node resources: kubectl describe node
  - Check pod events: kubectl describe pod <pod-name>
  - Verify node selectors/affinity rules
- Pod Stuck in CrashLoopBackOff
  - Check logs: kubectl logs pod-name
  - Verify resource limits
  - Check liveness/readiness probes
- Image Pull Errors
  - Verify image name/tag
  - Check image pull secrets
  - Ensure registry access
- Service Discovery Issues
  - Verify service selectors match pod labels
  - Check endpoints: kubectl get endpoints
  - Test using service DNS: service-name.namespace.svc.cluster.local
- Network Connectivity
  - Use network debug pods (see the sketch below)
  - Check network policies
  - Verify service/pod IP ranges
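A throwaway debug pod with network tooling makes the connectivity checks above concrete. A sketch (nicolaka/netshoot is a popular community image, not part of Kubernetes itself):
# Interactive shell with curl, dig, tcpdump, etc.
kubectl run -it --rm netdebug --image=nicolaka/netshoot --restart=Never -- /bin/bash
# From inside the pod: test a service by its DNS name
curl http://myapp-service.default.svc.cluster.local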
- Resource Management
  - Always set resource requests/limits
  - Use horizontal pod autoscaling (see the HPA sketch at the end of this section)
  - Monitor resource usage
- Application Configuration
  - Use ConfigMaps/Secrets
  - Implement proper health checks
  - Handle SIGTERM gracefully
- Debugging Tools
  - kubectl debug for ephemeral debug containers
  - ksniff for packet capture
  - stern for multi-pod logs
- Local Development
  - Use Minikube/Kind
  - Implement dev/prod parity
  - Use Skaffold for the development workflow
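For the horizontal pod autoscaling point above, a minimal HPA against the earlier myapp Deployment could look like this (a sketch; the metrics-server add-on must be installed for CPU metrics):
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: myapp-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: myapp
  minReplicas: 2
  maxReplicas: 5
  metrics:
  - type: Resource
    resource:
      name: cpu
      target:
        type: Utilization
        averageUtilization: 80
The imperative shortcut kubectl autoscale deployment myapp --min=2 --max=5 --cpu-percent=80 creates an equivalent object.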