/governor

A collection of cluster reliability tools for Kubernetes

Primary LanguageGoApache License 2.0Apache-2.0

governor

Build Status codecov Go Report Card

A collection of cluster reliability tools built for Kubernetes

Governor is a collection of tools for improving the stability of the large Kubernetes clusters as a single Docker image.

Two common problems observed in large Kubernetes clusters are:

  1. Node failure due to underlying cloud provider issues.
  2. Pods being stuck in "Terminating" state and unable to be cleaned up.

node-reaper provides the capability for worker nodes to be force terminated so that replacement ones come up. pod-reaper does a force termination of pods stuck in Terminating state for a certain amount of time.

Usage

Assuming an AWS-hosted running kubernetes cluster:

kubectl create namespace governor

# Using a CronJob
kubectl apply -n governor -f https://raw.githubusercontent.com/keikoproj/governor/master/examples/node-reaper.yaml

kubectl apply -n governor -f https://raw.githubusercontent.com/keikoproj/governor/master/examples/pod-reaper.yaml

Available Packages

Package Description Docs
node-reaper terminates nodes in scaling groups node-reaper
pod-reaper force terminates stuck pods pod-reaper

Release History

Please see CHANGELOG.md.

❤ Contributing ❤

Please see CONTRIBUTING.md.

Developer Guide

Please see DEVELOPER.md.