poison-pill

command module
v0.0.2 Latest
Published: Jun 9, 2021 License: Apache-2.0 Imports: 18 Imported by: 0

README

Poison Pill k8s Node Remediation

Existing bare-metal remediation strategies use BMC credentials to power-cycle and/or reprovision the host. However, some environments either do not include BMCs or have policies in place that prevent their use. Such environments would also benefit from the ability to safely recover affected workloads and restore cluster capacity (where possible). The poison pill controller uses an alternative mechanism that lets a node in a cluster detect its own health status and take action to remediate itself in case of a failure. While not all remediation events result in the node returning to a healthy state, this approach does allow the surviving parts of the cluster to assume the node has reached a safe state so that its workloads can be automatically recovered. This work can also be useful for clusters with BMC credentials.

Backlog

  1. Peer to peer authentication and encryption
  2. Marking the poison pill pod as a critical pod to avoid eviction
  3. Ask multiple peers concurrently instead of one by one
  4. Create a flow chart to describe the algorithm visually
  5. OLM integration

Blog Post

https://www.openshift.com/blog/kubernetes-self-remediation-aka-poison-pill

Project State

The project is currently in the proof-of-concept (PoC) phase.

Documentation


There is no documentation for this package.

Directories

Path            Synopsis
api/v1alpha1    Package v1alpha1 contains API Schema definitions for the poison-pill v1alpha1 API group +kubebuilder:object:generate=true +groupName=poison-pill.medik8s.io
pkg
