# Vertical Pod Autoscaler

## Intro
Vertical Pod Autoscaler (VPA) frees users from the necessity of setting
up-to-date resource requests for the containers in their pods.
When configured, it sets the requests automatically based on usage, thus
allowing proper scheduling onto nodes so that the appropriate resource amount
is available for each pod.
## For users

### Installation
Prerequisites (to be automated):

- Install Prometheus (otherwise VPA will only have current usage data, no
  history).
- Make sure your cluster supports MutatingAdmissionWebhooks (see here).

`kubectl` should be connected to the cluster you want to install VPA in.
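One quick way to check the MutatingAdmissionWebhooks prerequisite is to look
for the API group that backs the feature (a sketch; the exact group versions
listed depend on your cluster version):

```shell
# The admissionregistration API group must be present for
# MutatingAdmissionWebhooks to be available.
kubectl api-versions | grep admissionregistration
```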
To install VPA, run:

```shell
./hack/vpa-up.sh
```
Note: the script currently depends on the environment variables `$REGISTRY`
and `$TAG`. Make sure you don't set them if you want the released version.
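For example, to be sure the released version is used, you might run (a sketch;
it assumes you don't need these variables elsewhere in your shell session):

```shell
# Unset the variables so the script uses the released
# images rather than custom ones from your own registry.
unset REGISTRY TAG
./hack/vpa-up.sh
```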
The script issues multiple `kubectl` commands to the cluster that insert the
configuration and start all the needed pods (see Architecture below) in the
`kube-system` namespace.
### Quick start
After installation the system is ready to recommend and set resource requests
for your pods.
In order to use it, you need to insert a Vertical Pod Autoscaler resource for
each logical group of pods that have similar resource requirements.
We recommend inserting one VPA per Deployment you want to control
automatically, using the same label selector as the Deployment uses.
There are three modes in which VPAs operate:

- `"Auto"`: VPA assigns resource requests on Pod creation as well as updates
  them on running Pods (only if they differ significantly from the new
  recommendation and only within the Eviction API limits).
- `"Initial"`: VPA only assigns resource requests on Pod creation and never
  changes them later.
- `"Off"`: VPA does not automatically change the resource requirements of the
  pods. The recommendations are calculated and can be inspected in the VPA
  object.
#### Example VPA configuration

```yaml
apiVersion: poc.autoscaling.k8s.io/v1alpha1
kind: VerticalPodAutoscaler
metadata:
  name: my-app-vpa
spec:
  selector:
    matchLabels:
      app: my-app
  updatePolicy:
    updateMode: "Auto"
```
### Known limitations of the alpha version
- The VPA admission controller is an admission webhook. The order of admission
  controllers is defined by a flag on the API server.
  The VPA admission controller may interact with other admission controllers;
  e.g. running it after the quota admission controller might cause it to make
  incorrect decisions.
- Out-of-memory events / pod evictions are not yet taken into account for memory
  usage data. Containers dying because of lack of memory might not get bigger
  recommendations.
- The recommender reads some amount of history (currently eight days) and treats
  all samples from that period identically, no matter how recent they are. Also,
  it does not forget CPU samples after they go out of the one-day window, so the
  CPU history length will grow during the lifetime of the recommender binary.
- A VPA recommendation might exceed available resources (e.g. Node size,
  available quota) and cause Pods to go pending.
- Multiple VPA resources matching the same Pod have undefined behavior.
## For developers

### Architecture
The system consists of three separate binaries: the recommender, the updater
and the admission controller.
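Assuming the default installation into the `kube-system` namespace described
above, you could check that all three components are up with something like
this (a sketch; the exact pod name prefixes are an assumption):

```shell
# List the VPA component pods; the names are assumed to contain
# "vpa" plus the component name (recommender, updater,
# admission-controller).
kubectl get pods -n kube-system | grep vpa
```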
### How to plug in a modified recommender
First, make any changes you like in the recommender code.
Then, build it with:

```shell
make --directory recommender build docker
```

Remember that the command puts your built Docker image into your GCR registry
and tags it using the environment variables `$REGISTRY` (e.g. `gcr.io/my-project`)
and `$TAG` (e.g. `my-latest-release`).
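Putting it together, a full build might look like this (a sketch reusing the
example registry and tag from above):

```shell
# Registry and tag for the image built below (example values).
export REGISTRY=gcr.io/my-project
export TAG=my-latest-release

# Build the recommender binary and its Docker image.
make --directory recommender build docker
```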
To deploy that version, follow the installation steps above.
If you already had VPA installed, you can run:

```shell
./hack/vpa-down.sh recommender
./hack/vpa-up.sh recommender
```

to only recreate the recommender deployment and keep the rest of the VPA system
as it was.
### How to modify other components

The updater and the admission controller can be modified, built and deployed
similarly to the recommender, as sketched below.
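For example, by analogy with the recommender workflow (a sketch; it assumes the
other components live in `updater` and `admission-controller` directories with
the same Makefile targets, and that the scripts accept those component names):

```shell
# Build the updater and its Docker image, then recreate
# only the updater deployment.
make --directory updater build docker
./hack/vpa-down.sh updater
./hack/vpa-up.sh updater
```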