gpu-scheduling-webhook

command

v0.0.0-...-81dd683 Latest Latest Go to latest Published: Dec 12, 2024 License: Apache-2.0 Imports: 17 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/openshift/ci-tools

README ¶

gpu-scheduling-webhook

Motivation

Our clusters host some nodes that feature an Nvidia GPU. They are expensive to run workload on so by using this mutating webhook we ensure that only the pods requesting a GPU actually run on those nodes, leaving out everything else.

How it works

A node that features an Nvida GPU holds the following taint:

taints:
- effect: NoSchedule
  key: nvidia.com/gpu
  value: "true"

The webhook inspects a pod's container requests, both form the init containers and regular ones, and apply this toleration:

tolerations:
- key: nvidia.com/gpu
  operator: Equal
  value: "true"
  effect: NoSchedule

when it finds either such a request:

requests:
  nvidia.com/gpu: <SOME_VALUE_HERE>

or the following limit:

limits:
  nvidia.com/gpu: <SOME_VALUE_HERE>

Documentation ¶

There is no documentation for this package.

Source Files ¶

View all Source files

main.go

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL