gpu-scheduling-webhook

command
v0.0.0-...-110c471 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Dec 17, 2024 License: Apache-2.0 Imports: 17 Imported by: 0

README

gpu-scheduling-webhook

Motivation

Our clusters host some nodes that feature an Nvidia GPU. They are expensive to run workload on so by using this mutating webhook we ensure that only the pods requesting a GPU actually run on those nodes, leaving out everything else.

How it works

A node that features an Nvida GPU holds the following taint:

taints:
- effect: NoSchedule
  key: nvidia.com/gpu
  value: "true"

The webhook inspects a pod's container requests, both form the init containers and regular ones, and apply this toleration:

tolerations:
- key: nvidia.com/gpu
  operator: Equal
  value: "true"
  effect: NoSchedule

when it finds either such a request:

requests:
  nvidia.com/gpu: <SOME_VALUE_HERE>

or the following limit:

limits:
  nvidia.com/gpu: <SOME_VALUE_HERE>

Documentation

The Go Gopher

There is no documentation for this package.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL