k8smlaccelerator

module
v0.0.0-...-95aba1e Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Nov 1, 2023 License: Apache-2.0

README

K8sMLAccelerator: Kubernetes Native Accelerator for Big Data Processing in Machine Learning Applications

Why?

Machine Learning (ML)/Deep Learning (DL) training using datasets stored at S3/GCS/Azure can experience rate limiting and suboptimal downloading throughput. K8sMLAccelerator autorewrites training job's Pod spec and reroutes S3/GCS/Azure requests to a local cache, thus bolstering performance and scalability.

How It Works

This is achieved via a mutating webhook that modifies the Pod spec. The webhook deploys host aliases to Pod spec once it identifies the app label in the deployments or jobs. Upon running containers, the S3/GCS/Azure requests are redirected to a proxy's endpoint.

Further instructions, details about setting up the Reverse Proxy Cache Service and a demonstration of the test job can be found in the markdown. It also provides a cleanup guide.

Acknowledgement

Initial implementation of initializer and webhook were inspired by resources available on Kubernetes e2e tests and Kelseyhightower's Kubernetes-initializer-tutorial.

Directories

Path Synopsis
app
pkg

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL