Kubeflow Pipelines v2
Design
bit.ly/kfp-v2 (You need to join kubeflow-discuss google group to get access).
Developing
Developing KFP v2
Prerequisites:
-
Install a KFP standalone instance on Google Cloud:
This does not currently work on other envs, because some tests use GCS & GCS client.
Welcome contributions to make it portable.
-
Install go, python, kfp pypi package, docker.
-
Install ko CLI tool:
make install-ko
-
Configure dev environment by creating a config file called .env
in this folder,
it should have the following content:
export DEV_IMAGE_PREFIX=<an container image registry prefix you own>
For example:
export DEV_IMAGE_PREFIX=gcr.io/ml-pipeline-test/kfp-
Then after images are built, they will be pushed to locations like
gcr.io/ml-pipeline-test/kfp-driver
.
The .env
file is ignored by git, it's your local development configuration.
Verify you can push images to the registry:
# push all built dev images to DEV_IMAGE_PREFIX
make image-dev
set up go environment value
# .env is a Makefile local config (ignored by git)
echo "GOOS_VALUE=$(go env GOOS)" >> .env
echo "GOARCH_VALUE="$(go env GOARCH) >> .env
-
Install sample test python dependencies (require Python 3.7 or 3.8 due to ml-metadata limitation):
cd test
pip install -r requirements.txt
-
Connecting to Kubeflow Pipelines using the SDK client.
Recommend adding the env vars to your .bashrc or .zshrc etc to persist your config.
Verify your configuration and connectivity:
kfp experiment list
Requirements on the KFP backend installation:
- Current limitation, this only works for KFP Standalone, not tested on full Kubeflow yet.
- KFP backend version should be at least 1.7.0-rc.2.
Requirements on the KFP SDK package:
- KFP v2 components defined using
@component
decorator installs KFP SDK package at runtime.
To use a compatible KFP SDK, define the following environment variable before running e2e test:
export KFP_PACKAGE_PATH="git+https://github.com/kubeflow/pipelines.git@master#subdirectory=sdk/python"
Instructions:
-
Run everything e2e: build images, backend compiler, compile pipelines and run them:
make dev
-
Run go unit tests:
make test
# or watch file changes and rerun automatically
make test-watch
-
For individual targets, read the Makefile directly.
-
Run one sample test:
cd "${REPO_ROOT}/v2"
make install-compiler # This needs to be run each time you update compiler code.
cd "${REPO_ROOT}"
python -m samples.path.to.sample_test
Read v2 sample test documentation for more details.
Update licenses
Note, this is currently outdated instructions for v2 compatible mode. We haven't set up licensing workflow for v2 engine.
Download the license tool binary from https://github.com/Bobgy/go-licenses/releases and put it into $PATH.
Update licenses info by:
make license-launcher
or run the following to enable verbose output:
GO_LICENSES_FLAGS=-v4 make license-launcher
After the update, check generated third_party/licenses/launcher.csv file to
make sure licenses of new dependencies are correctly identified.
If something is unexpected, examine the unexpected dependencies by yourself and add
overrides to go-licenses.yaml.
For detailed documentation about the tool: https://github.com/Bobgy/go-licenses/tree/main/v2.