Development environment for AI/ML
What is envd?
envd (ΙͺnΛvdΙͺ
) is a command-line tool that helps you create the container-based development environment for AI/ML.
Creating development environments is not easy, especially with today's complex systems and dependencies. With everything from Python to CUDA, BASH scripts, and Dockerfiles constantly breaking, it can feel like a nightmare - until now!
Instantly get your environment running exactly as you need with a simple declaration of the packages you seek in build.envd and just one command: envd up
!
Why use envd
?
Environments built with envd
provide the following features out-of-the-box:
Simple CLI and language
envd
enables you to quickly and seamlessly integrate powerful CLI tools into your existing Python workflow to provision your programming environment without learning a new language or DSL.
def build():
install.python_packages(name = [
"numpy",
])
shell("zsh")
config.jupyter()
Isolation, compatible with OCI image
With envd
, users can create an isolated space to train, fine-tune, or serve. By utilizing sophisticated virtualization technology as well as other features like buildkit, it's an ideal solution for environment setup.
envd
environment image is compatible with OCI image specification. By leveraging the power of an OCI image, you can make your environment available to anyone and everyone! Make it happen with a container registry like Harbor or Docker Hub.
Local, and cloud
envd
can now be used on a hybrid platform, ranging from local machines to clusters hosted by Kubernetes. Any of these options offers an efficient and versatile way for developers to create their projects!
$ envd context use local
# Run envd environments locally
$ envd up
...
$ envd context use cluster
# Run envd environments in the cluster with the same experience
$ envd up
Check out the doc for more details.
Build anywhere, faster
envd
offers a wealth of advantages, such as remote build and software caching capabilities like pip index caches or apt cache, with the help of buildkit - all designed to make your life easier without ever having to step foot in the code itself!
Reusing previously downloaded packages from the PyPI/APT cache saves time and energy, making builds more efficient. No need to redownload what was already acquired before β a single download is enough for repeat usage!
With Dockerfile v1, users are unable to take advantage of PyPI caching for faster installation speeds - but envd
offers this support and more!
Besides, envd
also supports remote build, which means you can build your environment on a remote machine, such as a cloud server, and then push it to the registry. This is especially useful when you are working on a machine with limited resources, or when you expect a build machine with higher performance.
Knowledge reuse in your team
Forget copy-pasting Dockerfile instructions - use envd to easily build functions and reuse them by importing any Git repositories with the include
function! Craft powerful custom solutions quickly.
envdlib = include("https://github.com/tensorchord/envdlib")
def build():
base(os="ubuntu22.04", language="python")
envdlib.tensorboard(host_port=8888)
envdlib.tensorboard
is defined in github.com/tensorchord/envdlib
def tensorboard(
envd_port=6006,
envd_dir="/home/envd/logs",
host_port=0,
host_dir="/tmp",
):
"""Configure TensorBoard.
Make sure you have permission for `host_dir`
Args:
envd_port (Optional[int]): port used by envd container
envd_dir (Optional[str]): log storage mount path in the envd container
host_port (Optional[int]): port used by the host, if not specified or equals to 0,
envd will randomly choose a free port
host_dir (Optional[str]): log storage mount path in the host
"""
install.python_packages(["tensorboard"])
runtime.mount(host_path=host_dir, envd_path=envd_dir)
runtime.daemon(
commands=[
[
"tensorboard",
"--logdir",
envd_dir,
"--port",
str(envd_port),
"--host",
"0.0.0.0",
],
]
)
runtime.expose(envd_port=envd_port, host_port=host_port, service="tensorboard")
Getting Started π
Requirements
- Docker (20.10.0 or above)
Install and bootstrap envd
envd
can be installed with pip
, or you can download the binary release directly. After the installation, please run envd bootstrap
to bootstrap.
pip install --upgrade envd
After the installation, please run envd bootstrap
to bootstrap:
envd bootstrap
Read the documentation for more alternative installation methods.
You can add --dockerhub-mirror
or -m
flag when running envd bootstrap
, to configure the mirror for docker.io registry:
envd bootstrap --dockerhub-mirror https://docker.mirrors.sjtug.sjtu.edu.cn
Create an envd
environment
Please clone the envd-quick-start
:
git clone https://github.com/tensorchord/envd-quick-start.git
The build manifest build.envd
looks like:
def build():
base(os="ubuntu22.04", language="python3")
# Configure the pip index if needed.
# config.pip_index(url = "https://pypi.tuna.tsinghua.edu.cn/simple")
install.python_packages(name = [
"numpy",
])
shell("zsh")
Note that we use Python here as an example but please check out examples for other languages such as R and Julia here.
Then please run the command below to set up a new environment:
cd envd-quick-start && envd up
$ cd envd-quick-start && envd up
[+] β parse build.envd and download/cache dependencies 2.8s β
(finished)
=> download oh-my-zsh 2.8s
[+] π build envd environment 18.3s (25/25) β
(finished)
=> create apt source dir 0.0s
=> local://cache-dir 0.1s
=> => transferring cache-dir: 5.12MB 0.1s
...
=> pip install numpy 13.0s
=> copy /oh-my-zsh /home/envd/.oh-my-zsh 0.1s
=> mkfile /home/envd/install.sh 0.0s
=> install oh-my-zsh 0.1s
=> mkfile /home/envd/.zshrc 0.0s
=> install shell 0.0s
=> install PyPI packages 0.0s
=> merging all components into one 0.3s
=> => merging 0.3s
=> mkfile /home/envd/.gitconfig 0.0s
=> exporting to oci image format 2.4s
=> => exporting layers 2.0s
=> => exporting manifest sha256:7dbe9494d2a7a39af16d514b997a5a8f08b637f 0.0s
=> => exporting config sha256:1da06b907d53cf8a7312c138c3221e590dedc2717 0.0s
=> => sending tarball 0.4s
envd-quick-start via Py v3.9.13 via π
envd
β¬’ [envd]β― # You are in the container-based environment!
Set up Jupyter notebook
Please edit the build.envd
to enable jupyter notebook:
def build():
base(os="ubuntu22.04", language="python3")
# Configure the pip index if needed.
# config.pip_index(url = "https://pypi.tuna.tsinghua.edu.cn/simple")
install.python_packages(name = [
"numpy",
])
shell("zsh")
config.jupyter()
You can get the endpoint of the running Jupyter notebook via envd envs ls
.
$ envd up --detach
$ envd envs ls
NAME JUPYTER SSH TARGET CONTEXT IMAGE GPU CUDA CUDNN STATUS CONTAINER ID
envd-quick-start http://localhost:42779 envd-quick-start.envd /home/gaocegege/code/envd-quick-start envd-quick-start:dev false <none> <none> Up 54 seconds bd3f6a729e94
Difference between v0 and v1
[!NOTE]
To use the v1
config file, add # syntax=v1
to the first line of your build.envd
file.
Features |
v0 |
v1 |
is default for envd<v1.0 |
β
|
β |
support dev |
β
|
β
|
support CUDA |
β
|
β
|
support serving |
β οΈ |
β
|
support custom base image |
β οΈ |
β
|
support installing multiple languages |
β οΈ |
β
|
support moby builder |
β |
β
(a) |
[!NOTE]
(a) To use the moby
builder, you will need to create a new context with envd context create --name moby-test --builder moby-worker --use
.
For more information about the moby
builder, check the issue-1693.
[!IMPORTANT]
For more details, check the upgrade to v1 doc.
More on documentation π
See envd documentation.
Roadmap ποΈ
Please checkout ROADMAP.
Contribute π
We welcome all kinds of contributions from the open-source community, individuals, and partners.
Contributors β¨
Thanks goes to these wonderful people (emoji key):
This project follows the all-contributors specification. Contributions of any kind welcome!
License π
Apache 2.0