GrootFS: Garden root file system
Note: This repository should be imported as
code.cloudfoundry.org/grootfs
.
by
chattanooga-choochoo
GrootFS is a Cloud Foundry component to satisfy
garden-runc's
requirements for handling container images.
It is currently under development.
You can find us in the #garden Cloud Foundry slack
channel. Use
https://slack.cloudfoundry.org to get an
invitation.
Index
Installation
Because GrootFS depends on Linux kernel features, you can only build it from
or to a Linux machine.
mkdir -p $GOPATH/src/code.cloudfoundry.org
git clone https://github.com/cloudfoundry/grootfs.git $GOPATH/src/code.cloudfoundry.org/grootfs
cd $GOPATH/src/code.cloudfoundry.org/grootfs
git submodule update --init --recursive
make
Using go get code.cloudfoundry.org/grootfs
is discouraged because it might
not work due to our versioned dependencies.
Filesystems
GrootFS makes use of various Linux filesystem features in order to efficiently store
container root filesystems on the host machine.
Currently we support:
- BTRFS (
--driver btrfs
)
- Overlay on XFS (
--driver overlay-xfs
)
GrootFS's 'store' directory must be stored on one of these filesystems. Our setup
script will try to set up both of these filesystems for you so you can experiment
with GrootFS, or you can provision your own and configure GrootFS to point to the
mounted filesystem you create, using the --store
and --driver
command-line flags.
These are documented in the instructions below.
Instructions
Requirements
sudo hack/quick-setup
We assume that:
- You are running a Debian/Ubuntu based system with BTRFS and XFS enabled in the kernel
This will:
- Install the userspace tools required to use GrootFS on either BTRFS or XFS
- Create a BTRFS filesystem and mount it under /var/lib/grootfs/btrfs
- Create an XFS filesystem and mount it under /var/lib/grootfs/xfs
By default all operations will happen in the /var/lib/grootfs
folder. You can
change this by passing the --store
flag to the grootfs
binary. The store folder is
expected to be inside either of the mounted BTRFS or XFS volumes. You will also
need to pass the --driver
option with a value of either btrfs
or overlay-xfs
accordingly.
For user/group id mapping, you'll also require newuidmap
and newgidmap
to be
installed (uidmap package on Ubuntu)
Configuration
GrootFS can optionally be configured with a YAML config file. If both a config file and command
line arguments are provided, the command line arguments will take precedence.
To use a config file:
grootfs --config ./my-config.yml <command> ...
Following is an example configuration file with all options provided:
store_path: /var/vcap/data/grootfs/store
driver: btrfs
btrfs_bin: /var/vcap/packages/btrfs-progs/bin/btrfs
drax_bin: /var/vcap/packages/grootfs/bin/drax
newuidmap_bin: /var/vcap/packages/idmapper/bin/newuidmap
newgidmap_bin: /var/vcap/packages/idmapper/bin/newgidmap
log_level: debug
metron_endpoint: 127.0.0.1:8081
insecure_registries:
- my-docker-registry.example.com:1234
ignore_base_images:
- docker:///ubuntu
- docker://my-docker-registry.example.com:1234/busybox
clean_on_create: true
clean_threshold_bytes: 1048576
uid_mappings:
- "0:4294967294:1"
- "1:1:4294967293"
gid_mappings:
- "0:4294967294:1"
- "1:1:4294967293"
Key |
Description |
store_path |
Path to the store directory |
driver |
Filesystem driver to use <btrfs | overlay-xfs> |
btrfs_bin |
Path to btrfs bin. (If not provided will use $PATH) |
xfsprogs_path |
Path containing the xfsprogs binaries. (If not provided will use $PATH) |
drax_bin |
Path to drax bin. (If not provided will use $PATH) |
newuidmap_bin |
Path to newuidmap bin. (If not provided will use $PATH) |
newgidmap_bin |
Path to newgidmap bin. (If not provided will use $PATH) |
log_level |
Set logging level <debug | info | error | fatal> |
metron_endpoint |
Metron endpoint used to send metrics |
insecure_registries |
Whitelist a private registry |
ignore_base_images |
Images to ignore during cleanup |
clean_on_create |
Clean up unused layers before creating rootfs |
clean_threshold_bytes |
Disk usage of the store directory at which cleanup should trigger |
uid_mappings |
UID mapping for image translation, e.g.: <Namespace UID>:<Host UID>:<Size> |
gid_mappings |
GID mapping for image translation, e.g.: <Namespace GID>:<Host GID>:<Size> |
Creating an image
You can create a rootfs image based on a remote docker image:
grootfs --store /mnt/btrfs create docker:///ubuntu:latest my-image-id
Or from a local tar file as an image source:
grootfs --store /mnt/btrfs create /my-rootfs.tar my-image-id
Output
The output of this command is a rootfs image path
(/mnt/btrfs/images/<uid>/my-image-id
) which has the following structure:
- The
<uid>
is the effective user id running the command.
<Returned directory>
|____ rootfs/
|____ image.json
- The
rootfs
folder is where the root filesystem lives.
- The
image.json
file follows the OCI image
description
schema.
User/Group ID Mapping
You might want to apply some user and group id mappings to the contents of the
rootfs
folder. GrootFS supports the --uid-mapping
and --gid-mapping
arguments. Suppose you are user with UID/GID 1000:
grootfs --store /mnt/btrfs create \
--uid-mapping 0:1000:1 \
--uid-mapping 1:100000:650000 \
--gid-mapping 0:1000:1 \
--gid-mapping 1:100000:650000 \
docker:///ubuntu:latest \
my-image-id
Some important notes:
- If you're not running as root, and you want to use mappings, you'll also need
to map root (
0:--your-user-id:1
)
- Your id mappings can't overlap (e.g. 1:100000:65000 and 100:1000:200)
- You need to have these mappings
allowed in the
/etc/subuid
and /etc/subgid
files
Disk Quotas & Drax
GrootFS supports per-filesystem disk-quotas through the Drax binary. BTRFS
disk-quotas can only be enabled by a root user, therefore Drax must be owned by
root, with the user bit set, and moved somewhere in the $PATH.
make
chown root drax
chmod u+s drax
mv drax /usr/local/bin/
Once Drax is configured, you can apply a quota to the rootfs:
grootfs --store /mnt/btrfs create \
--disk-limit-size-bytes 10485760 \
docker:///ubuntu:latest \
my-image-id
Deleting an image
You can destroy a created rootfs image by calling grootfs delete
with the
image-id:
grootfs --store /mnt/btrfs delete my-image-id
Or the rootfs image path:
grootfs --store /mnt/btrfs delete /mnt/btrfs/images/<uid>/my-image-id
Caveats:
The store is based on the effective user running the command. If the user tries
to delete a rootfs image that does not belong to her/him the command fails.
Logging
By default GrootFS will not emit any logging, you can set the log level with
the --log-level
flag:
grootfs --log-level debug create ...
It also supports redirecting the logs to a log file:
grootfs --log-level debug --log-file /var/log/grootfs.log create ...
Stats
You can get stats from an image by calling grootfs stats
with the
image-id:
grootfs --store /mnt/btrfs stats my-image-id
Or the image path:
grootfs --store /mnt/btrfs delete /mnt/btrfs/images/<uid>/my-image-id
This will result in a JSON object of the following form:
{
"disk_usage": {
"total_bytes_used": 132169728,
"exclusive_bytes_used": 16384
}
}
total_bytes_used
refers to the total space the image takes.
exclusive_bytes_used
is the amount of space the image takes excluding the
base image, i.e.: just the container data.
Clean up
grootfs --store /mnt/btrfs clean
When clean
is called, any layers that aren't being used by a rootfs that
currently exists are deleted from the store*.
For example: Imagine that we create two rootfs images from different base
images, Image A
and Image B
:
- Image A
Layers:
- layer-1
- layer-2
- layer-3
- Image B
Layers:
- layer-1
- layer-4
- layer-5
They have a layer in common, layer-1
. And after deleting Image B
,
layer-4
and layer-5
can be collected by clean
, but not layer-1
because
Image A
still uses that layer.
It is safe to run the command in parallel, it does not interfere with other
creations or deletions.
The clean
command has an optional integer parameter, threshold-bytes
, and
when the store* size is under that clean
is a no-op, it does not remove
anything. On the other hand, if the store* is over the threshold it cleans up
any resource that is not being used. If 0 is provided it will behave the same
way as if the flag wasn't specified, it will clean up everything that's not
being used. If a non integer or negative integer is provided, the command
fails without cleaning up anything.
Caveats:
The store is based on the effective user running the command. If the user tries
to clean up a store that does not belong to her/him the command fails.
* It takes only into account the cache and volumes folders in the store.
Running tests in Concourse
GrootFS uses Concourse for both Continuous Integration
(CI) and testing. One way of running tests locally is with
concourse-lite
, which is a Vagrant box.
Starting Concourse Lite
vagrant box add concourse/lite
vagrant init concourse/lite
vagrant up
Running tests
make concourse-test
Known Issues
Restrictions when running as a non-root user
-
GrootFS cannot use docker:///centos
when run as a non-root user without
UID/GID mappings. CentOS sets the /root
permissions to 0550 (i.e.:
r-xr-x---
) and therefore, when GrootFS runs as a non-root user it cannot
write files into the rootfs /root
. You can work around this by either
running as root or using a UID/GID mapping.
-
You must mount the btrfs volume with -o user_subvol_rm_allowed
or you
won't be able to delete images or clean up after failure scenarios.
-
Files not visible to the calling user in the base image won't be in the
resulting rootfs.
-
The store must have the right permissions or ownership for the calling user
otherwise the command will fail.
-
The calling user can only delete the rootfs that it owns.
-
The calling user can only cleanup the cached volumes that it owns.
-
The calling user can only request stats for the rootfs that it owns.
Misc
- All devices inside a image are ignored.
Links