Similar to other integrated github CI/CD services, Cirrus utilizes a simple
YAML-based configuration/description file: .cirrus.yml
. Ref: https://cirrus-ci.org/
All tasks execute in parallel, unless there are conditions or dependencies which alter this behavior. Within each task, each script executes in sequence, so long as any previous script exited successfully. The overall state of each task (pass or fail) is set based on the exit status of the last script to execute.
N/B: Steps below are performed by automation
-
Launch a purpose-built container in Cirrus's community cluster. For container image details, please see the contributors guide.
-
validate
: Perform standardmake validate
source verification, Should run for less than a minute or two. -
lint
: Execute regularmake lint
to check for any code cruft. Should also run for less than a few minutes. -
vendor
: runsmake vendor-in-container
followed by./hack/tree_status.sh
to check whether the git tree is clean. The reasoning for that is to make sure that the vendor.conf, the code and the vendored packages in ./vendor are in sync at all times.
N/B: Steps below are performed by automation
-
Launch a container built from definition in
./contrib/imgts
. -
Update VM Image metadata to help track usage across all automation.
-
Always exits successfully unless there's a major problem.
N/B: Steps below are performed by automation
-
After
gating
passes, spin up one VM permatrix: image_name
item. Once accessible,ssh
into each VM as theroot
user. -
setup_environment.sh
: Configure root's.bash_profile
for all subsequent scripts (each run in a new shell). Any distribution-specific environment variables are also defined here. For example, setting tags/flags to use compiling. -
integration_test.sh
: Execute integration-testing. This is much more involved, and relies on access to external resources like container images and code from other repositories. Total execution time is capped at 2-hours (includes all the above) but this script normally completes in less than an hour.
Confirm that cross-compile of podman-remote functions for both windows
and darwin
targets.
Use the latest Fedora release with the required kernel options pre-set for
exercising cgroups v2 with Podman integration tests. Also depends on
having SPECIALMODE
set to 'cgroupv2`
Modifying the contents of cache-images is tested by making changes to
one or more of the ./contrib/cirrus/packer/*_setup.sh
files. Then
in the PR description, add the magic string: ***CIRRUS: TEST IMAGES***
N/B: Steps below are performed by automation
-
setup_environment.sh
: Same as for other tasks. -
build_vm_images.sh
: Utilize the packer tool to produce new VM images. Create a new VM from each base-image, connect to them withssh
, and perform the steps as defined by the$PACKER_BASE/libpod_images.yml
file:- On a base-image VM, as root, copy the current state of the repository
into
/tmp/libpod
. - Execute distribution-specific scripts to prepare the image for
use. For example,
fedora_setup.sh
. - If successful, shut down each VM and record the names, and dates into a json manifest file.
- Move the manifest file, into a google storage bucket object. This is a retained as a secondary method for tracking/auditing creation of VM images, should it ever be needed.
- On a base-image VM, as root, copy the current state of the repository
into
Only runs following successful test_build_cache_images_task
task. Uses
images following the standard naming format; however, only runs a limited
sub-set of automated tests. Validating newly built images fully, requires
updating .cirrus.yml
.
N/B: Steps below are performed by automation
-
Using the just build VM images, launch VMs and wait for them to boot.
-
Execute the
setup_environment.sh
as in thetesting
task. -
Execute the
integration_test.sh
as in thetesting
task.
Manual Steps: Assuming the automated steps pass, then
you'll find the new image names displayed at the end of the
test_build_cache_images
. For example:
...cut...
[+0747s] ==> Builds finished. The artifacts of successful builds are:
[+0747s] --> ubuntu-18: A disk image was created: ubuntu-18-libpod-5664838702858240
[+0747s] --> fedora-29: A disk image was created: fedora-29-libpod-5664838702858240
[+0747s] --> fedora-30: A disk image was created: fedora-30-libpod-5664838702858240
[+0747s] --> ubuntu-19: A disk image was created: ubuntu-19-libpod-5664838702858240
Notice the suffix on all the image names comes from the env. var. set in
.cirrus.yml: BUILT_IMAGE_SUFFIX: "-${CIRRUS_REPO_NAME}-${CIRRUS_BUILD_ID}"
.
Edit .cirrus.yml
, in the top-level env
section, update the suffix variable
used at runtime to launch VMs for testing:
env:
...cut...
####
#### Cache-image names to test with (double-quotes around names are critical)
###
_BUILT_IMAGE_SUFFIX: "libpod-5664838702858240"
FEDORA_CACHE_IMAGE_NAME: "fedora-30-${_BUILT_IMAGE_SUFFIX}"
PRIOR_FEDORA_CACHE_IMAGE_NAME: "fedora-29-${_BUILT_IMAGE_SUFFIX}"
...cut...
NOTES:
-
If re-using the same PR with new images in
.cirrus.yml
, take care to also update the PR description to remove the magic***CIRRUS: TEST IMAGES***
string. Keeping it and--force
pushing would needlessly cause Cirrus-CI to build and test images again. -
In the future, if you need to review the log from the build that produced the referenced image:
- Note the Build ID from the image name (for example
5664838702858240
). - Go to that build in the Cirrus-CI WebUI, using the build ID in the URL.
(For example
https://cirrus-ci.com/build/5664838702858240
. - Choose the test_build_cache_images task.
- Open the build_vm_images script section.
- Note the Build ID from the image name (for example
Gathers up zip files uploaded by other tasks, from the local Cirrus-CI caching service. Depending on the execution context (a PR or a branch), this task uploads the files found to storage buckets at:
- https://storage.cloud.google.com/libpod-pr-releases
- https://storage.cloud.google.com/libpod-master-releases
Note: Repeated builds from the same PR or branch, will clobber previous archives by design. This is intended so that the "latest" archive is always available at a consistent URL. The precise details regarding a particular build is encoded within the zip-archive comment.
Base-images are VM disk-images specially prepared for executing as GCE VMs. In particular, they run services on startup similar in purpose/function as the standard 'cloud-init' services.
-
The google services are required for full support of ssh-key management and GCE OAuth capabilities. Google provides native images in GCE with services pre-installed, for many platforms. For example, RHEL, CentOS, and Ubuntu.
-
Google does not provide any images for Fedora (as of 5/2019), nor do they provide a base-image prepared to run packer for creating other images in the
test_build_vm_images
Task (above). -
Base images do not need to be produced often, but doing so completely manually would be time-consuming and error-prone. Therefor a special semi-automatic Makefile target is provided to assist with producing all the base-images:
libpod_base_images
To produce new base-images, including an image-builder-image
(used by
the cache_images
Task) some input parameters are required:
-
GCP_PROJECT_ID
: The complete GCP project ID string e.g. foobar-12345 identifying where the images will be stored. -
GOOGLE_APPLICATION_CREDENTIALS
: A JSON file containing credentials for a GCE service account. This can be a service account or end-user credentials -
Optionally, CSV's may be specified to
PACKER_BUILDS
to limit the base-images produced. For example,PACKER_BUILDS=fedora,image-builder-image
.
If there is an existing 'image-builder-image' within GCE, it may be utilized to produce base-images (in addition to cache-images). However it must be created with support for nested-virtualization, and with elevated cloud privileges (to access GCE, from within the GCE VM). For example:
$ alias pgcloud='sudo podman run -it --rm -e AS_ID=$UID
-e AS_USER=$USER -v $HOME:$HOME:z quay.io/cevich/gcloud_centos:latest'
$ URL=https://www.googleapis.com/auth
$ SCOPES=$URL/userinfo.email,$URL/compute,$URL/devstorage.full_control
# The --min-cpu-platform is critical for nested-virt.
$ pgcloud compute instances create $USER-making-images \
--image-family image-builder-image \
--boot-disk-size "200GB" \
--min-cpu-platform "Intel Haswell" \
--machine-type n1-standard-2 \
--scopes $SCOPES
Alternatively, if there is no image-builder-image available yet, a bare-metal
CentOS 7 machine with network access to GCE is required. Software dependencies
can be obtained from the packer/image-builder-image_base_setup.sh
script.
In both cases, the following can be used to setup and build base-images.
$ IP_ADDRESS=1.2.3.4 # EXTERNAL_IP from command output above
$ rsync -av $PWD centos@$IP_ADDRESS:.
$ scp $GOOGLE_APPLICATION_CREDENTIALS centos@$IP_ADDRESS:.
$ ssh centos@$IP_ADDRESS
...
When ready, change to the packer
sub-directory, and build the images:
$ cd libpod/contrib/cirrus/packer
$ make libpod_base_images GCP_PROJECT_ID=<VALUE> \
GOOGLE_APPLICATION_CREDENTIALS=<VALUE> \
PACKER_BUILDS=<OPTIONAL>
Assuming this is successful (hence the semi-automatic part), packer will
produce a packer-manifest.json
output file. This contains the base-image
names suitable for updating in .cirrus.yml
, env
keys *_BASE_IMAGE
.
On failure, it should be possible to determine the problem from the packer
output. Sometimes that means setting PACKER_LOG=1
and troubleshooting
the nested virt calls. It's also possible to observe the (nested) qemu-kvm
console output. Simply set the TTYDEV
parameter, for example:
$ make libpod_base_images ... TTYDEV=$(tty)
...
Some tasks alter their behavior based on this value. A summary of supported values follows:
none
: Operate as normal, this is the default value if unspecified.rootless
: Causes a random, ordinary user account to be created and utilized for testing.in_podman
: Causes testing to occur within a container executed by Podman on the host.cgroupv2
: The kernel on this VM was prepared with options to enable v2 cgroupswindows
: See darwindarwin
: Signals thespecial_testing_cross
task to cross-compile the remote client.