Build multi-arch rootless-cni-infra on GitHub Actions #8415

Conversation

@AkihiroSuda (Collaborator) commented Nov 19, 2020

The image is built as a multi-arch OCI tar.gz and uploaded as a GitHub Artifact.

The image is NOT pushed to quay automatically.
A maintainer can use skopeo to push the artifact archive to quay. (EDIT: might not be true for multi-arch...)

Relates to #8411
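
For reference, a rough sketch of the kind of buildx invocation that produces such an archive, pieced together from the workflow fragments quoted in the review below; the QEMU setup lines and output path are illustrative, not necessarily the exact steps in this PR:

```bash
# One-time setup on the runner: register QEMU emulators and a buildx builder.
docker run --privileged --rm tonistiigi/binfmt --install all
docker buildx create --use

# Build for all target architectures and emit a single multi-arch OCI archive.
docker buildx build \
  --platform amd64,arm,arm64,s390x,ppc64le \
  -f contrib/rootless-cni-infra/Containerfile \
  --output type=oci,dest=rootless-cni-infra.tar \
  contrib/rootless-cni-infra
gzip -9 rootless-cni-infra.tar   # the .tar.gz is then uploaded as a GitHub Artifact
```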

@openshift-ci-robot (Collaborator)

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: AkihiroSuda
To complete the pull request process, please assign giuseppe after the PR has been reviewed.
You can assign the PR to them by writing /assign @giuseppe in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@AkihiroSuda force-pushed the upload-rootless-cni-infra-multiarch branch from be0b7fe to b3ab81d on November 19, 2020 21:19
@AkihiroSuda (Collaborator, Author) commented Nov 19, 2020

A maintainer can use skopeo to push the artifact archive to quay.

On second thought, this might not be true for multi-arch?
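
For context, the manual push being discussed would look roughly like the following; whether skopeo's oci-archive transport carries the whole manifest list (rather than a single architecture) is exactly the open question here, and the repository/tag are illustrative:

```bash
# Hypothetical manual push of the downloaded artifact by a maintainer.
gunzip rootless-cni-infra.tar.gz
skopeo copy --all \
  oci-archive:rootless-cni-infra.tar \
  docker://quay.io/libpod/rootless-cni-infra:latest
# --all asks skopeo to copy every image referenced by a manifest list
# instead of picking one architecture.
```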

@rhatdan (Member) commented Nov 19, 2020

@cevich @baude @mheon PTAL

The image is built as a multi-arch OCI tar.gz and uploaded as a GitHub Artifact.

The image is NOT pushed to quay automatically.
A maintainer can use `skopeo` to push the artifact archive to quay.

Signed-off-by: Akihiro Suda <[email protected]>
@AkihiroSuda force-pushed the upload-rootless-cni-infra-multiarch branch from b3ab81d to fc34be7 on November 19, 2020 21:22
@AkihiroSuda (Collaborator, Author)

Can Red Hat configure GitHub Actions to allow pushing to quay?

@TomSweeneyRedHat (Member)

@barthy1 would you mind taking a peek at this?
@AkihiroSuda I think you're looking for something like what @cevich did in Skopeo containers/skopeo#1104

@TomSweeneyRedHat (Member)

And FWIW, here's the PR that @barthy1 put together for Skopeo: containers/skopeo#1066. I've not looked at this PR closely enough to see whether it uses a similar technique, but I would like the resulting images to be similar.

@barthy1 commented Nov 20, 2020

Hi @AkihiroSuda, have you considered using GitHub Actions to push the multi-arch image to quay.io/libpod/rootless-cni-infra instead of generating an archive? Of course, this should be triggered not by pull_request but by a push to master, I guess?

@AkihiroSuda (Collaborator, Author)

That would be better. It needs assistance from Red Hat to add a quay token to GitHub Actions.

@barthy1 commented Nov 20, 2020

@TomSweeneyRedHat in this PR the approach is GitHub Actions + emulation with docker buildx. For Skopeo it was Travis + native hardware, as the emulation approach failed to build Skopeo.

If the emulation approach works for all the architectures listed in this PR, I guess it's the best way to build the multi-arch image. To push to quay.io/libpod, @AkihiroSuda needs your (or @cevich's) assistance to provide push credentials for GitHub Actions; see https://docs.github.com/en/free-pro-team@latest/actions/reference/encrypted-secrets#creating-encrypted-secrets-for-a-repository (a sketch of such a push step follows below).

The only disadvantage I can see for future extensions: if some podman image needs to be built on native hardware (as it was for Skopeo), then you will need to support two different build environments, GitHub Actions and Travis.
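
A minimal sketch of what such a push step might run once credentials exist, assuming hypothetical QUAY_USERNAME/QUAY_PASSWORD repository secrets exposed to the job as environment variables, and a push-to-master trigger as suggested above:

```bash
# Log in with credentials injected from encrypted repository secrets,
# then build and push the multi-arch image directly instead of archiving it.
echo "${QUAY_PASSWORD}" | docker login quay.io -u "${QUAY_USERNAME}" --password-stdin
docker buildx build \
  --platform amd64,arm,arm64,s390x,ppc64le \
  -f contrib/rootless-cni-infra/Containerfile \
  --push -t quay.io/libpod/rootless-cni-infra:latest \
  contrib/rootless-cni-infra
```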

@AkihiroSuda (Collaborator, Author)

Can we get a separate quay account for GitHub Actions? Also, I think the account should be maintained by the maintainers rather than by me, so that it can potentially be reused for other images in the future.

@mheon (Member) commented Nov 20, 2020

This sounds like it's going to require collaboration from @cevich to get the GitHub Actions piece working and from @TomSweeneyRedHat, who maintains the Quay account, last I checked.

@TomSweeneyRedHat (Member)

@barthy1 @AkihiroSuda I'm trying to get my head around this. So this looks to produce the same images that @barthy1 did in Skopeo in the attached PR, but they're in a tar file that needs to be untarred and then the images pushed to quay.io via a GitHub action? Currently, in Skopeo there are v-1.2.0-* tags where the '*' is amd64, ppc64le, and s390x. It looks like this would create arm and arm64 variants too. I don't see a version in the change; would this change add the Podman version number too, or is that work for the GitHub action?

We don't currently have travis here in this repo, but if we did, would it be better to build these the way @barthy1 did using Travis?

I'm sure @cevich would be happy to add the GitHub actions, but I believe he's enjoying PTO at the moment and probably won't be back until early next week.

@AkihiroSuda and @barthy1 ty both very much for all your help on this in both projects.

@AkihiroSuda (Collaborator, Author)

but they're in a tar file that needs to be untarred and then the images pushed to quay.io via a GitHub action

It is built as a tar file just because I don't have a registry token in CI to push the image to quay.

@TomSweeneyRedHat (Member)

@AkihiroSuda thanks for the info. @cevich is away until next week, let's touch base on this again then.

@cevich (Member) commented Dec 1, 2020

I'm back, and happy to create/add the needed credentials. I'll take a peek at this PR later today.

@cevich (Member) commented Dec 1, 2020

I took a look, and am concerned on several fronts.

  • It seems dangerous/failure-prone to rely on github-actions and action-steps which pull from docker hub. Since this is running for every single PR, I'm afraid we'll rapidly run into the new rate-limits.
  • It depends heavily/entirely on docker-centric third-party tooling which the podman maintainers are unlikely to ever embrace. This creates a conflict of interest where the multi-arch images could be "held hostage" by the very technology podman is attempting to disrupt.

The second point is the real kicker because `runs-on: ubuntu-20.04` will not be around forever. At some point we will have to update the CI host and (likely) all tooling. If there is a conflict anywhere in the ubuntu -> docker -> buildx -> QEMU space, then production of new podman images halts...at least until (worst-case) a podman maintainer (starting from scratch) learns and debugs the entire workflow enough to apply the necessary fixes.

While I'm not a big fan of Travis-CI...the approach @barthy1 used for skopeo is free from dependence on intermediate tooling and simple enough that lay-people can understand it from start to finish. Since native hardware is utilized, it only requires a few generic docker operations (which could easily be replicated with a future podman) and a simple(ish) Makefile (the general shape is sketched after this comment).

That all said, I'm sensitive to and appreciate the amount of work that's already been committed here. Clearly it works as designed, and it just needs some secrets added (which we're happy to create/provide). On the other hand, and mindful of the "sunk-cost fallacy", I wonder if the greater good might be best served by re-implementing this to more closely resemble the skopeo setup.

I will support the decision of the group.
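
For comparison, the skopeo-style flow boils down to roughly one build-and-push per native-hardware builder, optionally followed by a manifest-list step; the repository, tags, and arch list here are illustrative rather than copied from the skopeo setup:

```bash
# Runs once per native-hardware builder (amd64, ppc64le, s390x, ...).
ARCH=$(uname -m)
docker build -t "quay.io/libpod/rootless-cni-infra:latest-${ARCH}" \
  -f contrib/rootless-cni-infra/Containerfile contrib/rootless-cni-infra
docker push "quay.io/libpod/rootless-cni-infra:latest-${ARCH}"

# Optional final step after all per-arch jobs: stitch the tags into one list.
docker manifest create quay.io/libpod/rootless-cni-infra:latest \
  quay.io/libpod/rootless-cni-infra:latest-x86_64 \
  quay.io/libpod/rootless-cni-infra:latest-ppc64le \
  quay.io/libpod/rootless-cni-infra:latest-s390x
docker manifest push quay.io/libpod/rootless-cni-infra:latest
```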

@AkihiroSuda (Collaborator, Author)

It depends heavily/entirely on docker-centric third-party tooling which the podman maintainers are unlikely to ever embrace. This creates a conflict of interest where the multi-arch images could be "held hostage" by the very technology podman is attempting to disrupt.

What if we use buildctl directly instead of docker buildx CLI?
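
A rough idea of what driving BuildKit directly might look like, assuming a running buildkitd with emulation available; the flag values mirror the buildx invocation above and are illustrative:

```bash
# Same multi-arch OCI archive, produced by talking to buildkitd directly
# instead of going through the docker CLI.
buildctl build \
  --frontend dockerfile.v0 \
  --local context=contrib/rootless-cni-infra \
  --local dockerfile=contrib/rootless-cni-infra \
  --opt filename=Containerfile \
  --opt platform=linux/amd64,linux/arm,linux/arm64,linux/s390x,linux/ppc64le \
  --output type=oci,dest=rootless-cni-infra.tar
```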

@cevich (Member) commented Dec 2, 2020

What if we use buildctl directly instead of docker buildx CLI?

That's (also) not something I'm familiar with. Poking around on google, it seems to be the nuts-and-bolts behind buildkit?

I thought about this situation last night, and believe my main concern is over long-term maintainability. My other argument over tooling/dependency hegemony seems a lesser concern. The root of this is that there's a HUGE disparity in the number of people working on podman and those working on CI/CD infra. and tooling. I'm not above relying on the community to help with maintenance; that's a positive force in my book. Rather, we need to accept the historically-likely possibility that if/when these workflows break for any reason, and nobody is around with the skills to fix them quickly, they may simply be disabled or deleted 😞

One idea that could help: document the heck out of it. If a future podman developer (not an automation guru) can read a doc and upgrade/update/replace components of the image-build workflow w/o trolling around the internet much, that would be an acceptable compromise. Perhaps that would be less work than re-implementing this to use the Travis + native-HW approach used by skopeo?

What are @rhatdan and @TomSweeneyRedHat and @mheon opinions here? Am I being overly paranoid, and should go back into my little corner to snuggle my bash scripts?

A review comment from a Member on these workflow lines:

```
--platform amd64,arm,arm64,s390x,ppc64le \
-f contrib/rootless-cni-infra/Containerfile \
contrib/rootless-cni-infra
gzip -9 rootless-cni-infra.tar
```

I'm not sure the compression is worthwhile here, actions/upload-artifact will create a compressed zip-file automatically.

Another review comment from a Member, on these workflow lines:

```
    gzip -9 rootless-cni-infra.tar
- name: "Print SHA256SUM of rootless-cni-infra.tar.gz"
  run: |
    sha256sum rootless-cni-infra.tar.gz
```


@mheon (Member) commented Dec 2, 2020

I don't know where buildah buildx is on our priorities list, but I have to imagine it is on there somewhere (@nalind and @rhatdan would know better). If so, we will have a drop-in replacement for the existing docker buildx solution at some eventual point in the future and can switch to it then. Given that, and given our urgent need to get this image distributed for non-AMD64 platforms (this is a very exciting feature, IMO - full networking without root is extremely useful - and it depends on this image), I think it's worthwhile to go forward. Though, as Chris said, documentation is always a good thing.

A review comment from a Member on these workflow lines:

```
    path: rootless-cni-infra.tar.gz
- name: "Notice"
  run: |
    echo "The image is NOT pushed to quay. To push the image to quay, run skopeo manually with the artifact archive."
```

Were you thinking of implementing the push to quay, or is that something I could/should do as a followup PR?

I'm okay either way, but since I need to create the accounts and add the secrets, it might be easier if done as a followup.

@AkihiroSuda (Collaborator, Author) commented Dec 2, 2020

Another option I'm thinking of is embedding the Containerfile into the podman binary and letting podman build it on first run. This should work on any arch, and we would no longer need to maintain the quay image.

Yet another option is to build the image in the Makefile (using the podman/buildah present on the build host) and embed its OCI archive into the podman binary (or just place the OCI archive at /usr/share/podman/rootless-cni-infra.tgz). This complicates the Makefile but simplifies runtime. Also, this option is beneficial for internet-less deployment (a sketch follows below).

RFC.
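
A sketch of what the second option could look like as build-host steps; the recipe shape and install path follow the comment above, while the target names and use of DESTDIR are ordinary packaging conventions rather than anything decided here:

```bash
# Build with whatever podman is on the build host, save as an OCI archive,
# and ship the archive alongside the podman package.
podman build -t rootless-cni-infra \
  -f contrib/rootless-cni-infra/Containerfile contrib/rootless-cni-infra
podman save --format oci-archive -o rootless-cni-infra.tar rootless-cni-infra
gzip -9 rootless-cni-infra.tar
install -D -m 0644 rootless-cni-infra.tar.gz \
  "${DESTDIR}/usr/share/podman/rootless-cni-infra.tgz"
```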

@mheon (Member) commented Dec 2, 2020

I like this - it resolves the distribution problem completely.

@AkihiroSuda (Collaborator, Author)

I like this - it resolves the distribution problem completely.

Sorry, which is "this"?

@mheon (Member) commented Dec 2, 2020

Your latest suggestion, building the image automatically on first run.

@cevich (Member) commented Dec 2, 2020

we will have a drop-in replacement for the existing docker buildx solution

Oh! Wasn't aware of that. Yes, that negates my buildx + docker conflict-of-interest argument completely.

Your latest suggestion, building the image automatically on first run.

I like this idea also, as we're all familiar with Makefiles and it sounds like only basic tooling would be required. It also means writing less documentation 😁

@cevich (Member) commented Dec 2, 2020

Also, this option is beneficial for internetless deployment.

Based on my experience working the RHEL-support front lines: that benefit should not be understated. It's a very critical/important benefit for a great number of often non-advertised uses. It's also "one less thing to flake/break" should there be a networking failure at an inopportune time.

@TomSweeneyRedHat (Member)

Maybe I'm missing the point of the embedded Containerfile in the Podman binary, but it sounds like the end-user would have to install Podman first, then know how to make the image for their arch using the Podman binary? From what I've seen, most people just want to grab an image from some "official" location on quay.io and go from there.

I love @barthy1's work on Skopeo, but we no longer have travis here in Buildah or Podman. I'm not sure how easily that work would translate here without reviving travis, and I'd rather not do that. I really don't have a leaning towards any of the approaches discussed here, as long as it's documented so even I could understand it, and we end up with an image for each of the arches that @AkihiroSuda created in this PR, automatically pushed out to quay.io.

Then whatever we do here, I'd like to also do on the Buildah project. If it makes sense to do the same kind of work on the Skopeo project, I'd be OK with replacing the work @barthy1 has provided there, but I'd prefer just adding to @barthy1's work on Skopeo the one or two arches that are here in this work but not yet in Skopeo.

Hopefully, my thoughts here are somewhat on track, I've been completely buried in emails/PR's this week and it's all blurring at this point.

@cevich (Member) commented Dec 2, 2020

@TomSweeneyRedHat you're not far off. Yes, I'm not too happy about reviving Travis, but it is possible.

embedded Containerfile

No...as I understand it, we would be embedding the multi-arch image (the tarball) into the release. Nothing gets built at package install time; at most the image gets podman-loaded (if that).

From what I've seen, most people just want to grab an image from some "official" location on quay.io and go from there.

Yes, and this is the tail end of my multi-dependency + complexity fear: users don't want some months-old image; they assume it's "recent". But if the automation is broken and nobody knows how to fix it, nothing newer will get pushed to quay 😖 It becomes 100% manual building and pushing at that point 😞

@mheon (Member) commented Dec 2, 2020

I'm going to bring this up for discussion with the team at scrum tomorrow, because we really need a plan here. We are preparing to ship a release of Podman including support for this in RHEL, which means we need a plan for distributing this image via RHEL channels, so this needs to be solved sooner rather than later.

@AkihiroSuda marked this pull request as draft on December 3, 2020 07:50
@openshift-ci-robot added the do-not-merge/work-in-progress label (indicates that a PR should not merge because it is a work in progress) on Dec 3, 2020
@mheon (Member) commented Dec 4, 2020

We deferred some of the discussion to Monday, but of the options we looked at, the most promising seems to be moving builds to OBS and then pushing to Quay from there. OBS has builders for all major architectures and the capability to autobuild on new commits (though this may require us to move the Containerfile into its own repository so builds are only triggered by changes to the image itself), and we're already very familiar with it, as we use it to distribute the latest version of Podman for non-Red Hat OSes.

@AkihiroSuda (Collaborator, Author)

My current opinion is to just place the OCI archive at /usr/share/podman/rootless-cni-infra.tgz.

It doesn't need a multi-arch builder, and it doesn't need the Internet either.
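
Consuming the shipped archive at runtime would then be a purely local operation, roughly (the path is as proposed above; the load/verify commands are illustrative):

```bash
# First use on any arch: load the locally shipped archive, no network required.
podman load -i /usr/share/podman/rootless-cni-infra.tgz
podman images rootless-cni-infra   # confirm the image is now available locally
```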

@rhatdan (Member) commented Dec 4, 2020

I like that idea.

@rhatdan (Member) commented Dec 7, 2020

After some discussion, I believe the best solution would be to run the container off of the host OS. We should not need an image at all - just use the host's CNI plugins and friends. (We should do this for the pause container as well.)

Someone needs to work out the logistics of this, but theoretically we should be able to do something like:

`podman run --rootfs / echo hello`

Doing this from rootless mode would allow us to do everything your container image requires, without needing to download or load an image and use up extra disk space (or add the image to the podman package).

@AkihiroSuda (Collaborator, Author)

I agree that `--rootfs /` is the correct solution, but the implementation may take a long time.

I think we need a short-term solution as well (i.e. a local OCI archive under /usr/share/podman).

@rhatdan (Member) commented Dec 7, 2020

Yes, I think we should ship a Containerfile that could be used to build the image, and allow pulling from a container registry as well. But long term, we need to add support for OS-based container images.

@AkihiroSuda (Collaborator, Author)

I agree that `--rootfs /` is the correct solution, but the implementation may take a long time.

This might be easy: `runc spec; mount --rbind / ./rootfs; runc run foo` inside a userns works without extra work.
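
Expanded slightly, that sequence might look like the following; the bundle path and container name are illustrative, and it assumes (per the claim above) that the default spec works unmodified inside the user namespace:

```bash
# Enter a user + mount namespace, then run a container whose rootfs is just
# a recursive bind of the host filesystem -- no image involved.
unshare --user --map-root-user --mount bash -c '
  mkdir -p /tmp/bundle/rootfs && cd /tmp/bundle
  runc spec                # generate a default config.json (rootfs dir: ./rootfs)
  mount --rbind / rootfs   # reuse the host filesystem as the container rootfs
  runc run foo             # "foo" is an arbitrary container name
'
```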

@AkihiroSuda (Collaborator, Author)

Opened a proposal for imageless rootless-cni-infra: #8709

@rhatdan (Member) commented Dec 15, 2020

Very nice.

@github-actions bot added the locked - please file new issue/PR label (assist humans wanting to comment on an old issue or PR with locked comments) on Sep 23, 2023
@github-actions bot locked as resolved and limited conversation to collaborators on Sep 23, 2023