-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[main] Cirrus: Update VM Images for 4.0 release #13055
[main] Cirrus: Update VM Images for 4.0 release #13055
Conversation
6b31cf0
to
f7e2eee
Compare
I'm trying to update our VM images, but for some reason checkpoint test (among others) aren't happy just on F34. @adrianreber any idea what's breaking in the checkpoint tests? Do we need package updates in F34 maybe? |
f7e2eee
to
7b50354
Compare
@adrianreber un-ping. I think the problems may have been caused by premature/unintentional introduction of netavark & aardvark-dns packages on F35 for all tests. |
7b50354
to
0cebeb0
Compare
@adrianreber re-ping 😢 F34 criu is breaking with the latest updates. @edsantiago IIRC you touched these tests recently (disabled/enabled them), no? Do any of the errors in those logs ring any bells? Since this is only affecting F34, I'm thinking it must be due to some package available in F35 (possible from updates-testing) that's necessary. |
Hm. The recent criu errors were due to rawhide kernel new-stuff. The errors I see here are all:
...and they're happening in Link to annotated log, which is infinitely better than the cirrus page. |
Oh okay, so this is something different then.
There are some instances which do fail after criu:
But that sounds like it could be related to the error you pointed out. The packages on both new VM flavors are basically the same Edit: F35 annotated log ref. |
This is f34, so |
Oh right, I completely forgot, thanks!
Grabbing a run from |
Well, that's really interesting because some of the discussion on the RHEL 8.6 gating-test failures is zeroing in on the move from runc 1.0 to 1.1. Are you able to rebuild the VM using runc 1.0, and just sweep 1.1 under the carpet, la la la? |
The broken checkpoint/restore tests seem all to be about checkpointing out of and restoring into another pod. That seems broken with runc 1.1. There is also something wrong with setting up a pod that shares a cgroup namespace.
Seems like this triggers a couple of different errors. |
Overall none of these checkpoint/restore errors seems to be a CRIU problem, but all of them seem to be triggered by changes in Podman and runc as far as I can tell. Strange that CI did not catch it earlier. |
Thanks for all the analysis and PR-work @adrianreber I sincerely appreciate it. I can't tell from your comments, but hopefully these efforts will help ensure stability downstream for runc users. |
0cebeb0
to
eb1f908
Compare
Rebased on #13214 and force-pushed. |
@adrianreber your PR does seem to help, though curiously, not in the "remote" case. Annotated results: |
Yes of course @edsantiago if that's what's needed, but I'm not sure I'd call it a solution - we'd not be testing in an environment representative of what users are using. The "proper" way to handle it for both tests and users, is to (somehow) roll back the version released in Fedora (but which I believe takes weeks 😖). Of course, I am assuming that the issue also affects the F34-stock podman 3.4 - may/not be true. |
3b81078
to
2590deb
Compare
Rebased/force-pushed. |
2590deb
to
f19da99
Compare
f19da99
to
e67e5a6
Compare
Mainly this is to confirm some changes needed for the podman-py CI setup don't disrupt operations here. Ref: containers/automation_images#111 Also includes a minor steup fix WRT setting up for test-rpm build. Signed-off-by: Chris Evich <[email protected]>
Podman 4.0 will never be supported in F34, and the use of F35 in CI is temporary until F36 is brought up to speed. Rather than fight with testing issues that will never be fixed/supported, simply disable it. This commit may be reverted at a future date when F36 VM support is added. Signed-off-by: Chris Evich <[email protected]>
e67e5a6
to
7b55ab4
Compare
This should be ready to go (The v4.0 PR w/ same images already merged). |
/lgtm |
/approve |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: cevich, rhatdan The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
This is to ensure VM images for CI, which contain the
intended dependency versions to support the podman
4.0 release.
Ref: containers/automation_images#114