Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error: OCI runtime error: error creating systemd unit libpod-sha.scope: got failed #10442

Closed
edsantiago opened this issue May 24, 2021 · 18 comments
Labels
flakes Flakes from Continuous Integration locked - please file new issue/PR Assist humans wanting to comment on an old issue or PR with locked comments. rootless

Comments

@edsantiago
Copy link
Member

New flake:

# $ podman run --name ignore-me --name my-container quay.io/libpod/testimage:20210427 true
# Error: OCI runtime error: error creating systemd unit `libpod-1b6bc93400c0c619d576f5efbcf411c9a4b09be70424e2081c8726b76424cfe6.scope`: got `failed`
# [ rc=126 (** EXPECTED 0 **) ]

Happens on the first podman run in system tests. Once it happens, all future tests will fail (i.e. this destroys the system). Two instances in the last week, both ubuntu-2104 rootless:

@edsantiago edsantiago added flakes Flakes from Continuous Integration rootless labels May 24, 2021
@github-actions
Copy link

A friendly reminder that this issue had no activity for 30 days.

@rhatdan
Copy link
Member

rhatdan commented Jun 24, 2021

@edsantiago what is the latest on this flake?

@edsantiago
Copy link
Member Author

Seen June 23 in int podman ubuntu-2104 rootless host. (Prior two failures were in sys tests, this one is int; all so far are rootless ubuntu 2104). As with the others, once it triggers, all subsequent tests fail.

@edsantiago
Copy link
Member Author

Also seen June 8 in sys podman ubuntu-2104 rootless host.

@edsantiago
Copy link
Member Author

Seen just now, again in sys podman ubuntu-2104 rootless

@github-actions
Copy link

A friendly reminder that this issue had no activity for 30 days.

@edsantiago
Copy link
Member Author

August 10, 23, 23 and 24. All of them ubuntu-2104

@edsantiago
Copy link
Member Author

And, speak of the devil, another one right now

@vrothberg
Copy link
Member

vrothberg commented Sep 23, 2021

That looks like a runc issue to me (see https://github.com/opencontainers/runc/blob/master/libcontainer/cgroups/systemd/common.go#L354).

@kolyshkin , do you have a suspicion what's going on?

@github-actions
Copy link

A friendly reminder that this issue had no activity for 30 days.

@rhatdan
Copy link
Member

rhatdan commented Oct 25, 2021

@AkihiroSuda @kolyshkin Thoughts?

@kolyshkin
Copy link
Contributor

This means that runc asked systemd to create a unit, and systemd was unable to do that. Alas it only returns a single word ("failed"), I guess systemd logs might have some more details.

Now, I have never seen anything like this myself, and have no idea what could be the reason. If there is a way to repro, or get systemd logs after such failure, I'll be happy to take a look.

If I remember correctly, older runc versions used to ignore the failure from systemd, which resulted in errors later (upon changing unit properties or unit removal). This was fixed so that runc would error out earlier, which in theory is easier to debug.

Sorry for not being very helpful -- someone who's more familiar with system might have more ideas.

@edsantiago
Copy link
Member Author

Since my last comment here the flake has appeared twice: Sept 9 and Sept 10, both on ubuntu-2104 rootless. No other instances since then (at least that my flake logger sees).

I realize this means that someone will close this issue. I'm semi-ok with that, since there's really nothing useful to go on; I just want the closing to be done intentionally and with awareness by someone other than me.

@rhatdan
Copy link
Member

rhatdan commented Oct 26, 2021

Ok, I will do the honors.

@rhatdan rhatdan closed this as completed Oct 26, 2021
@edsantiago
Copy link
Member Author

Reported by @baude today in planning meeting: ubuntu 2110 rootless

@kolyshkin
Copy link
Contributor

@edsantiago is there a way to see system journal from that system? It might shed some light on what happened.

Again, this is systemd being unable to create a unit; all it returns back to runc is failed. Unless I have logs or a repro I can't do anything about it.

@edsantiago
Copy link
Member Author

@kolyshin I think so: this?

More generally: from any log page, scroll to the top, click the Task link (not the Build one), and that gives you all the Cirrus details. The link I included here is to the journal section.

@kolyshkin
Copy link
Contributor

OK this is actually crun, not runc. Otherwise all I said above is still true.

Error: OCI runtime error: crun: error creating systemd unit libpod-394f64f70cd61df9eeab89d8a24a7c5e8384536c3394c5d3efa08f82da7f3778.scope: got failed

@github-actions github-actions bot added the locked - please file new issue/PR Assist humans wanting to comment on an old issue or PR with locked comments. label Sep 21, 2023
@github-actions github-actions bot locked as resolved and limited conversation to collaborators Sep 21, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
flakes Flakes from Continuous Integration locked - please file new issue/PR Assist humans wanting to comment on an old issue or PR with locked comments. rootless
Projects
None yet
Development

No branches or pull requests

4 participants