rootless: pods: unlinkat, EBUSY #7139
Comments
Yes, pods are supposed to work.
I have not been able to get this to fail on my laptop with master?
Now I can't either. Worse, I can't even get the reproducer script to fail on 7f38774. I did get
If I remove all the
A friendly reminder that this issue had no activity for 30 days.
Update: this might not be remote-only: this failed today in #7556 on ubuntu-19
I have definitely seen this on non-remote Podman, but it's more of a symptom than a cause. Usually, something blocked Podman from unmounting the container's root filesystem, so we subsequently could not remove said root filesystem. Unfortunately, we only see the second bit, so we don't know what actually started the failure.
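To illustrate the symptom described above (removal failing with EBUSY because something still holds the mount), here is a minimal Python sketch of a removal helper that retries only on EBUSY and re-raises everything else. It is not how Podman or containers/storage handles this; `remove_with_retry`, its parameters, and the retry counts are hypothetical names and values chosen for illustration.

```python
import errno
import shutil
import time


def remove_with_retry(path, attempts=5, delay=0.2, rmtree=shutil.rmtree):
    """Remove a directory tree, retrying only when the failure is EBUSY.

    Hypothetical helper: returns the number of attempts it took.
    Any other OSError, or EBUSY on the final attempt, is re-raised.
    """
    for attempt in range(1, attempts + 1):
        try:
            rmtree(path)
            return attempt
        except OSError as err:
            # EBUSY typically means the directory is still a mount point
            # or otherwise in use; other errors are not worth retrying.
            if err.errno != errno.EBUSY or attempt == attempts:
                raise
            time.sleep(delay)
```

A retry like this only masks the race; as the comment above notes, the interesting failure is whatever blocked the unmount in the first place.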
Another one: PR 7851, ubuntu root
I think this might be the same error, although it's just plain podman-run, no pods: sys podman fedora-31 rootless host
Another one in sys podman fedora-31 rootless host
One in sys podman fedora-32 rootless host (first one I've seen in f32). This and the above are in regular old 'run' tests, nothing to do with
Another one in sys podman fedora-32 rootless host
Another one: sys podman fedora-33 rootless host
A friendly reminder that this issue had no activity for 30 days.
@edsantiago still seeing this?
It is really hard to answer that question, because the problem manifests in many different tests - so I have to examine individual logs to see if this is the cause. Since my last report on Nov 18, I see the following:
FWIW I've tried the reproducer in comment 0, no luck. I do get a lot of these:
...and once in a while one of these:
but in about fifteen minutes of retries, have never seen the
A friendly reminder that this issue had no activity for 30 days.
I can't tell if this is an issue: I've done my Monday-morning pass over the flakes list, and don't see any instances in the last two weeks - but again, the only way to know for sure is to click on and examine every single CI log, and I didn't actually do that - I just did a sample of what I hope was representative.
Ok I will close, and we can reopen if you see another instance.
It's not dead yet: sys: podman start --all - start all containers
Once again, this is a pernicious flake because it manifests in many different tests. I caught this one just now by doing a manual review of recent flakes. It's possible that there are other instances I haven't caught.
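Since the flake shows up across many unrelated tests, a log scan can narrow down which CI logs need a manual look. The following Python sketch assumes the failure appears in logs as `unlinkat <path>: device or resource busy`, which is how Go normally formats an EBUSY path error. The exact text in Podman CI logs may differ, and `find_unlinkat_ebusy` is a hypothetical helper, not part of any existing tooling.

```python
import re

# Assumed message shape (not confirmed against actual CI logs):
#   unlinkat <path>: device or resource busy
EBUSY_UNLINKAT = re.compile(r"unlinkat (\S+): device or resource busy")


def find_unlinkat_ebusy(log_text):
    """Return the paths from every unlinkat/EBUSY message found in log_text."""
    return [match.group(1) for match in EBUSY_UNLINKAT.finditer(log_text)]
```

Run over each downloaded log, a non-empty result flags that log as a candidate instance of this issue regardless of which test it occurred in.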
Indeed, here's another: sys: podman run - basic tests
sys: podman start --all - start all containers
This one looks similar (EBUSY) but the error message is slightly different: Podman pod rm [It] podman pod rm removes a pod with a container
Podman pod prune [It] podman pod prune removes a pod with a stopped container
Please help, this one is getting bad.
Here are a few more from today, but all of them non-pod-related. Should I create a separate issue for the non-pod unlinkat-EPERM flake? sys: podman run - basic tests
sys: podman start --all - start all containers
[NO TESTS NEEDED] This is an attempt to fix a race condition; since it is a race, it is difficult to fix. Helps fix: containers#7139 Signed-off-by: Daniel J Walsh <[email protected]>
@edsantiago, have you seen this flake since commit c9609d8? I wonder if this issue was an early symptom of the recent flake of doom.
No unlinkat/EBUSY flakes since May 27. I'm going to close in hopes that it was fixed by containers/storage#926
Are pods supposed to work with podman-remote? There's a race condition somewhere in rootless pods. (There's one in rootful, too, but I'm having a really hard time getting a reproducer).
In window 1:
$ ./bin/podman system service --timeout=0
In window 2:
This leaves droppings behind, the two directories listed above. I can remove them manually:
FWIW I can't reproduce by running the commands manually in my shell; only by running the script above.
master @ 7f38774, rootless only. f32 with crun