race: 'podman stop' does not always remove all podman mounts #5747

Closed · Opened Apr 7, 2020 by edsantiago · 6 comments · Fixed by #6483

Labels: locked - please file new issue/PR, stale-issue

@edsantiago (Member) commented Apr 7, 2020

One of the e2e tests, podman list running container in test/e2e/mount_test.go, occasionally flakes in CI. Basically, podman mount finds an active mountpoint even after the container has been stopped with podman stop. Rerunning podman mount a second later finds no mounts, so it's almost certainly a race: some cleanup isn't happening in time.

If this race condition is OK, that is, if it doesn't matter whether podman mount shows a dead mountpoint from a stopped container, then the e2e test must be fixed.

If this race condition is not OK, that is, if podman stop should guarantee that there are no mount points when it exits, then podman stop must be fixed.

Reproducer:

#!/bin/bash
#
# Reproducer: run a container, verify it shows up in 'podman mount',
# stop it, then immediately check whether the mount is gone. Loops
# until the race triggers.

set -e

T0=$SECONDS

while :; do
    cid=$(podman run -dt docker.io/library/alpine:latest top)
    # While running, the container must appear in the mount list.
    podman mount --notruncate | grep -q "$cid"
    podman stop "$cid" > /dev/null
    # Once stopped, it should be gone from the mount list, but
    # occasionally it isn't:
    m2=$(podman mount --notruncate)
    if [[ "$m2" =~ $cid ]]; then
        echo "FOO! Still mounted!"
        echo "$m2"
        echo "time = $(( $SECONDS - $T0 )) seconds"
        sleep 1
        echo
        echo "after sleep 1:"
        podman mount
        exit 1
    fi
    podman rm "$cid" >/dev/null
done

Sample run:

# /tmp/mtest
FOO! Still mounted!
2b473b127c369cd11da3c775780ea2e693e17511534c1e296980a35443d14a70 /var/lib/containers/storage/overlay/5798db64ca882ed888a403a5398d1bb5d020c8c0bab719ba67a71b13673b4773/merged
time = 96 seconds

after sleep 1:

The CI failure is on f30; I can reproduce with podman-1.8.0-4.fc30.

Problem still present in rawhide: podman-1.8.3-0.75.dev.gitf7dffed.fc33

@rhatdan (Member) commented Apr 7, 2020

Well, if the container is running, podman stop will stop the container; then conmon realizes that the container has stopped and execs podman container cleanup to clean up the container. This is where the race happens.
We could make podman stop wait for the container to enter the stopped state or disappear, but it could wait a very long time.
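
A minimal sketch of that bounded wait in Go (containerCleanedUp and waitForCleanup are hypothetical names for illustration, not libpod's actual API):

// Sketch of a bounded wait for cleanup; not libpod's real code.
package main

import (
	"errors"
	"fmt"
	"time"
)

// containerCleanedUp stands in for a real state query, e.g. "is the
// container's storage still mounted?"
func containerCleanedUp(id string) bool {
	return false // placeholder: always "not yet cleaned up"
}

// waitForCleanup polls until cleanup finishes or the timeout expires.
// The catch, as noted above: there is no good upper bound, so whatever
// timeout we pick is how long `podman stop` might hang.
func waitForCleanup(id string, timeout time.Duration) error {
	deadline := time.Now().Add(timeout)
	for time.Now().Before(deadline) {
		if containerCleanedUp(id) {
			return nil
		}
		time.Sleep(100 * time.Millisecond)
	}
	return errors.New("timed out waiting for container cleanup")
}

func main() {
	if err := waitForCleanup("2b473b127c36", 2*time.Second); err != nil {
		fmt.Println(err)
	}
}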

@mheon (Member) commented Apr 7, 2020

Hmmmm. I think that, right now, podman stop does not provide a guarantee that a container has been cleaned up immediately after exit (we instead wait for the cleanup process to do it). I'm not sure if this is desirable, though. If we want to add such a guarantee, it would be trivial to make podman stop call ctr.Cleanup() immediately before exiting.
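
A simplified sketch of the shape of that change, with stand-in types (Container, Stop, and Cleanup here only mimic the libpod methods discussed above; this is not the actual implementation):

package main

import "fmt"

// Container is a stand-in for libpod's container type.
type Container struct{ id string }

func (c *Container) Stop(timeout uint) error {
	fmt.Println("stopping", c.id)
	return nil
}

func (c *Container) Cleanup() error {
	fmt.Println("unmounting storage and tearing down network for", c.id)
	return nil
}

// stopContainer shows the proposed guarantee: Stop() returns only once
// the container is stopped, and Cleanup() then runs synchronously, so
// no mount outlives the `podman stop` command itself.
func stopContainer(ctr *Container, timeout uint) error {
	if err := ctr.Stop(timeout); err != nil {
		return err
	}
	return ctr.Cleanup()
}

func main() {
	_ = stopContainer(&Container{id: "2b473b127c36"}, 10)
}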

@rhatdan (Member) commented Apr 7, 2020

Wouldn't that give you a race as well, in that the container would still be running? Or at least podman container cleanup would fail.

@mheon (Member) commented Apr 7, 2020

I think we guarantee that the container is stopped after the Stop() API call - it's just that we don't actually verify that the cleanup has completed when podman stop exits.

@github-actions (bot) commented May 8, 2020

A friendly reminder that this issue had no activity for 30 days.

@mheon (Member) commented May 8, 2020

I'll self-assign this. It's fairly low priority, but hopefully I can get to it sometime in the next few weeks.

mheon self-assigned this May 8, 2020
mheon added a commit to mheon/libpod that referenced this issue Jun 3, 2020
The cleanup process was already running and ensuring that mounts
and networking configuration were cleaned up on container stop,
but this was async from the actual `podman stop` command, which
breaks some expectations - the container is still mounted at the
end of `podman stop` and will be cleaned up soon, but not
immediately. Fortunately, it's a trivial change to resolve this.

Fixes containers#5747

Signed-off-by: Matthew Heon <[email protected]>
github-actions bot added the 'locked - please file new issue/PR' label Sep 23, 2023
github-actions bot locked as resolved and limited conversation to collaborators Sep 23, 2023