Ensure that container still exists when removing #10476

mheon · 2021-05-26T19:22:34Z

After #8906, there is a potential race condition in container removal of running containers with --rm. Running containers must first be stopped, which was changed to unlock the container to allow commands like podman ps to continue to run while stopping; however, this also means that the cleanup process can potentially run before we re-lock, and remove the container from under us, resulting in error messages from podman rm. The end result is unchanged, the container is still cleanly removed, but the podman rm command will seem to have failed.

Work around this by pinging the database after we stop the container to make sure it still exists. If it doesn't, our job is done and we can exit cleanly.

[NO TESTS NEEDED] because the race doesn't trigger easily and I can't make a test to reproduce it on demand anywhere.

openshift-ci · 2021-05-26T19:22:36Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: mheon

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [mheon]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

rhatdan · 2021-05-26T19:30:35Z

Is this to fix THE race?
LGTM

mheon · 2021-05-26T19:32:44Z

Nope. Separate BZ, unfortunately.

Would not be surprised if it fixes some races in CI, though.

After containers#8906, there is a potential race condition in container removal of running containers with `--rm`. Running containers must first be stopped, which was changed to unlock the container to allow commands like `podman ps` to continue to run while stopping; however, this also means that the cleanup process can potentially run before we re-lock, and remove the container from under us, resulting in error messages from `podman rm`. The end result is unchanged, the container is still cleanly removed, but the `podman rm` command will seem to have failed. Work around this by pinging the database after we stop the container to make sure it still exists. If it doesn't, our job is done and we can exit cleanly. Signed-off-by: Matthew Heon <[email protected]>

TomSweeneyRedHat · 2021-05-26T20:14:44Z

FWIW, addresses https://bugzilla.redhat.com/show_bug.cgi?id=1964852
LGTM

rhatdan · 2021-05-26T20:18:13Z

/lgtm
/hold

vrothberg

/hold cancel

LGTM

openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 26, 2021

mheon force-pushed the ensure_exists_on_remove branch from 9a8d9ca to fad6e1d Compare May 26, 2021 19:33

openshift-ci bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label May 26, 2021

openshift-ci bot assigned rhatdan May 26, 2021

openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label May 26, 2021

vrothberg reviewed May 27, 2021

View reviewed changes

openshift-ci bot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label May 27, 2021

openshift-merge-robot merged commit 542d730 into containers:master May 27, 2021

github-actions bot added the locked - please file new issue/PR Assist humans wanting to comment on an old issue or PR with locked comments. label Sep 23, 2023

github-actions bot locked as resolved and limited conversation to collaborators Sep 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ensure that container still exists when removing #10476

Ensure that container still exists when removing #10476

mheon commented May 26, 2021

openshift-ci bot commented May 26, 2021

rhatdan commented May 26, 2021

mheon commented May 26, 2021

TomSweeneyRedHat commented May 26, 2021

rhatdan commented May 26, 2021

vrothberg left a comment

Ensure that container still exists when removing #10476

Ensure that container still exists when removing #10476

Conversation

mheon commented May 26, 2021

openshift-ci bot commented May 26, 2021

rhatdan commented May 26, 2021

mheon commented May 26, 2021

TomSweeneyRedHat commented May 26, 2021

rhatdan commented May 26, 2021

vrothberg left a comment

Choose a reason for hiding this comment