
podman play --service-container : is not stopping (flake) #14351

Closed
edsantiago opened this issue May 24, 2022 · 2 comments · Fixed by #14354
Assignees: vrothberg
Labels: flakes (Flakes from Continuous Integration), locked - please file new issue/PR (Assist humans wanting to comment on an old issue or PR with locked comments)

Comments

@edsantiago (Member)

Three flakes in my PR, finally passed on the fourth try:

# #| FAIL: Timed out waiting for container b9180307e1fd-service to enter state running=false

Unfortunately, #14338 (timeout bump) did not work. Sorry @vrothberg.

[sys] 341 podman play --service-container

edsantiago added the flakes (Flakes from Continuous Integration) label May 24, 2022
@vrothberg (Member)

Thanks, @edsantiago. I am going to take a look immediately.

@vrothberg (Member)

I smell where it's coming from. Will prepare a PR in a jiffy.

vrothberg added a commit to vrothberg/libpod that referenced this issue May 25, 2022
Simplify the work-queue implementation by using a wait group. Once all
queued work items are done, the channel can be closed.

The system tests revealed a flake (containers#14351) indicating that the
service container does not always get stopped, which suggests a race
condition when queuing items.  Those items are queued in a goroutine to
prevent potential deadlocks if the queue ever fills up too quickly.
The race condition in question: if a work item queues another item, the
goroutine doing the queuing may not be scheduled before the runtime
shuts down; this seems to happen fairly easily on the slow CI machines.
The wait group fixes this race and allows for simplifying the code.

Also increase the queue's buffer size to 10 to make things slightly
faster.

[NO NEW TESTS NEEDED] as we are fixing a flake.

Fixes: containers#14351
Signed-off-by: Valentin Rothberg <[email protected]>
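
For illustration, here is a minimal, self-contained Go sketch of the wait-group pattern the commit message describes. It is not the actual libpod code; the `WorkQueue` type, its methods, and the example items are hypothetical. The point it shows is why the wait group closes the race: every enqueue registers with the wait group before the item is handed off, so shutdown cannot complete while work queued by another work item is still pending.

```go
// Hypothetical sketch of a wait-group-backed work queue; not the libpod implementation.
package main

import (
	"fmt"
	"sync"
)

// WorkQueue runs queued functions on a single worker goroutine.
type WorkQueue struct {
	queue chan func()
	wg    sync.WaitGroup
}

// NewWorkQueue starts the worker.  The buffer size mirrors the commit's
// choice of 10 to reduce the chance of enqueuers blocking.
func NewWorkQueue() *WorkQueue {
	w := &WorkQueue{queue: make(chan func(), 10)}
	go func() {
		for item := range w.queue {
			item()
			w.wg.Done()
		}
	}()
	return w
}

// Queue registers the item with the wait group *before* handing it off, so
// even items queued from inside another item are counted.  The hand-off runs
// in a goroutine to avoid deadlocking if the buffer is full.
func (w *WorkQueue) Queue(item func()) {
	w.wg.Add(1)
	go func() { w.queue <- item }()
}

// Shutdown blocks until every queued item (including anything those items
// queued in turn) has run, then closes the channel so the worker exits.
func (w *WorkQueue) Shutdown() {
	w.wg.Wait()
	close(w.queue)
}

func main() {
	w := NewWorkQueue()
	w.Queue(func() {
		fmt.Println("stop pod containers")
		// A work item may queue follow-up work (e.g. stopping the service
		// container last); the wait group still accounts for it.
		w.Queue(func() { fmt.Println("stop service container") })
	})
	w.Shutdown()
}
```

Without the wait group, the inner hand-off goroutine might not be scheduled before shutdown proceeds, which matches the flake where the service container was never stopped.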
vrothberg self-assigned this May 25, 2022
cdoern pushed a commit to cdoern/podman that referenced this issue May 27, 2022
github-actions bot added the locked - please file new issue/PR label Sep 20, 2023
github-actions bot locked as resolved and limited conversation to collaborators Sep 20, 2023