Fix race condition in testLifecycleOperations, Serialize updateContainer() calls, Ignore irrelevant container events #1353

martinpitt · 2023-07-13T10:45:47Z

No description provided.

The initial container gets started and immediately stopped via the CLI. The events propagate through the UI asynchronously, so strengthen the initial wait to ensure that the container is actually shown as "Exited". Otherwise it could still be "Running" in the UI, and trying to open the action menu would not show "Start".

With a burst of events these get called in parallel. But podman does not return them in the call order [1], which led to non-current state updates. [1] containers/podman#19124

src/app.jsx

martinpitt · 2023-07-14T05:45:14Z

testHealthcheck is still very flaky. But that already happens on main, investigating in #1324 (comment) . So retrying.

src/app.jsx

marusak

Looks good! The last commit with ignoring events may be potentially dangerous, see my comment

These are internal transient states which don't need to reflect in the UI. They happen quickly in bursts, with a "permanent state" event following such as "create", "died", or "remove". This helps to reduce the API calls and thus mitigates out-of-order results; see containers/podman#19124 We are not really interested in `podman exec` events, so we would like to ignore `exec_died` along with `exec`. However, it is the only thing that saves us from inconsistent `health_state` events (see containers/podman#19237). So we cannot rely on the latter event, but instead have to do a full update after each `exec_died`, as some of them are the health checks. Also fix the alphabetical sorting of the remaining events.

martinpitt · 2023-07-14T08:59:02Z

Argh, this makes the health check tests more flaky, especially on ubuntu-2204 (but not limited to that). I analyzed that in #1324 (comment) , reported it as containers/podman#19237 , and documented our accidental workaround explicitly.

Now we are back to the status quo of "the test flake a lot", instead of "all the time on ubuntu-2204" 😢

marusak

I am so sorry you have to deal with this :/ Thanks!

martinpitt added 2 commits July 13, 2023 12:32

Serialize updateContainer() calls

eed34d9

With a burst of events these get called in parallel. But podman does not return them in the call order [1], which led to non-current state updates. [1] containers/podman#19124

martinpitt marked this pull request as draft July 13, 2023 10:54

martinpitt force-pushed the fixes branch 2 times, most recently from a8183c4 to c10a8aa Compare July 13, 2023 11:33

martinpitt added the flake unstable test label Jul 13, 2023

martinpitt marked this pull request as ready for review July 13, 2023 12:07

martinpitt requested a review from marusak July 13, 2023 12:07

marusak reviewed Jul 13, 2023

View reviewed changes

src/app.jsx Outdated Show resolved Hide resolved

martinpitt force-pushed the fixes branch from c10a8aa to 9e7430a Compare July 13, 2023 17:30

martinpitt changed the title ~~Fix race condition in testLifecycleOperations, Serialize updateContainer() calls, Handle 'health_status' container event~~ Fix race condition in testLifecycleOperations, Serialize updateContainer() calls, Ignore irrelevant container events Jul 13, 2023

martinpitt force-pushed the fixes branch from 9e7430a to 035e797 Compare July 13, 2023 18:09

martinpitt requested a review from marusak July 14, 2023 05:45

marusak reviewed Jul 14, 2023

View reviewed changes

src/app.jsx Show resolved Hide resolved

marusak previously approved these changes Jul 14, 2023

View reviewed changes

martinpitt dismissed marusak’s stale review via 7fcf4d1 July 14, 2023 08:55

martinpitt force-pushed the fixes branch from 035e797 to 7fcf4d1 Compare July 14, 2023 08:55

martinpitt requested a review from marusak July 14, 2023 09:25

marusak approved these changes Jul 14, 2023

View reviewed changes

martinpitt merged commit 92398c3 into cockpit-project:main Jul 14, 2023

martinpitt deleted the fixes branch July 14, 2023 09:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix race condition in testLifecycleOperations, Serialize updateContainer() calls, Ignore irrelevant container events #1353

Fix race condition in testLifecycleOperations, Serialize updateContainer() calls, Ignore irrelevant container events #1353

martinpitt commented Jul 13, 2023 •

edited

Loading

martinpitt commented Jul 14, 2023

marusak left a comment

martinpitt commented Jul 14, 2023 •

edited

Loading

marusak left a comment

Fix race condition in testLifecycleOperations, Serialize updateContainer() calls, Ignore irrelevant container events #1353

Fix race condition in testLifecycleOperations, Serialize updateContainer() calls, Ignore irrelevant container events #1353

Conversation

martinpitt commented Jul 13, 2023 • edited Loading

martinpitt commented Jul 14, 2023

marusak left a comment

Choose a reason for hiding this comment

martinpitt commented Jul 14, 2023 • edited Loading

marusak left a comment

Choose a reason for hiding this comment

martinpitt commented Jul 13, 2023 •

edited

Loading

martinpitt commented Jul 14, 2023 •

edited

Loading