rework container.execute to use multiprocessing as watchdog #50

jmtd · 2023-11-30T11:52:18Z

I'm going to ask some of the other behave test users I know of to try this out, since it's quite a significant change. But it seems to be critical to getting GitHub Actions CI working again for our images: an individual test that blocks will now fail without causing the whole test run to be aborted.

I've pushed this to my fork's v1 branch too, to make it easier to try out.

(commit message follows)

Work around docker.APIClient.exec_start sometimes blocking indefinitely by running it in a sub-process and throwing an exception if the sub-process does not complete within a given timeout.
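For readers who have not seen the pattern, here is a minimal sketch of a multiprocessing watchdog. It is not the code from this PR: `run_with_watchdog`, `WatchdogTimeout` and `_worker` are hypothetical names, and the sketch assumes the Unix-only "fork" start method so that the target does not need to be picklable. Any callable stands in for the blocking docker call.

```python
import multiprocessing
import queue
import time


class WatchdogTimeout(Exception):
    """Raised when the watched call overruns its timeout (hypothetical name)."""


def _worker(func, args, results):
    # Runs in the child process: report either the return value or the error.
    try:
        results.put((True, func(*args)))
    except Exception as exc:
        results.put((False, repr(exc)))


def run_with_watchdog(func, args=(), timeout=30.0):
    """Run func(*args) in a sub-process; raise WatchdogTimeout if it overruns."""
    # Assumption: "fork" start method (Unix only), so func is inherited, not pickled.
    ctx = multiprocessing.get_context("fork")
    results = ctx.Queue()
    proc = ctx.Process(target=_worker, args=(func, args, results))
    proc.start()
    try:
        # Waiting on the queue doubles as the watchdog timer.
        ok, value = results.get(timeout=timeout)
    except queue.Empty:
        # The call is presumed stuck: kill the child and fail this caller only.
        proc.terminate()
        proc.join()
        raise WatchdogTimeout(
            f"call did not complete within {timeout} seconds"
        ) from None
    proc.join()
    if not ok:
        raise RuntimeError(value)
    return value
```

A caller would wrap the potentially blocking call, for example `run_with_watchdog(time.sleep, (60,), timeout=30)`, and treat `WatchdogTimeout` as a failure of the current step rather than of the whole run.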

Remove the existing post-exec code which polled the value of docker.APIClient.exec_inspect for 15 seconds to determine if the command had completed. This is effectively performed by the new sub-process waiting. I've set the timeout to 30 seconds, up from 15, which (from experimentation) seems to be necessary to account for the extra time it takes to invoke exec_start within the timeout period.
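For contrast, a polling loop of the kind being removed might have looked roughly like the sketch below. This is an illustration only, not the deleted code: `wait_for_exec` and `FakeClient` are hypothetical, and the fake stands in for `docker.APIClient`, whose `exec_inspect` result includes `Running` and `ExitCode` keys. Note that a loop like this only starts once `exec_start` has returned, so it cannot help when `exec_start` itself blocks.

```python
import time


def wait_for_exec(client, exec_id, timeout=15.0, interval=0.5):
    """Poll exec_inspect until the command stops running or time runs out."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        info = client.exec_inspect(exec_id)
        if not info["Running"]:
            return info["ExitCode"]
        time.sleep(interval)
    raise TimeoutError(f"exec {exec_id} still running after {timeout} seconds")


class FakeClient:
    """Stand-in for docker.APIClient: reports 'running' for a fixed number of polls."""

    def __init__(self, polls_until_done):
        self.polls_left = polls_until_done

    def exec_inspect(self, exec_id):
        self.polls_left -= 1
        running = self.polls_left > 0
        return {"Running": running, "ExitCode": None if running else 0}
```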

A future change should make this timeout configurable.
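One possible shape for that future change is to read the timeout from the environment and fall back to the current 30 seconds. This is a sketch under assumptions: the variable name `EXEC_WATCHDOG_TIMEOUT` and the helper `exec_timeout` are hypothetical and do not exist in behave-test-steps.

```python
import os

# Matches the value hard-coded by this change.
DEFAULT_EXEC_TIMEOUT = 30.0


def exec_timeout(environ=os.environ, name="EXEC_WATCHDOG_TIMEOUT"):
    """Return the watchdog timeout in seconds (the variable name is hypothetical)."""
    raw = environ.get(name)
    if raw is None or raw.strip() == "":
        return DEFAULT_EXEC_TIMEOUT
    try:
        value = float(raw)
    except ValueError:
        raise ValueError(
            f"{name} must be a number of seconds, got {raw!r}"
        ) from None
    # "not value > 0" also rejects NaN, which compares false to everything.
    if not value > 0:
        raise ValueError(f"{name} must be positive, got {raw!r}")
    return value
```

Rejecting bad values loudly, rather than silently using the default, keeps a mistyped setting from masking slow tests.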

This general pattern (of watchdogging the docker library code) might be useful elsewhere, in particular for any future efforts to support parallel test execution.

Signed-off-by: Jonathan Dowland <[email protected]>

The jmtd fork of behave-test-steps has its v1 branch matching this PR: cekit/behave-test-steps#50

This adds a multiprocess watchdog around invoking `docker.APIClient.exec_start`, which will abort the current step if that call has not returned within 30 seconds. This means a lock-up during a step will cause that test to fail and not the whole test run.

Signed-off-by: Jonathan Dowland <[email protected]>

spolti · 2023-11-30T15:04:12Z

IIRC there is an environment variable that you can set to increase the timeout, BEHAVE_TIMEOUT I think.
@rnc


rnc · 2024-03-12T11:29:41Z

@jmtd Is this ready to merge?

jmtd · 2024-03-12T15:56:36Z

Yes, sorry for the delay.

jmtd mentioned this pull request Dec 4, 2023

Fix GHA by various methods rh-openjdk/redhat-openjdk-containers#418

Merged

rnc merged commit bf40440 into cekit:v1 Mar 13, 2024
1 check passed

jmtd mentioned this pull request Apr 4, 2024

Disable a flaky test on CI rh-openjdk/redhat-openjdk-containers#473

Closed
