auto update: return restart error #17796
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: vrothberg. The full list of commands accepted by this bot can be found here. The pull request process is described here.
Return the error when restarting the unit failed during an update. The task is correctly marked to have failed but we really need to return the error to the user. [NO NEW TESTS NEEDED] - The flakes in containers#17607 will reveal errors. Signed-off-by: Valentin Rothberg <[email protected]>
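The behavior described in this commit message can be illustrated with a minimal Go sketch. This is not the actual Podman code: the `task` type, its `status` field, the `restartFunc` callback, and `errRestart` are hypothetical names chosen for illustration, and the restart call is stubbed out so the example runs without systemd. It only shows the general pattern of marking the task as failed and also returning the wrapped error to the caller.

```go
package main

import (
	"errors"
	"fmt"
)

// errRestart is a hypothetical error standing in for a failed restart job.
var errRestart = errors.New("restart job failed")

// task is a hypothetical stand-in for an auto-update task.
type task struct {
	unit   string
	status string
}

// restartFunc abstracts the unit restart so the sketch runs without systemd.
type restartFunc func(unit string) error

// update restarts the unit. If the restart fails, the task is marked
// as failed AND the error is returned instead of being dropped.
func (t *task) update(restart restartFunc) error {
	if err := restart(t.unit); err != nil {
		t.status = "failed"
		return fmt.Errorf("restarting unit %s during update: %w", t.unit, err)
	}
	t.status = "updated"
	return nil
}

func main() {
	failing := func(string) error { return errRestart }
	t := &task{unit: "container-demo.service"}
	if err := t.update(failing); err != nil {
		fmt.Println("error:", err)
	}
	fmt.Println("status:", t.status)
}
```

Wrapping with `%w` keeps the underlying error available to callers via `errors.Is`, which could help when a flake log needs to show why the restart failed.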
LGTM
Well, here's the error message [source]:
This is really poking in the dark. I added some more debug logs to the test in the hope of getting something useful. I really have no idea what's going on on Debian. It's always the same pattern: restarting the systemd units fails during auto update. Why is it only flaking on Debian? @edsantiago have you seen these tests fail on Fedora as well, but maybe at such a low frequency that it fell under the (flake) radar?
Add debug logs from systemctl and journalctl in hope to get more data on the Debian flakes tracked in containers#17796. Signed-off-by: Valentin Rothberg <[email protected]>
There are only two Fedora flakes in auto-update, and both are the "1 vs 2" flake:
Still happening, still Debian. Is there anything in that log that can help you?
I'd need to clone myself at the moment to find time to look into it, sorry :( Will take a look as soon as I find time.
@edsantiago are you OK to merge, or shall I remove the echos in the tests?
/lgtm
Thanks! My current "feeling" is that after this change we will only see I will look into the dbus/systemd dependency we use and check whether a bug may linger there. Other than that, we may try to restart the service another time if the first attempt has failed. But that would be my last resort as it feels like giving up.
Return the error when restarting the unit failed during an update. The task is correctly marked to have failed but we really need to return the error to the user.
@edsantiago 🤞
[NO NEW TESTS NEEDED] - The flakes in #17607 will reveal errors.
Does this PR introduce a user-facing change?