-
Notifications
You must be signed in to change notification settings - Fork 176
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[8.0] Getting more details about failed/aborted pilots from HTCondor #7069
[8.0] Getting more details about failed/aborted pilots from HTCondor #7069
Conversation
12ca458
to
1ab913c
Compare
0ab51ff
to
f9fd4d5
Compare
Please fix the conflict. |
f9fd4d5
to
2e963c4
Compare
a89508b
to
71254ad
Compare
de00232
to
1a1415b
Compare
5c2487a
to
aa812d3
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry you were waiting for me for so long.
Emoji responses, and marking discussions as resolved don't trigger notifications.
So I was expecting explicit responses to comments with "Done", or a final: "I changed things along the lines you proposed" to have another look.
aa812d3
to
b5afceb
Compare
Indeed, my fault. Thanks. |
…s from HTCondor
Sweep summary Sweep ran in https://github.com/DIRACGrid/DIRAC/actions/runs/6782583263 Successful:
|
This PR:
allows operators to get output data even if the pilots fail/are aborted:
Failed
if they don't terminate correctly.on_exit_hold
option allows to mark pilots asheld
if they fail (on_exit_hold_subcode
allows to identify them with a specific code).Failed
.periodic_remove
deletes the pilots at some point.fixes wrong
Aborted
pilots:Aborted
in DIRAC while there are temporaryHeld
in HTCondor because inputs are spooled.Waiting
.BEGINRELEASENOTES
*Resources
FIX: getting more details about failed/aborted pilots from HTCondor
ENDRELEASENOTES