JobManager: add resilience for backend failures #365
Comments
FYI, my current approach to add retries in connection.py:
JohanKJSchreurs added a commit that referenced this issue on Jan 26, 2023
JohanKJSchreurs added a commit that referenced this issue on Jan 26, 2023
JohanKJSchreurs added a commit that referenced this issue on Jan 26, 2023
JohanKJSchreurs added a commit that referenced this issue on Jan 30, 2023
JohanKJSchreurs added a commit that referenced this issue on Jan 31, 2023
jdries added a commit that referenced this issue on Jan 31, 2023
…end-failures-to-jobmanager Issue #365 Add resilience to job manager
Backends have all kinds of intermittent failures that go away over time. These usually show up as HTTP 50x errors.
Especially for the job manager, it can be better to keep retrying until the backend works again. The Python requests library can be configured to do this.
This way, long-running tasks are more resilient.
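
As a rough illustration of what configuring requests for retries could look like, here is a minimal sketch. It is not the code from connection.py. The helper name build_retrying_session, the retry count, the backoff factor and the list of status codes are all assumptions chosen for the example.

```python
import requests
from requests.adapters import HTTPAdapter
from urllib3.util.retry import Retry


def build_retrying_session(total=5, backoff_factor=1.0,
                           status_forcelist=(500, 502, 503, 504)):
    """Return a requests.Session that retries on intermittent 50x responses.

    Hypothetical helper for illustration; all default values are assumptions.
    """
    retry = Retry(
        total=total,                        # maximum number of retries per request
        backoff_factor=backoff_factor,      # growing sleep between attempts
        status_forcelist=status_forcelist,  # response codes that trigger a retry
    )
    # By default urllib3 only retries idempotent methods such as GET,
    # so a POST is not re-sent with this configuration.
    adapter = HTTPAdapter(max_retries=retry)
    session = requests.Session()
    session.mount("https://", adapter)
    session.mount("http://", adapter)
    return session
```

A session built this way could then be used for status polling, so that a temporary 502 or 503 from the backend leads to a delayed retry instead of an immediate exception. Once the retries are exhausted the request still fails, so a long-running job manager would likely also need its own error handling around each polling round.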