-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Service jobs should support the ability to be restarted after a failure #134
Comments
@kelseyhightower That is odd, it should have done that automatically... Was there another healthy client available at the time of failure? |
No, which has me thinking that the scheduler attempts to make a decision and fails the job when there are no other healthy nodes. I was expecting the job to be retried until success. |
@kelseyhightower Yeah, there is no good story for retrying a job when the scheduler fails in 0.1. We probably need to create a "retry" evaluation in a wait state. |
PR #393 fixes this. |
Fixes races with Raft configurations member, adds GetConfiguration accessor.
I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues. |
While testing the example redis job, I stopped the agent and the job was marked dead. When I restarted the agent, the job did not get rescheduled. This maybe by design, but it was unexpected, and I assumed since the job was a service nomad would just reschedule the job to an health node.
The text was updated successfully, but these errors were encountered: