Script checks in Consul stuck at critical state "TTL expired" if node was restarted #6332
Comments
One of the possible fixes: #6333
Hi @AlexanderZagaevskiy and thanks for this report! The handling of script checks was refactored in 0.10.0-beta, and that should resolve this problem. The lifecycle of a script check is now tied directly to the task runner for its task, so when we restore the task we'll restore the script check as well. It looks like the fix in #6333 was opened against the master branch, but that branch already has the 0.10.0-beta work I described above. If you don't want to wait until 0.10.0 for the fix, would you be willing to fork from the v0.9.5 tag instead? If you do that, you'll see we already have the logic you've added a bit further down at
@tgross You are absolutely right. PR #6333 was created by mistake: I applied a patch written for v0.9.5 to the current master branch. The patch does fix the issue for me when applied to Nomad v0.9.5. Great to read that the issue will most likely be fixed in the upcoming 0.10.0. So, is it worth opening a PR to fix it for 0.9.5?
We are going to be cutting a 0.9.6 bugfix release as well, so it would be great to have your PR in for those folks who won't be ready to go directly to 0.10.0.
Closing this as mentioned in #6351 (comment)
I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues. |
Nomad version
Nomad v0.9.5
Operating system and Environment details
Ubuntu 18.04.1 LTS, running a cluster made up of a few Docker containers with Nomad and Consul agents inside them.
Issue
Script checks registered in Consul by Nomad stay in a not-running state after the node has been restarted. This issue looks very similar to an older one: #1636
Reproduction steps
All you need is to restart the node.