-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Nomad failed to start after node reboot #5584
Comments
If you're using Docker, Nomad's initial failure to start after rebooting is due to #5566 and fixed by #5568. The subsequent failure is likely due to a docker_logger process still existing after the initial nomad process has exited? I'm unsure. Can you share your systemd unit file and the output of |
But
Systemd unit file:
|
Existing processes:
|
Something similar is happening to me...when I restart Nomad client it simply hangs:
Only removing Nomad's state directories solves this. |
@zonnie Yes, it looks like 0.9.0 only. I was wrong thinking it was the same problem (as corrupted state). Will fix description. Happens on each reboot, does not depend on OS. And by the way |
Looks it is #5566 |
I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues. |
Nomad version
0.9.0
Operating system and Environment details
Arch Linux, Debian 9
Issue
Each time I reboot Nomad can't start after it.
Maybe related: #4748
Reproduction steps
Just reboot
Nomad Server logs
There is nothing in logs at first:
But after I manually restart Nomad
sudo systemctl restart nomad
:Nomad isn't started actually. I need to delete
alloc
andclient
dirs to start it.My thoughts
I think it is somehow connected with systemd. I remember that in Debian 9 systemd waited 1m 30s before it actually reboots. And I think Nomad had time to do what it wanted. On Arch Linux there is no wait time (almost instant reboot). And each time Nomad can't start after it.
Maybe some unmounting problem... don't know.
As soon as I delete
alloc
andclient
dirs Nomad starts perfectly.The text was updated successfully, but these errors were encountered: