-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[0.8.4] Nomad UI Hanging on Job detail Viewing #5946
Comments
Hello, thanks for the report. Are you able to try this out with Nomad 0.9.2 or later? It looks to me that this problem was fixed with UI updates in that version. |
Will try it out - I'm waiting on 0.9.4 to start upgrading our fleet. |
i had managed to overcome the problem. i had stoped and purged a periodic job by name (redis-config-updater) by stop --purge. below is a script i had run to remove all dead periodic jobs - now the ui is responsive. i am not sure if the ui is stuck due to amount of jobs - it sure looks like it - anyway after the purge the ui become responsive again. @echo off for /f "tokens=1,2,4" %%a in ('..\nomad.exe status --address http://192.168.15.123:4646') do ( ) |
I can confirm we are seeing this issue on 0.10.4. Also seems related to stopped periodic jobs once they are garbage collected. |
I can confirm we just hit this issue when we upgraded Nomad to UI was hanging, Purging pending batch jobs fixed it for us by running the following (
UI works again. |
I looked into this awhile back and ran into deep issues within Ember itself that have since been fixed in newer versions. I'll be revisiting this after we finish our UI tech debt work that includes an Ember upgrade: #7834 |
We've recently run into the same issue. It happend on one of our clusters that's still running an ancient version of Nomad: v0.8.3. That cluster has been running for some years now and it's the first time we've encountered this issue (it happend after a significant network outage). We're currently migrating this cluster to the latest Nomad release (actually we're rebuilding it). But apparently that doesn't matter since it's also present in current releases. Lucky the workaround posted by @scalp42 works perfectly. So for anyone else bumping into this issue: execute the bash oneliner posted by @scalp42 and the Nomad UI should be working again. |
Hey folks, I'm seeing this issue several times a day. The one-liner from @scalp42 works sometimes, but the issue also happens with dead batch jobs. Forcing a |
I also ran into this today; seems to be related to some pending parameterized jobs. @scalp42 's fix worked for me. |
Yup, I have lately been having the same issue with parameterized jobs. Opening multiple tabs to jobs reproduces this. UI hangs and then the web server becomes completely clogged (with error on Chrome |
Hi everyone! Thank you for your patience with this bug, and especially thank you @scalp42 for the one-liner workaround. I believe this is now fixed in v0.12.1. See the explanation of the solution here. Given the number of reports this bug has gotten, I don't want to close this issue until there has been some community confirmation. Please try out 0.12.1 and see if this fixes the issue for you! |
I'm going to deploy Nomad 0.12.1 across the infrastructure soon and I'll report back. Thanks @DingoEatingFuzz 😅 |
I guess this issue can be closed now (I haven't seen it anymore). |
Confirming, can be closed. |
I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues. |
Re-opening previously closed issue. This issue is not resolved and definitely seems like a Nomad bug.
Nomad version
0.8.4
Operating system and Environment details
Ubuntu 16.04.4 LTS
Issue
In the Nomad UI, when you click on a job or client to view allocations, the UI hangs attempting to load allocations instead of loading the actual allocations.
I was able to still use the CLI to run nomad status to view the jobs and view their underlying allocations through nomad status
Reproduction steps
If there is a periodic job launch that is still in the allocations list, but the parent job had aged out (in our case the periodic job is just stuck in
pending
).Other logs
404 in web browser console details in https://user-images.githubusercontent.com/6162849/46262074-e7078f80-c52e-11e8-9102-44d112cf3e9e.png as per #4464 . Note that this screenshot is from another user, but it looks to be a similar problem - the console records a single job showing a 404 when loading
/jobs
splash page.Note that browsing directly to a valid job seems to work (as in changing the URL to
/jobs/<JOBNAME>
manually), but it seems that this job that Nomad is confused about giving the 404 back on the main/jobs
page is preventing clicking-in to any job from loading.The text was updated successfully, but these errors were encountered: