Job fails, logs inaccessible: "The specified log stream does not exist." #12
Comments
note - I am now rerunning to see if it fails again, so the error is not
currently visible
On Sun, Aug 27, 2017 at 11:36 AM, Chris Filo Gorgolewski wrote:
https://openneuro.org/datasets/ds001038/versions/00001?app=BARACUS&version=3&job=da564531-76cd-402f-809b-37994f1c3223
"download all logs" fails with "Failed - Server problem"
Originally reported by @poldrack <https://github.com/poldrack>
--
Russell A. Poldrack
Albert Ray Lang Professor of Psychology
Professor (by courtesy) of Computer Science
Bldg. 420, Jordan Hall
Stanford University
Stanford, CA 94305
http://www.poldracklab.org/
Original error:
This seems to affect logs from all jobs, which makes it a high-priority bug.
Looks like the log stream names created by Batch have changed: "FreeSurfer/eb251a5f-6314-457f-a36f-d11665451ddb/bf4fc87d-2e87-4309-abd9-998a0de708de" is now "FreeSurfer/default/bf4fc87d-2e87-4309-abd9-998a0de708de". Logs are not lost, but our handler will need to be updated to support both formats.
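A rough sketch of what "support both formats" could look like: build both candidate stream names and try them in order. This is not the project's actual handler. `StreamNotFound`, `candidate_stream_names`, `fetch_first_available`, and the `fetch` callback are hypothetical names, and the meaning of the middle path segment in the older format (called `middle_id` here) is an assumption based only on the two example names above.

```python
from typing import Callable, List, Optional, Tuple


class StreamNotFound(Exception):
    """Hypothetical error raised by the log backend when a stream is missing."""


def candidate_stream_names(job_definition: str, middle_id: str, task_id: str) -> List[str]:
    """Return both observed stream name formats, newest first.

    `middle_id` is whatever identifier used to sit between the job
    definition name and the task ID (an assumption, see lead-in).
    """
    return [
        f"{job_definition}/default/{task_id}",      # newer format
        f"{job_definition}/{middle_id}/{task_id}",  # older format
    ]


def fetch_first_available(
    names: List[str],
    fetch: Callable[[str], List[str]],
) -> Optional[Tuple[str, List[str]]]:
    """Try each candidate name and return (name, log lines) for the first hit.

    `fetch` stands in for the real log lookup and is expected to raise
    StreamNotFound when the stream does not exist.
    """
    for name in names:
        try:
            return name, fetch(name)
        except StreamNotFound:
            continue
    return None  # neither format exists
```

Trying the newer format first keeps the common case to a single lookup, while older jobs still resolve through the fallback.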
It would be great to get a fix rolled out to production soon. We cannot debug jobs at the moment. Thank you!
FYI - this only seems to be happening on prod, not on dev.
This seems to have regressed: Download link is also an HTML redirect: Additional error from dev console:
I think this is the memory problem from #131 and not related to this issue.
MRIQC (without the ICA option, which was the case here) has a much lower memory footprint, so I'm surprised it ran out of memory. Interference from another job on the same node?
This has regressed yet again in https://openneuro.org/datasets/ds001091/versions/00001?app=antsCorticalThickness&version=1 Furthermore, the job status visual labels are not showing. It also seems that plenty of jobs failed recently: https://openneuro.org/admin/jobs
Closing this as this is different from the issue causing the recent errors (ECS instability) and that should be fixed.