-
Notifications
You must be signed in to change notification settings - Fork 119
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GST_release_public_v1 test fails on Hera in latest develop #874
Comments
Very interesting. I take it that the test is failing on Hera using the Intel compiler? I ask because the Hera coverage tests are passing, and |
Yes, sorry for the missing detail: this is for Intel. Here is a link to my working directory for the latest develop: /scratch2/BMC/fv3lam/kavulich/UFS/workdir/test_develop/2023-07-26/expt_dirs/GST_release_public_v1 |
The
at the exact same location (~27 steps). The link to my working directory for the latest develop on Orion is: |
PR #799 (hash 294e18b) appears to be the point that the |
Thanks @MichaelLueken, that makes sense since the failure seems to be model instability again. Since this was a test specifically for the v1 release, it might make sense to return to the DT_ATMOS= 40 used in that release for that specific test. But a higher value would probably also work. |
I tried various
So, it looks like the test will only pass for either GNU compilers or Intel compilers. Are there other parameters that can be tweaked to try and correct these errors, or will we need to add a |
I don't think a convoluted solution is necessary. This is an old test using now-unsupported data and a now-unsupported physics suite. And we don't actually know if it originally worked on GNU hera since that wasn't tested regularly until recently. I am almost of the mind that the test should be removed (for the above reasons) if it can't be fixed for all platforms, but this is something that probably needs wider discussion. |
From the August 3rd SRW App Code Management meeting, @gsketefian noted that the GST_release_public_v1 test was only meant for SRWv1 testing, so it can be removed now. |
Expected behavior
WE2E test GST_release_public_v1 should run successfully on all platforms.
Current behavior
Currently the test fails at the run_fcst step with the line
FATAL from PE 7: compute_qs: saturation vapor pressure table overflow, nbad= 1
followed by a core dump. This typically indicates a CFL violation/model instability.
Full log file can be found below. This occurs in the current develop as well as hash f9696e1 (July 10), but likely occurs in earlier hashes as well.
Machines affected
Hera. Have not noticed this on other machines, but I can not be sure if this is Hera-specific or not.
Edit: note that this is for the Intel compiler, in community mode (GNU compiler seems to succeed strangely). I have not tested in NCO mode.
Steps To Reproduce
Output
run_fcst_mem000_2019061500.log
The text was updated successfully, but these errors were encountered: