-
Notifications
You must be signed in to change notification settings - Fork 252
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Coupled, wam and Debug GNU Cases failing on Hera Rocky8 #2200
Comments
Related Hera Rocky8 PR #2194 |
@DomHeinzeller installed gnu-12 on Hera: /scratch1/NCEPDEV/nems/role.epic/spack-stack/modulefiles - gcc/12.2.0 |
I don't know if this will help, but I was able to get the following lua modulefile to load OK. I have yet to test running a test though:
|
@jkbk2004 @RatkoVasic-NOAA did SA's on hera install Slurm correctly?
|
@BrianCurtis-NOAA |
@RatkoVasic-NOAA same error for me with that added --mpi=pmi2 |
sounds like slurm/rocky8 is another inconsistence. @RatkoVasic-NOAA @zach1221 @FernandoAndrade-NOAA let me know if you want me to send email to get SA involved here. |
Yes, please. Go ahead and send if you don't mind. |
@RatkoVasic-NOAA When openMPI is built did you have to specify the location of Slurm? If yes, what was specified as the location for Slurm? |
@rreddy2001 here is configure for openMPI:
|
My recommendation would be to build OpenMPI with a pmi library that is being used by slurm. The pmi library used by slurm on Hera is at @rreddy2001 |
How about using the same options for compiling MPI as on Hercules (see spack-stack documentation). I remember on Rocky9 the slurm installs are all in |
Updates for the new gnu/13.3.0 compiler with openmpi/4.1.6 used to built spack-stack/1.6.0 environment with the packages for the WM and SRW. |
Description
These cases are currently crashing when run on Hera's new Rocky8 OS. These tests are the three listed below.
control_wam_debug_gnu
cpld_control_p8_gnu
cpld_debug_p8_gnu
To Reproduce:
Additional context
cpld_control_p8_gnu and cpld_debug_p8_gnu dumps crashing err osc pt2pt during run_test.sh.
control_wam_debug_gnu shows slightly different crashing behavior
and PET000.ESMF_LogFile shows
nectedCplFields FV3 Import Field vtype is not connected
20240321 013524.316 INFO PET000 /scratch2/NCEPDEV/marine/Jong.Kim/UFS-RT/pr-2194/FV3/cpl/module_cplfields.F90:545 realizeConnectedCplFields FV3 Import Field stype is not connected
20240321 013524.316 INFO PET000 /scratch2/NCEPDEV/marine/Jong.Kim/UFS-RT/pr-2194/FV3/cpl/module_cplfields.F90:545 realizeConnectedCplFields FV3 Import Field vfrac is not connected
Solutions attempted but unsuccessful have included
forcing to run on nodes 5-12
adding -mcmodel=large into gnu.cmake file
adjusting gnu.cmake as below and testing medium and large setting
#if (${CMAKE_SYSTEM_NAME} MATCHES "Darwin" AND ${CMAKE_HOST_SYSTEM_PROCESSOR} MATCHES "arm64")
set( CMAKE_Fortran_FLAGS_DEBUG "${CMAKE_Fortran_FLAGS_DEBUG} -mcmodel=small" )
#else()
set( CMAKE_Fortran_FLAGS_DEBUG "${CMAKE_Fortran_FLAGS_DEBUG} -mcmodel=medium" )
#endif()
Output
The text was updated successfully, but these errors were encountered: