-
Notifications
You must be signed in to change notification settings - Fork 318
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Derecho transition: Get CTSM5.2 mksurfdata_esmf tool working on Derecho, Casper, Izumi #2201
Comments
UPDATE: See below for latest test on derecho.
On casper it fails with a different error:
On izumi it fails later in the build:
|
UPDATE:
We updated /tools/mksurfdata_esmf/gen_mksurfdata_build.sh
Now the build proceeds most of the way, but fails with this error:
|
Update: |
@ekluzek this seems like a blocker for continued development of the 5.2 branch. Is this an issue that can be addressed quickly? |
Thank you everyone! As discussed with Keith @olyson , I am posting my recent experience trying to get mksurfdata_esmf work on Derecho. Hope some of this information could be useful. I started on ctsm5.2.mksrf.19_ctsm5.1.dev125, when Derecho was not on the config_machines.xml (ccs_config_cesm0.0.64), so I copied the latest part from ccs_config_cesm/machines/derecho/config_machines.xml as well as files config_machines.xsd and env_mach_specific.xsd under CIME/data/config/xml_schemas/. I also added the part Sam mentioned above (set pio_iotype=1 for Derecho) before building the tool. I noticed an update on 1/23 which change the version to ctsm5.1.dev163-482-g1e0bca41f. This seems to fix the problems above. Then I made the following modification: (I was not completely sure what I was doing though, but the tool seems working after these changes...)
Then
The tool was built sucessfully on Derecho and I was able to generate surfdata and landuse.timeseries for (1) a default namelist for 0.9x1.25, 1850-2015, 16-pft and (2) a 16-pft SSP3-7.0 case (0.9x1.25, 1850-2100). I also tried 78-pft with no success(3). The main error was "OFI tagged senddata failed (ofi_send.h:372:MPIDI_OFI_send_normal:Bad address)". I feel my setup or input files could be causing the issue. mklai() did also complain that the input is 16pft instead of 78. Relevant file locations: (2) success - /glade/work/bowen/ctsm5.2.mksurfdata/tools/mksurfdata_esmf/SSP370_for_Yuan/ (this one was ctsm5.2.mksrf.19_ctsm5.1.dev125). Steps I took for this one:
I used (3) failed - multiple, e.g. /glade/work/bowen/ctsm5.2.mksurfdata_new/tools/mksurfdata_esmf/hist_urban_for_Song/surfdata_0.9x1.25_hist_1850_78pfts_c240124.namelist I feel I probably did not change the right thing, but would be happy in case this helps in any way... -Bowen |
A big thank you to you, @fang-bowen |
Looking in my mksurfdata_jobscript_single, I see |
Yes, thanks for sharing your experience @fang-bowen! This gets us going. I think this might not be the right thing long term in order to get it working on multiple machines and compilers and MPI libraries. But, this gets us going and others can benefit as well, so it's great all around... |
I do find some cases failing. I currently suspect memory issues. I will play with the number of nodes that I request. |
Opened new issue for casper and izumi. Closing this one as fixed for derecho. |
Originally part of #1995 from @slevis-lmwg.
The text was updated successfully, but these errors were encountered: