slow CICE6 IO in the GFSV17 HR1 test on wcoss2 #1895
Comments
CICE has a PIO option, and pnetcdf can speed up the I/O performance. CICE can't use pnetcdf on wcoss2. On C5, PIO was built with pnetcdf and there is no slow I/O issue.
Parallel netCDF (netCDF-4) uses HDF5 storage, which provides the parallel I/O capability. PnetCDF writes netCDF files in parallel using the CDF-5 format.
@junwang-noaa I've run a C768 coupled test on C5, using PIO + netcdf (i.e., serial netcdf) for ICE on 960 procs, and I see no large signal at CICE's restart frequency. In the test case below, I've turned off MOM6 restarts (by inserting a line in the cap to turn off the alarm) and I see that every 3 hours, at the restart frequency, the Advance time for CICE Run is in
With the box rearranger + pnetcdf + 8 iotasks, I get
@junwang-noaa Are your tests on WCOSS2 done with aerosols? I ask because I've always been testing with no waves and no aerosols.
No, the tests I am doing do not include aerosols or waves.
The GFSv17 HR2 load balance issue was analyzed with the S2S configuration, with the goal of fitting into the operational time window (~7 min per forecast day). The results are shown in the Google sheet "CICE in HR2 test3".
In summary:
From the results, although it would be ideal to further reduce the history file writing time in CICE, it is not a blocker for GFSv17 to reach the operational time window with th The issue will be closed.
Description
This issue was found while investigating GFSv17 scalability (issue #1367). On wcoss2, the GFSV17 HR1 test (without wave) shows a large coupling overhead. It turns out that CICE6 restart writing is very slow (~100 s to write one restart file). However, this slowness does not show up in the cpld_bmark_p8 test on wcoss2. Further investigation is required to resolve this issue.
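To give a rough sense of scale, the sketch below estimates how much wall-clock time restart writing would take per forecast day. It combines figures quoted at different points in this thread, which is an assumption: the ~100 s per restart file reported here for wcoss2, the 3-hourly restart frequency mentioned for the C5 test, and the ~7 min per forecast day operational window. It also assumes each write blocks the run rather than overlapping with computation. The function and variable names are hypothetical and not part of CICE or the UFS code.

```python
def restart_write_seconds_per_day(write_seconds: float,
                                  restart_interval_hours: float) -> float:
    """Wall-clock seconds spent writing restarts per 24-hour forecast day."""
    writes_per_day = 24.0 / restart_interval_hours
    return writes_per_day * write_seconds


# Figures quoted in this thread; combining them is an assumption.
WRITE_SECONDS = 100.0          # ~100 s per CICE6 restart file (wcoss2, HR1)
RESTART_INTERVAL_HOURS = 3.0   # 3-hourly restarts, taken from the C5 test
BUDGET_SECONDS = 7 * 60.0      # ~7 min per forecast day operational window

overhead = restart_write_seconds_per_day(WRITE_SECONDS, RESTART_INTERVAL_HOURS)
print(f"restart writing: {overhead:.0f} s per forecast day")
print(f"fraction of budget: {overhead / BUDGET_SECONDS:.2f}")
```

Under these assumptions the restart writes alone would exceed the per-day budget, which is consistent with the large coupling overhead described above. With a different restart frequency the estimate scales linearly.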
To Reproduce:
Additional context
Output