-
Notifications
You must be signed in to change notification settings - Fork 250
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create 'coupler.res' log file in write grid comp. Explicitly specify chunk sizes in restart files #2021
Create 'coupler.res' log file in write grid comp. Explicitly specify chunk sizes in restart files #2021
Conversation
@DusanJovic-NOAA rrfs team requested to commit this pr first for operational application. Can you sync up the branch? |
Jenkins-ci ORTs passed |
Acorn is sitting on it's last item and it's queued still, it's been there for at least a few hours. There are no current fail-tests in the working directory. I plan on killing the last task on Acorn so we don't need to wait. The queue on Acorn is being used by the SA's, so I don't think this long wait is about the resources needed for this test being too high. |
…ther-model into rrfs_coupler_res
@BrianCurtis-NOAA ok I think we're good to go then. |
@DusanJovic-NOAA fv3atm pr is merged. Please revert change in .gitmodules and update the submodule hash. |
Did you find any consistency in the run taking longer than 30 or a history of higher run times? |
Sorry for confusing commit remarks.. Actually the case is cpld_debug_pdlib_p8 on jet. The baseline was originated from #1967. Somehow the test log wasn't reported in the report for a few PRs. But 1882 sec is not bad on jet in a sense jet is small machine. On Orion and hercules, the case is taking about 1500 sec. I am trying to confirm and recover missing baseline case on jet At least, I was catching up the case on jet from last PR and this PR. So, we can let this pr move on to be merged. I will update in tomorrow's tag up. |
@jkbk2004 Im confused. We have changes in this PR for cpld_debug_p8 but you mention cpld_debug_pdlib_p8 and missing baselines? Do we need the cpld_debug_p8 changes? |
@BrianCurtis-NOAA Oh! my mistake. it's cpld_debug_pdlib_p8 issue as the jet log show at the end. I am fixing now. |
All set now. |
@jkbk2004 The pdlib tests were restricted to only hera, orion, wcoss2 and cheyenne, but they got turned on everywhere in #1967. So the last two weeks worth of pdlib tests should show up in the jet logs but the test does not show up in the log for that PR's commit. Why is the test not running on jet? You can see here that the test (25) is not reported: ufs-weather-model/tests/logs/RegressionTests_jet.log Lines 1568 to 1578 in f431fbe
|
As I said, somehow the case was missing for the commits on Nov 22/27/29 and Dec but reported on Dec 4 and this pr. I am recovering now. cases are running and shows around 1880 sec. Anyway, no need to hold this pr. We can merge this pr. I will report the result of recovering missing jet cases in previous baseline dates tomorrow. |
cpld_debug_pdlib_p8 is all recovered on jet for baselines: develop-20231117 and develop-20231122. timing is around 1800 ~ 1900 sec. So bumping up wlclk makes a sense. @zach1221 it's ok to proceed to merge this pr. |
Ok, I'll need one more approval. Waiting on that. |
Please wait until after the CM meeting to merge. |
PR Author Checklist:
I have linked PR's from all sub-components involved in section below.
I am confirming reviews are completed in ALL sub-component PR's.
I have run the full RT suite on either Hera/Cheyenne AND have attached the log to this PR below this line:
I have added the list of all failed regression tests to "Anticipated changes" section.
I have filled out all sections of the template.
Description
When the restart files are written by the write grid component, the log file (coupler.res) must also be written by write grid comp to ensure that it is written after all other restart files have already be created.
The chunk sizes are now explicitly specified in restart file when quilting_restart is used, and are equal to the Nx x Ny in horizontal and 1 in all other dimensions.
Linked Issues and Pull Requests
Associated UFSWM Issue to close
Closes #2020
Subcomponent Pull Requests
NOAA-EMC/fv3atm/pull/726
Blocking Dependencies
Subcomponents involved:
Anticipated Changes
Input data
Regression Tests:
Tests effected by changes in this PR:
Libraries
Code Managers Log
Testing Log: