Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Short term archiving removing MPAS restart files #1364

Closed
rljacob opened this issue Apr 16, 2017 · 4 comments
Closed

Short term archiving removing MPAS restart files #1364

rljacob opened this issue Apr 16, 2017 · 4 comments
Assignees
Labels
CRITICAL Responsibility: E3SM Responsibility to manage and accomplish this issue is through E3SM

Comments

@rljacob
Copy link
Member

rljacob commented Apr 16, 2017

Issue #1252 is nearly solved. But testing with some different restart frequencies is showing problems.

Test 1: low-res fully coupled ACME config.
Run for 1 year with restart and end of 1 year.
Everything works fine. See https://gist.github.com/rljacob/ca750c3d4a2ea262db5b728b78e59acc

Test 2: same case but run for 2 years and keep the restart frequency the same.
The interim restarts have the string "0002-01-01" while the final restarts have "0003-01-01".

The archiver deleted both sets for the active MPAS components, thinking they were all interim:

-------------------------------------------
doing short term archiving for mpaso (ocn)
-------------------------------------------
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpaso.rst.am.timeSeriesStatsDaily.0003-01-01_00000.nc
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpaso.rst.am.timeSeriesStatsCustom.0003-01-01_00000.nc
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpaso.rst.0002-01-01_00000.nc
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpaso.rst.am.timeSeriesStatsClimatology.0003-01-01_00000.nc
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpaso.rst.am.timeSeriesStatsClimatology.0002-01-01_00000.nc
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpaso.rst.am.timeSeriesStatsMonthly.0003-01-01_00000.nc
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpaso.rst.am.timeSeriesStatsDaily.0002-01-01_00000.nc
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpaso.rst.am.timeSeriesStatsMonthly.0002-01-01_00000.nc
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpaso.rst.0003-01-01_00000.nc
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpaso.rst.am.timeSeriesStatsCustom.0002-01-01_00000.nc
histfiles_savein_rundir []
moving
/lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpaso.hist.am.globalStats.0003-01-01.nc to
/lcrc/group/acme/jacob/archive/Wlowrmaster/ocn/hist/mpaso.hist.am.globalStats.0003-01-01.nc

and also

-------------------------------------------
doing short term archiving for mpascice (ice)
-------------------------------------------
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpascice.rst.0003-01-01_00000.nc
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpascice.rst.0002-01-01_00000.nc
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpascice.rst.am.timeSeriesStatsMonthly.0002-01-01_00000.nc
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/mpascice.rst.am.timeSeriesStatsMonthly.0003-01-01_00000.nc

@rljacob
Copy link
Member Author

rljacob commented Apr 16, 2017

Compare to land model which does the right thing:

-------------------------------------------
doing short term archiving for clm (lnd)
-------------------------------------------
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/Wlowrmaster.clm2.r.0002-01-01-00000.nc
removing interim restart file /lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/Wlowrmaster.clm2.rh0.0002-01-01-00000.nc
copying
/lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/Wlowrmaster.clm2.r.0003-01-01-00000.nc to
/lcrc/group/acme/jacob/archive/Wlowrmaster/rest/0003-01-01-00000/Wlowrmaster.clm2.r.0003-01-01-00000.nc
copying
/lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/Wlowrmaster.clm2.h0.0002-12.nc to
/lcrc/group/acme/jacob/archive/Wlowrmaster/rest/0003-01-01-00000/Wlowrmaster.clm2.h0.0002-12.nc
copying
/lcrc/group/acme/jacob/acme_scratch/Wlowrmaster/run/Wlowrmaster.clm2.rh0.0003-01-01-00000.nc to
/lcrc/group/acme/jacob/archive/Wlowrmaster/rest/0003-01-01-00000/Wlowrmaster.clm2.rh0.0003-01-01-00000.nc

@rljacob rljacob added Responsibility: E3SM Responsibility to manage and accomplish this issue is through E3SM ty: Bug CRITICAL labels Apr 25, 2017
@rljacob
Copy link
Member Author

rljacob commented Apr 25, 2017

Reporting here but we need to fix/test within ACME.

@rljacob
Copy link
Member Author

rljacob commented Apr 25, 2017

Removed "bug" because this isn't a problem with code on master, just my (ACME) branch.

@rljacob
Copy link
Member Author

rljacob commented May 4, 2017

This has been fixed on the ACME side. Will be brought in the next time we subtree split from ACME.

@rljacob rljacob closed this as completed May 4, 2017
@ghost ghost removed the in progress label May 4, 2017
jgfouca pushed a commit that referenced this issue Jun 2, 2017
…c-compsets

Adds E and I compsets for BGC experiments
jgfouca pushed a commit that referenced this issue Feb 23, 2018
…c-compsets

Adds E and I compsets for BGC experiments
jgfouca pushed a commit that referenced this issue Mar 13, 2018
…c-compsets

Adds E and I compsets for BGC experiments
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CRITICAL Responsibility: E3SM Responsibility to manage and accomplish this issue is through E3SM
Projects
None yet
Development

No branches or pull requests

2 participants