Skip to content
Andrew Kiss edited this page Feb 12, 2021 · 76 revisions

Archiving model output

Check that the line

# postscript: sync_output_to_gdata.sh

is commented out in $ACCESS_OM_DIR/control/1deg_jra55_ryf/config.yaml, or if not, that GDATADIR in sync_output_to_gdata.sh is in fact where you want output and restarts to be written. WARNING: double-check GDATADIR so you don't overwrite existing output! - see below.

Updating an experiment

CAUTION: OUT OF DATE! SEE ISSUE 42

Use a git branch for each experiment

Each experiment can be assigned a separate git branch via

git branch expt
git checkout expt

where expt is the name for your experiment. For clarity it's best if this matches the name of the output directory used for the COSIMA Cookbook (e.g. 1deg_jra55_ryf_spinupN in the next section). For that matter you could also put the experiment name as the first line of $ACCESS_OM_DIR/control/1deg_jra55_ryf/ocean/diag_table which will make it appear in the MOM output netcdf file metadata as the global title field.

More details here.

Starting a new experiment using restarts from a previous experiment

The first step is to work out which restart you want to begin from. run_summary.py can help - it will show you which run number corresponds to which date - e.g. see /g/data/hh5/tmp/cosima/access-om2-run-summaries.

Simple case: no date change

It's pretty straightforward if you want to continue from some restart in another experiment and have the start date of your experiment match the date in the initial restart:

  1. git clone the control directory from the experiment you are starting from, e.g. git clone --no-hardlinks prev_control_dir my_new_experiment, where prev_control_dir is what you're starting from, and my_new_experiment is whatever you want to call your new experiment. This creates a new directory my_new_experiment that copies the contents of prev_control_dir, including its entire git commit history. Using --no-hardlinks to avoids potential permissions issues in git.
  2. cd my_new_experiment
  3. view your git remotes with git remote -v. You may wish to fix origin if you don't want it to refer to prev_control_dir, e.g. git remote set-url origin https://github.com/COSIMA/1deg_jra55_iaf.git if you're using that config.
  4. if you want to start from a restart that isn't the most recent in prev_control_dir it's a good idea to check out the commit corresponding to that restart in case the configuration for that run was different from the latest, e.g. git checkout abc123 if the git hash for the run that generated that restart starts with abc123. You can determine the relevant git hash with git log or by using run_summary.py which tabulates the run git hash for each run number and model date - e.g. see /g/data/hh5/tmp/cosima/access-om2-run-summaries.
  5. check out a new branch for the new experiment (e.g. git checkout -b my_new_experiment; it avoids confusion if this matches your directory name)
  6. change SYNCDIR in sync_data.sh (or GDATADIR in sync_output_to_gdata.sh in older configurations) to a currently non-existent path (for clarity it should match your branch name from the previous step) CHANGING SYNCDIR/GDATADIR IS CRUCIAL TO PREVENT POSSIBLE DATA LOSS!
  7. make an archive symlink: do payu setup to create a work symlink. Use ls -l to see where this links to and create an analogous directory with the same path but with work replaced with archive, using mkdir -p. Then use ln -s <archive-path> archive to make an archive directory symlink in your new control directory, where <archive-path> is the path you used in mkdir -p. Then do payu sweep to get rid of the work symlink.
  8. copy the required restart directory from the previous experiment (say, restart123) into archive
  9. you may also need to copy the associated output123/ice/cice_in.nml into your archive - see https://github.com/payu-org/payu/issues/193
  10. make whatever other configuration changes you want
  11. update metadata.yaml if you want to document how your new experiment differs from prev_control_dir
  12. run your experiment with payu run

More complicated cases

Things are more complicated if you want to change the run start date, for example continuing with interannual forcing after spinning up with repeat-year forcing. As an example, the following steps show how the IAF run 01deg_jra55v13_iaf was started from year 40 in the RYF run 01deg_jra55v13_ryf8485_spinup6.

  1. git clone the control directory from the experiment you are starting from, e.g. git clone --no-hardlinks prev_control_dir my_new_experiment, where prev_control_dir is what you're starting from, and my_new_experiment is whatever you want to call your new experiment. This creates a new directory my_new_experiment that copies the contents of prev_control_dir, including its entire git commit history. Using --no-hardlinks to avoids potential permissions issues in git.
  2. cd my_new_experiment
  3. check out the commit corresponding to the restart you have chosen to use (run_summary.py can help - it shows the run git hash corresponding to each run number and model date - e.g. see /g/data/hh5/tmp/cosima/access-om2-run-summaries), e.g. git checkout abc123 if the run git hash starts with abc123
  4. check out a new branch for the new experiment (e.g. git checkout -b my_new_experiment; it avoids confusion if this matches your directory name)
  5. change GDATADIR in sync_output_to_gdata.sh to a currently non-existent path (for clarity it should match your branch name from the previous step) CHANGING GDATADIR IS CRUCIAL TO PREVENT POSSIBLE DATA LOSS!
  6. check that atmosphere/forcing.json specifies the right forcing files (NB: wildcards such as * cannot be used following this commit)
  7. check/fix forcing_start_date and forcing_end_date in accessom2.nml
  8. make an archive symlink: do payu setup to create a work symlink. Use ls -l to see where this links to and create an analogous directory with the same path but with work replaced with archive, using mkdir -p. Then use ln -s <archive-path> archive to make an archive directory symlink in your new control directory, where <archive-path> is the path you used in mkdir -p. Then do payu sweep to get rid of the work symlink.
  9. copy the required restart directory from the previous experiment (say, restart123) into archive, and rename it restart000 (if you want to)
  10. you may also need to copy the associated output123/ice/cice_in.nml into your archive - see https://github.com/payu-org/payu/issues/193
  11. check/fix FORCING_CUR_DATE and EXP_CUR_DATE in archive/restart000/accessom2_restart.nml (you'll probably want FORCING_CUR_DATE to be the date of the start of your new run, since CICE will use this if use_restart_time = .false. in ice/cice_in.nml)
  12. check/fix calendar (should be gregorian for IAF, noleap for RYF) and current model time in archive/restart000/ocean/ocean_solo.res
  13. for the first run you'll need to set use_restart_time = .false. in ice/cice_in.nml
  14. run your experiment with payu run
  15. after the first run you'll need to set use_restart_time = .true. in ice/cice_in.nml

Check archive/output*/atmosphere/log/matmxx.pe00000.log to see if the correct forcing files are being read. Also check that archive/output*/ice/OUTPUT file dates are correct and that archive/output*/ocean/time_stamp.out is correct.

If you can't get this to work you may need to omit step 10 and alter the timing information in the restart file as discussed here.

Be aware that there can be subtle problems with calendars and leap years, which can trigger a forcing and experiment dates are out of sync error - see https://github.com/COSIMA/access-om2/issues/117 and https://github.com/COSIMA/access-om2/issues/149. If it doesn't bother you to have the forcing and run dates differing in more than just the year, set allow_forcing_and_exp_date_mismatch = .true. in the date_manager_nml group in accessom2.nml.

FIXME: integration information from the following conversation

aidan [09:28 AM] Care to share in case anyone else has the same problem?

nic [10:35 AM] yes, sorry. The problem was/is that CICE figures out its restart date based on an offset kept in the netcdf header of the restart file. This offset is from the beginning of the experiment and counted in seconds. the other models just use a restart date. the problem was that my experiment start date was different from Andrew, hence the ice restart date calculation came up with something different. so when I said that I dont see any difference between our configs I wasn't looking very hard.

aidan [10:40 AM] You had to edit the cice restart file to make this work?

nic [10:41 AM] no, I had to edit the accessom2.nml to make the forcing_start_date the same as his. this setup is not very clear/intuitive

however fixing cice date handling might be biting of more than we want to chew

aidan [10:51 AM] I'm thinking this might be a pretty common thing to want to do, so as long as we have clear guidelines/instructions on how to use someone else's restarts, what to do if you do need to change the model date etc.

Updating restarts for new bathymetry

If the MOM bathymetry file (topog.nc) needs to be changed on an existing run it's ocean restarts will need to be fixed up. Otherwise the restarts may not contain valid data at all points. This section describes a method for doing this. This github issue comment may also be helpful.

The approach taken is to create ocean restart files that match the new bathymetry (we call these the template restarts), then copy over all valid data from restarts for the existing run (call these the old restarts). The end result will be a restart that is the same as the existing run at all points which exist in both the old and the template (we call these the new restarts). Any new points that don't exist in the old restarts will contain whatever existed in the template restarts. This approach is very simple but does have a downside - if the bathymetry has changed a lot then there may be many points whose state is not consistent with the old restarts.

Step by step:

  1. Download topogtools. This contains a simple script that does the copying described above.
git clone https://github.com/COSIMA/topogtools.git
  1. You'll need template restarts from a run with the new bathymetry. If no previous run exists you'll need to create the restarts with a very short run from rest using the new topog.nc and matching CICE and MOM land masks kmt.nc and ocean_mask.nc as inputs (see step 7; create the land masks with topogtools/topog2mask.py topog.nc kmt.nc ocean_mask.nc).

  2. Collate the MOM restarts from both the template run with the new bathymetry and the run with the old bathymetry, e.g:

module use /g/data/hh5/public/modules
module load conda/analysis3
payu collate -d template-run/archive/restart000/ocean
payu collate -d old-topo-run/archive/restart123/ocean
  1. Get an interactive PBS session with additional CPUs and memory, e.g.:
qsub -I -P e14 -v DISPLAY -q express -l ncpus=4,mem=128Gb,walltime=10:00:00
  1. Run the fix_restarts.py script to create new MOM restarts based on the old restarts but with the new bathymetry and any new ocean points filled in with the template, e.g.:
cd topogtools
fix_restarts.py --help
./fix_restarts.py template-run/archive/restart000/ocean old-topo-run/archive/restart123/ocean new-topo-run/archive/restart123/ocean

This can take quite a while. Note that fix_restarts.py requires Python 3 - you might need to do module use /g/data/hh5/public/modules; module load conda/analysis3 first.

  1. Copy the restart files for the other model components, including the appropriate kmt.nc, e.g.:
cp old-topo-run/archive/restart123/accessom2_restart.nml new-topo-run/archive/restart123/
cp -r old-topo-run/archive/restart123/atmosphere new-topo-run/archive/restart123/
cp -r old-topo-run/archive/restart123/ice new-topo-run/archive/restart123/
cp template-run/archive/restart000/ice/kmt.nc new-topo-run/archive/restart123/ice
  1. Edit submodel input directories in config.yaml to ensure that MOM is using topog.nc and the matching land mask ocean_mask.nc, and that CICE is using the matching land mask kmt.nc. You might want to do payu setup to check that the resulting work directory links to the correct input files (then payu sweep to tidy up).

The configuration should now be ready to run.

Scaling the forcing fields

YATM permits arbitrary spatiotemporal scaling of the forcing (e.g. JRA55-do), which can be useful for perturbation experiments or to eliminate occasional timestep-limiting storms.

We leave the original forcing files untouched but carry multiplicative factors in auxiliary scaling files that are specified in atmosphere/forcing.json via an optional key "scaling_filename", for example in /g/data/hh5/tmp/cosima/access-om2-01/01deg_jra55v13_ryf8485_spinup6_newtopog_scalewind/output435/atmosphere/forcing.json:

{
  "description": "JRA55-do V1.3 RYF 1984-85 forcing",
  "inputs": [
    {
      "filename": "/g/data/ua8/JRA55-do/RYF/v1-3/RYF.rsds.1984_1985.nc",
      "fieldname": "rsds",
      "cname": "swfld_ai"
    },

...trimmed...

    {
      "filename": "/g/data/ua8/JRA55-do/RYF/v1-3/RYF.u_10.1984_1985.nc",
      "scaling_filename": "/home/156/aek156/payu/01deg_jra55v13_ryf8485_spinup6_newtopog_scalewind/input/RYF.u_10.1984_1985_scale.nc",
      "fieldname": "uas_10m",
      "cname": "uwnd_ai"
    },
    {
      "filename": "/g/data/ua8/JRA55-do/RYF/v1-3/RYF.v_10.1984_1985.nc",
      "scaling_filename": "/home/156/aek156/payu/01deg_jra55v13_ryf8485_spinup6_newtopog_scalewind/input/RYF.v_10.1984_1985_scale.nc",
      "fieldname": "vas_10m",
      "cname": "vwnd_ai"
    },
  ]
}

Each auxiliary scaling .nc file (e.g. RYF.u_10.1984_1985_scale.nc and RYF.v_10.1984_1985_scale.nc above) has the same structure (variable and dimension names and order) as the forcing file that it will scale, specified by "filename" (e.g. RYF.u_10.1984_1985.nc and RYF.v_10.1984_1985.nc above). Note that the scaling fields only need to be defined for the forcing times they are required. If the scaling file is present and defined for a given forcing time and field, the forcing field seen by the model will be the original field multiplied by the scaling factor. Otherwise the original field will be used.

The scaling values are arbitrary. The examples RYF.u_10.1984_1985_scale.nc and RYF.v_10.1984_1985_scale.nc above were generated by this notebook and are simply 1 minus a Gaussian in x,y,t which is localised to the location and time at which a problematic storm occurred in the JRA55-do data (see discussion here).

It's a very good idea to use compression in the netcdf scaling file, especially if your scaling is time-independent. We are investigating better ways to handle time-independent perturbations: https://github.com/COSIMA/libaccessom2/issues/31

This scaling capability was implemented in this commit; the scaling multiplication is done here.

This approach could scale any JRA55-do forcing field used to drive CICE (and therefore MOM), but if you wanted additive (not multiplicative) perturbations a code change would be needed: https://github.com/COSIMA/libaccessom2/issues/30

Changing the bathymetry, land-sea mask and OASIS remapping weights

This tutorial describes the process to follow when changing the bathymetry and land-sea mask for the purposes of, say, opening a strait or simulating a paleo-oceanographic situation.

1. Make your changes to the MOM topog.nc file

All changes to the bathymetry are made by creating a custom version of the topog.nc located in the MOM folder of the input (e.g. /short/public/access-om2/input_236a3011/mom_1deg for the 1-degree model). Any locations with a depth greater than 0 will be considered sea points subsequently for mask creation, while anything that has depth 0 (or negative depth) will be considered land. Place the new topography file in your own input folder (e.g. /short/e14/rmh561/access-om2/input/custom_topog_test/mom_1deg/). This new input folder needs to be refered to in your config.yaml in the mom submodule section above the default input folder (a similar line is needed to include the other custom input files below):

    - name: ocean
      model: mom
      exe: /short/e14/rmh561/access-om2/bin/fms_ACCESS-OM_1c1f23e_libaccessom2_b6caeab.x
      input:
         - /short/e14/rmh561/access-om2/input/custom_topog_test/mom_1deg
         - /short/public/access-om2/input_236a3011/mom_1deg

2. Changing the associated land–sea masks

Since the input topog.nc file has been changed, MOM's & CICE's land-sea masks will also need to be changed to match the new topography. The topog2mask script will create the corresponding CICE mask file (kmt.nc) and MOM mask file (ocean_grid.nc).

Download topogtools via:

git clone https://github.com/COSIMA/topogtools.git

This repository contains a simple python script (topog2mask.py) that will create the matching land–sea masks through the command: ./topog2mask.py topog.nc kmt.nc ocean_mask.nc

Place the new files in your input folder, referring to them in config.yaml above the public inputs (kmt.nc in the cice submodule).

3. Changing the OASIS remapping files

The OASIS coupler is in charge of feeding CICE with the correct atmospheric forcings. To do this, OASIS remaps the atmosphere grid into the corresponding CICE grid according to the remapping weights files specified for each field in the ATMOSPHERE --->>> ICE section of namcouple. The default remapping weights files (rmp_jra55_cice_conserve.nc and rmp_jra55_cice_smooth.nc) mask out the atmospheric forcings over land. If they are not remade then OASIS will feed the model 0's over every land point, which can result in a variety of errors (e.g. a division-by-zero runtime error associated with the air temperature, or problems with the air-sea heat flux, see Issue 173.

To remake the mapping files using the new land-sea masks use the script make_remap_weights.py in the access-om2/tools/ directory. This script needs the ocean_mask.nc and ocean_hgrid.nc input files from mom, as well as the JRA55 data directories and the yatm_1deg directory (not sure why for this last one?). I used the following command from a qsub script (executed from my custom_topog_test input directory) to get this working (I'm not on x77 so couldn't use Nic's directories):

../../tools/make_remap_weights.py ./ /g/data/ua8/JRA55-do/RYF/v1-3/ ../yatm_1deg/ --atm JRA55 --ocean MOM1 --npes 16

(the ocean_mask.nc and ocean_hgrid.nc files must be in the ./mom_1deg folder). Note that the 1/4-degree and 1/10-degree models likely require more resources and a longer processing time (see the tools/make_remap_weights.sh qsub script).

This command produces the JRA55_MOM1_conserve.nc and JRA55_MOM1_patch.nc remapping weights files that should be copied into a commmon_1deg_jra55/ input directory and then referenced in config.yaml under the common inputs:

model: access-om2
input:
     - /short/e14/rmh561/access-om2/input/custom_topog_test/common_1deg_jra55
     - /short/public/access-om2/input_236a3011/common_1deg_jra55

Finally, the ATMOSPHERE --->>> ICE section of namcouple needs to be changed to refer to these new remap weights files. Replace all appearances of rmp_jra55_cice_conserve.nc with JRA55_MOM1_conserve.nc and rmp_jra55_cice_smooth.nc with JRA55_MOM1_patch.nc.

4. Checks

You should now be good to go. It is worth doing a payu setup to check that all input folders' paths are correct. Note that some of the problems won't neccessarily result in a error being thrown by the model (e.g. incorrect conserve remapping weights - see Issue 173). A useful field to look at to check that things are working is the net surface heat flux net_sfc_heating. Note that you will likely still have problems if you're trying to restart from a previous run, as the initial conditions won't be defined if you've created new ocean points. However, it will work if starting from WOA initial conditions as these are defined everywhere.