Fixes Redi Surface Taper double counting #5171

vanroekel · 2022-09-02T03:56:53Z

Currently the SlopeTriads are tapered at the surface AND tapered in
mpas_ocn_tracer_hmix_redi as well, which is a double counting.
The tapering in the latter is removed.

In addition to removing the double count in the taper, Redi is reduced to horizontal mixing in the mixed layer, consistent with other modeling center implementations and literature (e.g. Ferrari et al 2008). This is accomplished by setting the Redi taper to zero in the mixed layer, which disables all terms except the horizontal mixing term.

[NCC]

Currently the SlopeTriads are tapered at the surface AND tapered in mpas_ocn_tracer_hmix_redi as well, which is a double counting. The tapering in the latter is removed.

vanroekel · 2022-09-02T03:57:33Z

The sfc tapering is first applied in mpas_ocn_gm.F here https://github.com/E3SM-Project/E3SM/blob/master/components/mpas-ocean/src/shared/mpas_ocn_gm.F#L353-L361 and also in tracer_hmix_redi so was applied twice.

vanroekel · 2022-09-02T03:58:13Z

This PR is nonBFB and potentially climate changing, but the impact is likely small.

components/mpas-ocean/src/shared/mpas_ocn_tracer_hmix_redi.F

vanroekel · 2022-09-02T13:36:02Z

Thanks for the catches @mark-petersen I think they are all fixed now

mark-petersen

Compared visually to commit 0a90dc3 on branch vanroekel/vanroekel/ocean/add-submesoscale-eddies, which is what @vanroekel used to run the long tests at https://web.lcrc.anl.gov/public/e3sm/diagnostic_output/ac.vanroekel/E3SMv2/20220715.submeso.piControl.ne30pg2_EC30to60E2r2.chrysalis/. This PR has the identical code for these lines. Also tested with nightly suite on stand-alone for gnu debug and intel debug. Using gnu optimized, this PR is not bfb for tests with Redi on, as expected.

jonbob · 2022-09-08T21:27:02Z

Started two 10-year LR tests on anvil to check the impact of this non-BFB PR

jonbob · 2022-09-13T17:11:53Z

I completed the two 10-year runs, one as a baseline and the other including this branch. The results comparing the two are here:

20220908.PR5171.anvil

@vanroekel - if you get a chance, could you please make sure the differences are as expected? And @xylar and @mark-petersen, I would very much appreciate it if you could both check as well. The third run @xylar and I discussed, which includes PR #5172 as well, has completed 5-years and waiting in the anvil queue to get to 10...

mark-petersen · 2022-09-13T22:20:02Z

Thanks for posting this @jonbob. It appears that the mean and variability of all these plots is very close, and differences are what you would expect from two ensemble members. This can be seen most easily in the transports, but other statistics are similar:

xylar · 2022-09-14T02:20:18Z

Thank @jonbob. I'll want to see the comparison with the 3rd run before making any assessment.

vanroekel · 2022-09-14T02:28:05Z

In testing in the AMOC group we found fairly small sensitivity to this change @jonbob so your results make sense, but I agree with @xylar that seeing the third run will be important before making the final judgement.

xylar · 2022-09-14T02:33:00Z

@vanroekel, for context, I had requested the 3 runs just so we can be sure what the relative effects of this change are compared with turning on submesoscale. @jonbob agreed to do 2 of those runs last week and then we agreed it was easier to redo the 3rd rather than trying to compare with the existing submesoscale run. I presume we'll see much bigger changes in the latter but I just don't want us to be mistakenly attributing changes to the submesoscale parameterization that actually come from this bug fix (even if that might be pretty harmless).

vanroekel · 2022-09-14T02:50:39Z

Thanks for the additional context @xylar this was my hunch as to why you requested the run. and I agree it makes good sense to do that run prior to signing off on this PR.

jonbob · 2022-09-14T15:05:15Z

I got a seaice error at 00090402, so I've flipped the BFBFLAG and resubmitted

jonbob · 2022-09-15T15:44:11Z

With the BFBFLAG flipped, I ran into an ocean state validation error at 00090406 on the anvil run. Because I had been waiting in the queue for so long, I had also started a test on chrysalis. That one hit an ocean state validation error at 00070223, so I've flipped the BFBFLAG there as well and that run is past that point now. @vanroekel -- I'll try to figure out where your long run is and compare the input files and configurations

jonbob · 2022-09-15T16:20:23Z

A diff of @vanroekel's mpaso_in file and the one generated by my codebase shows some potential issues:

74c74
<  config_submesoscale_ce = 0.08
---
>  config_submesoscale_ce = 0.06
102,103c102,105
<  config_mixedlayerdepths_crit_dens_threshold = 0.03
<  config_mld_reference_depth = 10
---
>  config_eddymld_dens_threshold = 0.03
>  config_eddymld_reference_depth = 10
>  config_eddymld_reference_pressure = 1.0e5
>  config_eddymld_use_old = .true.
238a241
>  config_flux_limiter = 'monotonic'
240,242c243,247
<  config_monotonic = .true.
<  config_vert_tracer_adv = 'stencil'
<  config_vert_tracer_adv_order = 3
---
>  config_remap_limiter = 'monotonic'
>  config_vert_advection_method = 'flux-form'
>  config_vert_remap_interval = 0
>  config_vert_remap_order = 3
>  config_vert_tracer_adv_flux_order = 3

Is there a reason our default value for config_submesoscale_ce is different? And the vertical tracer advection?

vanroekel · 2022-09-15T17:46:05Z

The submesoscale parameter is something I changed that we should change by default, sorry for missing that earlier. I don’t know about the vertical advection parts. I never changed those. Perhaps the VLR capability went in after I did the big run?

jonbob · 2022-09-15T18:02:49Z

Thanks @vanroekel - we'll get that merged in, and I'll rerun my test with the value to match. DID you run into a bunch of state validation errors getting your run off the ground? On chrysalis my restart failed again at 00090326 so I've flipped the BFBFLAG and trying again. But after that I'll reset the submesoscale parameter and start over

jonbob · 2022-09-15T19:04:42Z

@vanroekel - we can change the submesoscale parameter in PR #5172, when we turn the model on

mark-petersen · 2022-09-15T20:20:54Z

@jonbob the options that changed above are the correct translation for running with flux-form vertical advection (i.e. no change). See here:

In https://github.com/E3SM-Project/E3SM/pull/4968/files#diff-abb97a018f3fd5f1b21ca9ed827ae06a36fde557c388b4a42d69c33cbfbd62e7

jonbob · 2022-09-15T20:28:14Z

Thanks @mark-petersen -- good to know. So the namelist comparison makes sense, once we change config_submesoscale_ce? I have a test running with the updated value and will post on its progress

mark-petersen · 2022-09-15T20:34:21Z

Yes, after you change config_submesoscale_ce the namelist is identical.

mark-petersen · 2022-09-20T20:24:01Z

Coordinated with @vanroekel on comparing code with branch used for successful 800 year simulation. Added the following commit. This code was in the long simulation but not in this PR. Tested with gnu debug using stand-alone nightly suite.

xylar · 2022-09-21T00:54:12Z

components/mpas-ocean/src/shared/mpas_ocn_gm.F

                  sfcTaper = min(RediKappaSfcTaper(k, cell1), RediKappaSfcTaper(k, cell2))
+                  if (k < min( indMLD(cell1), indMLD(cell2))) then
+                    sfcTaper = 0.0_RKIND
+                  else
+                    sfcTaper = 1.0_RKIND
+                  end if


I don't understand why the first line here isn't replaced by the last 5. It seems like the work done in the first line now does nothing.

Also, could something be added to explain this change?

In particular, I'm wanting to make sure these still relate to the topic of this PR, the double counting of the Redi surface taper.

Your last point is a very good one. This is definitely beyond the title of the PR. This is now more along the lines of 'generic redi fixes'. I'll add a bit of explanation here about this change -- it is physically motivated and I believe fixes the stability issues @jonbob was seeing. Within the upper ocean the mesoscale eddies act horizontally and transition to along isopycnal in the interior (e.g. Ferrari et al 2005), this is a very rudimentary way to achieve that. The reason I think this will fix the issue with stability is that when I removed the surface taper double counting the redi cross terms became stronger, leading to numerical instability. This change is effectively a much stronger taper on those terms. I included this in my long 800 year spin up and didn't see stability issues, but Jon sees them quickly. I can add more explanation to this PR and/or in the code.

Finally, you are absolutely right that the first line of five is not necessary.

Thanks! If you update the PR description, I am happy with this being included.

more than happy to add this to the description, but I may hold off until @jonbob can test the impact to see if this stabilizes the run.

xylar · 2022-09-21T00:55:05Z

components/mpas-ocean/src/shared/mpas_ocn_gm.F

@@ -351,6 +351,11 @@ subroutine ocn_GM_compute_Bolus_velocity(statePool, &
                  slopeTaperDown = 1.0_RKIND + slopeTaperFactor*(slopeTaperDown - 1.0_RKIND)

                  sfcTaper = min(RediKappaSfcTaper(k, cell1), RediKappaSfcTaper(k, cell2))
+                  if (k < min( indMLD(cell1), indMLD(cell2))) then


Suggested change

if (k < min( indMLD(cell1), indMLD(cell2))) then

if (k < min(indMLD(cell1), indMLD(cell2))) then

vanroekel · 2022-09-21T03:21:50Z

@xylar @jonbob @mark-petersen -- I agree with Xylar that the changes to the surface taper are not consistent with the PR title. If this fix helps relieve Jon's instability issues, is it possible/sufficient to change the PR title / description? Or is a new PR needed? i think the branch name is still appropriate.

jonbob · 2022-09-21T15:22:54Z

@vanroekel - it's easy to change the PR title and description, with edit buttons. And I do not believe there's any point in changing the branch name, because that does require a new PR. So please let me know if you need any help editing this PR, and we'll otherwise assume to push on getting this PR in.

jonbob · 2022-09-21T21:40:03Z

@vanroekel, @mark-petersen - the test of this PR ran 10 years just fine, but with #5172 added to turn on the submesoscale, my test still crashes. This time it's at 00050130, and has left a pile of error and debug files if anyone wants to try to make sense of them. If so, they're at:

/lcrc/group/acme/ac.jwolfe/scratch/chrys/20220920.PR5172.chrysalis/run

I'll flip the BFBFLAG and see if it holds together

jonbob · 2022-09-22T22:05:57Z

With a little coaxing (and BFBFLAG flipping) I did get the 10-year test with the submesoscale model on to complete as well. Here are links to the maps-analysis output for both:

20220920.PR5171.chrysalis - test with this PR merged but the mesoscale model OFF

20220920.PR5172.chrysalis - test with this PR merged and the mesoscale model ON

@vanroekel, @xylar, @mark-petersen - please check the output and make sure it all makes sense to you

please note that the analysis package was run comparing these runs to a baseline instead of observations

xylar

Based on the results of @jonbob's 3 simulations and the analysis comparing them, I'm ready to approve this. From the analysis comparing this PR to the baseline, I don't see any changes that consistently point to the simulation results getting either worse or better, they are simply slightly different (like a different ensemble member).

On the other hand, significant changes are visible from #5172, as expected. Over a 10-year period, it is not clear if these changes are in a desirable direction so that's where @vanroekel's longer simulations are required. But for the purposes of approving this PR, it is clear that the submesoscale parameterization and not this bug fix are most likely responsible for the changes we're seeing, as expected.

Thanks you @vanroekel and @jonbob for the very hard work you have put into these PRs, simulations, and analyisis!

xylar · 2022-09-24T18:50:36Z

@jonbob, I'll try to find out why the hovmoller plots didn't work but I don't think that should hold up this merge.

xylar · 2022-09-24T22:10:22Z

@jonbob, just so you know, I found the cause of the MPAS-Analysis issue you had and have a fix here: MPAS-Dev/MPAS-Analysis#900

It is unrelated to this PR (except that it was related to you having to restart because of the BFBFLAG flips you were having to do).

vanroekel · 2022-09-25T04:36:53Z

@xylar @mark-petersen Just a note about @jonbob's comment about coaxing the runs to finish and the instability of the code with submesoscales on. Looking through I found an issue in the submesoscale implementation where I named the variable gradBuoyEddy, but to achieve BFB in #5099 the computation of gradBuoyEddy was changed to be the calculation of the density gradient. I have fixed this in #5172 and Jon tested this and it seems to be running smoothly through 13 years.

vanroekel · 2022-09-25T04:37:25Z

@xylar one last note - I modified the PR description to explain the other change to the sfcTaper. Does that description look good to you?

xylar · 2022-09-25T05:29:40Z

@vanroekel, yep, looks good!

jonbob · 2022-09-26T16:04:56Z

The test of this branch with PR #5172 now runs stably, with no failures through 30 years. Comparison of this run with a 10-year baseline is at:

20220920.PR5172.chrysalis - test with this PR merged and the mesoscale model ON

I'll also run diagnostics comparing to observations for the full 30-year run and post as soon as they complete.

Here's also output from a 10-year run with just PR #5172 and not this branch, to help tease out the impacts of the two PR's:

20220923.PR5172.chrysalis - test w/o this PR but submesoscale model on

mark-petersen

Looking at the simulations, before and after this PR are the same, within variability, as expected, both with and without the submesoscale eddy parameterization flag on. Thanks @jonbob for your thorough testing.

jonbob · 2022-09-26T18:53:57Z

MPAS-Analysis of 30-year runs with the submesoscale model on, both with and without this PR, comparing to obs:

20220920.PR5172.chrysalis_vs_obs1-30 - test with this PR merged and the submesoscale model ON

20220923.PR5172.chrysalis_vs_obs1-30 - test w/o this PR merged and the submesoscale model ON

golaz · 2022-09-28T17:26:56Z

Simulations look quite similar as noted by @mark-petersen. Some differences I noticed:

Slight weakening of AMOC after the PR.
More pronounced near four-year oscillation in global SST and AMOC time series after the PR.

…5171) Fix Redi Surface Taper double counting Currently the SlopeTriads are tapered at the surface AND tapered in mpas_ocn_tracer_hmix_redi as well, which is a double counting. The tapering in the latter is removed. In addition to removing the double count in the taper, Redi is reduced to horizontal mixing in the mixed layer, consistent with other modeling center implementations and literature (e.g. Ferrari et al 2008). This is accomplished by setting the Redi taper to zero in the mixed layer, which disables all terms except the horizontal mixing term. [NCC]

jonbob · 2022-09-28T17:46:51Z

passes sanity testing -- merged to next

jonbob · 2022-09-29T18:36:11Z

merged to master and expected DIFFs blessed, except for PEM_Ln9.ne30pg2_EC30to60E2r2.WCYCL1850.chrysalis_intel which seemed to have randomly failed instead of making a DIFF

xylar · 2022-09-29T19:56:53Z

Thanks, @jonbob and @vanroekel for the hard work on this. Exciting that we're so close to having #5172 in!

Fixes Redi Surface Taper double counting

b547d67

Currently the SlopeTriads are tapered at the surface AND tapered in mpas_ocn_tracer_hmix_redi as well, which is a double counting. The tapering in the latter is removed.

vanroekel added bug mpas-ocean CC PR is climate changing labels Sep 2, 2022

vanroekel requested a review from mark-petersen September 2, 2022 03:56

vanroekel assigned jonbob Sep 2, 2022

vanroekel added NCC Larger-then-roundoff diffs but not believed climate changing and removed CC PR is climate changing labels Sep 2, 2022

mark-petersen reviewed Sep 2, 2022

View reviewed changes

components/mpas-ocean/src/shared/mpas_ocn_tracer_hmix_redi.F Outdated Show resolved Hide resolved

mark-petersen reviewed Sep 2, 2022

View reviewed changes

components/mpas-ocean/src/shared/mpas_ocn_tracer_hmix_redi.F Outdated Show resolved Hide resolved

Removes extra characters

98296b4

mark-petersen approved these changes Sep 2, 2022

View reviewed changes

mark-petersen mentioned this pull request Sep 6, 2022

Enables and modifies the submesoscale eddy parameterization #5172

Merged

rljacob added bug fix PR and removed bug labels Sep 15, 2022

mark-petersen self-requested a review September 20, 2022 15:11

Add mixed layer depth check to surface taper

42eca04

xylar reviewed Sep 21, 2022

View reviewed changes

xylar approved these changes Sep 24, 2022

View reviewed changes

mark-petersen approved these changes Sep 26, 2022

View reviewed changes

jonbob merged commit 878597d into master Sep 29, 2022

jonbob deleted the vanroekel/ocean/fix-redi-surface-taper branch September 29, 2022 18:20

xylar mentioned this pull request Nov 19, 2022

Update E3SM-Project submodule MPAS-Dev/compass#461

Merged

32 tasks

	if (k < min( indMLD(cell1), indMLD(cell2))) then
	if (k < min(indMLD(cell1), indMLD(cell2))) then

Fixes Redi Surface Taper double counting #5171

Fixes Redi Surface Taper double counting #5171

Conversation

vanroekel commented Sep 2, 2022 • edited Loading

vanroekel commented Sep 2, 2022

vanroekel commented Sep 2, 2022

vanroekel commented Sep 2, 2022

mark-petersen left a comment

Choose a reason for hiding this comment

jonbob commented Sep 8, 2022

jonbob commented Sep 13, 2022

mark-petersen commented Sep 13, 2022

xylar commented Sep 14, 2022

vanroekel commented Sep 14, 2022

xylar commented Sep 14, 2022

vanroekel commented Sep 14, 2022

jonbob commented Sep 14, 2022

jonbob commented Sep 15, 2022

jonbob commented Sep 15, 2022

vanroekel commented Sep 15, 2022

jonbob commented Sep 15, 2022

jonbob commented Sep 15, 2022

mark-petersen commented Sep 15, 2022

jonbob commented Sep 15, 2022

mark-petersen commented Sep 15, 2022

mark-petersen commented Sep 20, 2022

xylar Sep 21, 2022

Choose a reason for hiding this comment

xylar Sep 21, 2022

Choose a reason for hiding this comment

xylar Sep 21, 2022

Choose a reason for hiding this comment

vanroekel Sep 21, 2022

Choose a reason for hiding this comment

xylar Sep 21, 2022

Choose a reason for hiding this comment

vanroekel Sep 21, 2022

Choose a reason for hiding this comment

xylar Sep 21, 2022

Choose a reason for hiding this comment

vanroekel commented Sep 21, 2022

jonbob commented Sep 21, 2022

jonbob commented Sep 21, 2022

jonbob commented Sep 22, 2022 • edited Loading

xylar left a comment

Choose a reason for hiding this comment

xylar commented Sep 24, 2022

xylar commented Sep 24, 2022 • edited Loading

vanroekel commented Sep 25, 2022

vanroekel commented Sep 25, 2022

xylar commented Sep 25, 2022

jonbob commented Sep 26, 2022

mark-petersen left a comment

Choose a reason for hiding this comment

jonbob commented Sep 26, 2022

golaz commented Sep 28, 2022

jonbob commented Sep 28, 2022

jonbob commented Sep 29, 2022

xylar commented Sep 29, 2022

vanroekel commented Sep 2, 2022 •

edited

Loading

jonbob commented Sep 22, 2022 •

edited

Loading

xylar commented Sep 24, 2022 •

edited

Loading