dss operations within coupler pass an unused `t` argument. #396

akshaysridhar · 2023-08-17T23:41:47Z

Following the CTS interface, the `weighted_dss_slab!` function definition requires a third argument, `t` (time), which is unused within the `ClimaCore.Spaces.weighted_dss!` function. We may want to remove this from our general design pattern.


LenkaNovak · 2023-08-18T02:09:39Z

@akshaysridhar @sriharshakandala @kmdeck

As @gdecker1 found earlier, Float32 runs now break in ClimaLSM due to t being specified as an FT type here. For some reason t is being returned as Float64 even if all arguments to the ODEProblem are passed as FT32. @dennisYatunin maybe you'll be familiar with the internals of this, do you think this could be an issue in ClimaTimeSteppers?

For some reason our FT32 Buildkite was set with FT64, which is why we didn't catch this earlier! Error is reproducible if the driver on this branch is run interactively.

(Note: FT32 didn't cause problems in these interactive Coupler runs before the ClimaLSM.dss! PR.)

Unless we're doing something silly here, are you all happy to remove the type specification (like we did in the coupler #387) in ClimaLSM, before this issue gets resolved in ClimaTimeSteppers? As it stands, we can't test AMIP for FT32.

akshaysridhar · 2023-08-18T15:53:16Z

Adding notes from Slack here: callbacks in the timestepping methods also require a `(u, p, t)` argument list.

dennisYatunin · 2023-08-18T19:01:15Z

You should probably just drop the unnecessary FT type restriction. The value of t is passed here from the integrator for the sake of debugging (e.g., printing @info "Value at $t: ..."); its type does not matter because it should not be used for anything else. For now, t always needs to be stored as a Float64 because Float32 does not have enough bits to accurately track time without roundoff error.
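To make the roundoff concern concrete, here is a minimal sketch using only Julia Base functions. The simulated time of 1e7 seconds and the 0.4-second step are illustrative values chosen for this example, not taken from any CliMA configuration.

```julia
# Compare how finely Float32 and Float64 can resolve time near t = 1e7 s
# (roughly 116 days of simulated time). Values here are illustrative only.
t32 = 1.0f7   # time stored as Float32
t64 = 1.0e7   # time stored as Float64

# eps(x) is the spacing between x and the next representable value.
gap32 = eps(t32)   # one full second in Float32
gap64 = eps(t64)   # on the order of nanoseconds in Float64

# A hypothetical 0.4 s time step is rounded away entirely in Float32 ...
stalled = (t32 + 0.4f0 == t32)
# ... but still advances the clock in Float64.
advanced = (t64 + 0.4 > t64)
```

Once the spacing exceeds the time step, a Float32 clock stops advancing, which is one way the roundoff problem can show up in long simulations.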

kmdeck · 2023-08-21T16:59:03Z

Hi all, catching up here.
Is this correct: ClimaLSM needs a PR which drops the type restriction on t in dss! but keeps t as an argument?

We may need to do this in multiple functions because we had assumed that t would be the same type as the one underlying the state vectors. In the past, I thought this was a desirable feature: we wanted the type restriction to ensure that the simulation took place entirely at Float32 or at Float64. We can first make the above change and see if we run into further issues?

LenkaNovak · 2023-08-22T00:36:14Z

Hi @kmdeck , thanks for following up on this! 🚀 That's right! Looks like removing the type specification would be the quick (and probably only) fix. Thanks @dennisYatunin for the explanation. I agree that it would be better to have consistent types, but I also get the float precision issue. Do you guys have tests that run with Float32?

kmdeck · 2023-08-22T15:40:27Z

> Hi @kmdeck , thanks for following up on this! 🚀 That's right! Looks like removing the type specification would be the quick (and probably only) fix. Thanks @dennisYatunin for the explanation. I agree that it would be better to have consistent types, but I also get the float precision issue. Do you guys have tests that run with Float32?

We have unit tests that run with Float32, but all of our integrations (in experiments, etc.) use Float64, I think. I'll make sure we can run with Float32 before merging anything!

kmdeck · 2023-08-24T17:45:15Z

> > Hi @kmdeck , thanks for following up on this! 🚀 That's right! Looks like removing the type specification would be the quick (and probably only) fix. Thanks @dennisYatunin for the explanation. I agree that it would be better to have consistent types, but I also get the float precision issue. Do you guys have tests that run with Float32?
>
> We have unit tests that run with Float32, but all of our integrations (in experiments, etc.) use Float64, I think. I'll make sure we can run with Float32 before merging anything!

@LenkaNovak it turns out that we have implicit requirements that t be the same float type as the state in a lot of places, so it will be a bigger change for us to get this to run. For example, whenever we prescribe a time-varying BC (reanalysis data, etc.), the BC ends up being the same type as t, or a combination of quantities that have the type of t and quantities that have the state's FT.

What is the priority on this?

LenkaNovak · 2023-08-25T04:49:29Z

Hmm... I thought you guys don't use the t variable in the locally defined dss! function like here. All we should need is for the t::FT to be changed to just t on that line. Or is that not always the case? 🤔
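The effect of changing `t::FT` to `t` can be sketched with plain Julia dispatch. The two functions below are hypothetical stand-ins with placeholder bodies; they are not the ClimaLSM or ClimaCore definitions, and a `Vector` replaces the real state type purely for illustration.

```julia
# Hypothetical stand-in with the restrictive signature: t must have the same
# float type FT as the state. The body is a placeholder, not a real dss call.
function dss_restricted!(Y::Vector{FT}, p, t::FT) where {FT}
    return Y
end

# Same placeholder, but t is left untyped, so any time type is accepted.
function dss_relaxed!(Y::Vector{FT}, p, t) where {FT}
    return Y
end

Y = Float32[1, 2, 3]   # Float32 state
t = 0.0                # Float64 time, as handed over by the integrator

# With a Float32 state and a Float64 time, no method matches.
restricted_fails = try
    dss_restricted!(Y, nothing, t)
    false
catch err
    err isa MethodError
end

# The relaxed signature accepts the same call.
relaxed_ok = dss_relaxed!(Y, nothing, t) === Y
```

This only covers functions that ignore `t`; functions that compute with `t` can still pick up its type in their results.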

kmdeck · 2023-08-29T18:30:06Z

> Hmm... I thought you guys don't use the t variable in the locally defined dss! function like here. All we should need is for the t::FT to be changed to just t on that line. Or is that not always the case? 🤔

The issue is actually that we assume t is the same type as the state in many places (not just here, but also in time-varying boundary conditions, using reanalysis data, etc.). We didn't adequately test running with Float32; I think since switching to ClimaTimeSteppers we lost this capability and can only run with Float64.
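The way a Float64 `t` can leak into a Float32 computation follows from Julia's promotion rules, and a small sketch shows it. The boundary-condition function `bc` and the state values below are hypothetical and chosen only for illustration.

```julia
FT = Float32
state = FT[280, 285, 290]   # hypothetical Float32 state values
t = 3600.0                  # Float64 time from the integrator

# Hypothetical time-varying boundary condition; its result type follows t.
bc(t) = 1 + sin(t / 86400) / 10

bc_value = bc(t)                    # Float64, because t is Float64
mixed = state .* bc_value           # promoted to Vector{Float64}
converted = state .* FT(bc_value)   # stays Vector{Float32} after conversion
```

Writing a promoted Float64 value into storage or a function signature that expects `FT` is the kind of place where such a mismatch would surface.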

LenkaNovak · 2023-08-29T21:24:11Z

I see, thank you for looking into this, @kmdeck . I think Atmos has similar problems, but it was on their radar to re-enable single precision soon. As for the question on priorities, I'm not totally sure. @cmbengue @tapios How much should we prioritize Float32 runs right now?

tapios · 2023-08-29T21:36:53Z

> I see, thank you for looking into this, @kmdeck . I think Atmos has similar problems, but it was on their radar to re-enable single precision soon. As for the question on priorities, I'm not totally sure. @cmbengue @tapios How much should we prioritize Float32 runs right now?

It depends on what "right now" means. It's not crucial, I think, for the next 4-6 weeks or so. But beyond that, we'd like to be able to do coupled runs (on GPUs) in Float32.

dennisYatunin · 2023-08-29T21:39:21Z

It's also worth noting that we will probably continue to avoid using Float32 to represent t; it just leads to too much roundoff error for long simulations. The most viable options now are Float64 and maybe also DateTime (though using the latter would require changing a lot of code where we assume that t is a number). Perhaps the fastest solution for ClimaLSM would be to wrap t in FT(t) whenever it is passed from the integrator to a tendency function.
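The `FT(t)` suggestion could look roughly like the sketch below. Both function names are hypothetical, the tendency body is a toy expression, and a `Vector` stands in for the real state type; this is an illustration of the idea under those assumptions, not ClimaLSM code.

```julia
# Hypothetical tendency that assumes t has the same float type as the state.
function tendency!(dY::Vector{FT}, Y::Vector{FT}, p, t::FT) where {FT}
    @. dY = -Y + sin(t)   # toy right-hand side for illustration
    return nothing
end

# Hypothetical wrapper handed to the integrator: it accepts any numeric t
# (e.g. Float64) and converts it to the state's float type before the call.
function wrapped_tendency!(dY::Vector{FT}, Y::Vector{FT}, p, t) where {FT}
    tendency!(dY, Y, p, FT(t))
    return nothing
end

Y = Float32[1, 2]
dY = similar(Y)
wrapped_tendency!(dY, Y, nothing, 0.0)   # Float64 time from the integrator
```

With this pattern the integrator keeps tracking time in Float64, and precision is reduced only at the point where `t` enters the tendency. The converted value is still subject to Float32 resolution at large `t`.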

kmdeck mentioned this issue Aug 24, 2023

type restriction on t CliMA/ClimaLand.jl#308

Closed

1 task

kmdeck mentioned this issue Sep 8, 2023

Cannot run simulations with Float32 CliMA/ClimaLand.jl#327

Closed

juliasloan25 mentioned this issue Oct 2, 2023

enable Float32 compatibility #449

Closed

LenkaNovak added this to the O5.1.x (coupler) Software Improvements for ClimaCoupler #516 milestone Feb 21, 2024

juliasloan25 added the good first issue label Jun 22, 2024

juliasloan25 added the 💰 Grab Bag label Sep 18, 2024

juliasloan25 mentioned this issue Sep 20, 2024

rm weighted_dss_slab #959

Merged

4 tasks

Sbozzolo assigned juliasloan25 Sep 23, 2024

juliasloan25 closed this as completed in #959 Oct 14, 2024
