Develop one steps #236
Conversation
This adds several features to the one-step pipeline:
* big zarr: everything is stored as one zarr file
* saves physics outputs
* some refactoring of the job submission
* save lat/lon grid variables from sfc_dt_atmos

Sample output: https://gist.github.com/nbren12/84536018dafef01ba5eac0354869fb67
Use the big zarr from the one-step workflow as input to the create training data pipeline.
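The sketch below illustrates the intended flow (the path is a placeholder, not the real bucket layout): the training-data pipeline opens the single consolidated zarr store produced by the one-step workflow instead of many per-timestep outputs.

```python
# Minimal sketch; the path below is hypothetical.
import xarray as xr

# One consolidated store written by the one-step workflow.
big_zarr = xr.open_zarr("one_step_output/big.zarr")

# Physics outputs and wrapper-extracted state variables live side by side,
# so the training-data pipeline can select whatever it needs from one dataset.
print(list(big_zarr.data_vars))
```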
This makes the diagnostic variables appended to the big zarr have the appropriate step and forecast_time dimensions, just as the variables extracted by the wrapper do.
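As a rough illustration of what "appropriate dimensions" means here, the xarray sketch below (variable name, sizes, and coordinate values are made up) adds `step` and `forecast_time` dims to a diagnostic before it is appended:

```python
# Hedged sketch: names and sizes are assumptions for illustration only.
import numpy as np
import xarray as xr

# A physics diagnostic as it might come out of the model: (tile, y, x).
diag = xr.DataArray(
    np.zeros((6, 48, 48)), dims=["tile", "y", "x"], name="net_heating"
)

# Give it the same leading dims the wrapper-extracted variables carry,
# so it can be appended to the big zarr without a dimension mismatch.
diag = diag.expand_dims(
    step=["after_physics"], forecast_time=[np.timedelta64(60, "s")]
)
print(diag.dims)  # now includes 'step' and 'forecast_time'
```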
This accomplishes two things: 1) it prevents true model zeros from being cast to NaNs in the one-step big zarr output, and 2) it initializes the big zarr arrays with NaNs via `full`, so that if they are not filled in (because of a failed timestep or some other reason) this is more apparent than with `empty`, which produces arbitrary values.
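A minimal sketch of point 2, assuming a zarr-backed array (shapes and chunk sizes are placeholders):

```python
# Pre-allocate with NaN fill values (zarr.full) instead of zarr.empty so that
# any region never written, e.g. because a timestep crashed, reads back as NaN
# rather than as arbitrary, plausible-looking numbers.
import numpy as np
import zarr

arr = zarr.full(
    shape=(10, 6, 79, 48, 48),   # e.g. (time, tile, z, y, x); sizes are made up
    chunks=(1, 6, 79, 48, 48),
    fill_value=np.nan,
    dtype="f4",
)

# A failed timestep is simply never written; reading its slab yields NaNs,
# which downstream checks can detect explicitly.
print(np.isnan(arr[0]).all())  # True until this slab is filled in
```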
Remove references to hard-coded dims and data variables, or imports from vcm.cubedsphere.constants, and replace them with arguments. Coords and dims can now be provided as args for the mappable var.
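The hypothetical sketch below (function name, defaults, and coordinate names are assumptions, not the actual vcm API) shows the shape of the change: dims and coords arrive as arguments with defaults instead of as constants imported from vcm.cubedsphere.constants inside the function body.

```python
# Hypothetical signature for illustration; not the real vcm implementation.
import xarray as xr

def mappable_var(
    ds: xr.Dataset,
    var_name: str,
    x_dim: str = "grid_xt",
    y_dim: str = "grid_yt",
    grid_coords=("lat", "lon", "latb", "lonb"),
) -> xr.Dataset:
    """Select var_name plus whichever grid coords are present, for map plotting."""
    if not {x_dim, y_dim} <= set(ds[var_name].dims):
        raise ValueError(f"{var_name} lacks the horizontal dims {x_dim}, {y_dim}")
    names = [var_name] + [c for c in grid_coords if c in ds]
    return ds[names]
```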
Allows starting the one-step jobs at a specified index in the timestep list, which is useful for testing and for skipping spinup timesteps.
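A small sketch of how such an option might look (the `--start-index` flag name and timestep strings are assumptions):

```python
# Slice the sorted timestep list at a start index before submitting jobs,
# e.g. to skip spinup timesteps or to launch a short test run.
import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    "--start-index", type=int, default=0,
    help="index in the timestep list at which to start submitting one-step jobs",
)
args = parser.parse_args(["--start-index", "2"])

timesteps = sorted(["20160801.001500", "20160801.003000",
                    "20160801.004500", "20160801.010000"])
to_run = timesteps[args.start_index:]
print(f"submitting {len(to_run)} of {len(timesteps)} timesteps")
```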
* change order of required args so output is last
* fix arg for onestep input to be dir containing big zarr
* update end to end integration test ymls
* prognostic run adjustments
This PR introduces several improvements to the logging capability of our prognostic run image:
- include upstream changes to disable output capturing in `fv3config.fv3run`
- Add a `capture_fv3gfs_func` function. When called, this captures the raw fv3gfs output and re-emits it as DEBUG-level logging statements that can more easily be filtered (see the sketch after this list).
- Refactor `runtime` to `external/runtime/runtime`. This was easy since it did not depend on any other module in fv3net (except implicitly the code in `fv3net.regression`, which is imported when loading the sklearn model with pickle).
- update fv3config to master
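The sketch below conveys the idea behind `capture_fv3gfs_func` rather than its actual implementation: anything the model prints to stdout is buffered and re-emitted line by line as DEBUG records, so it can be filtered with the normal logging-level machinery.

```python
# Illustrative only; the real function may capture output at the file-descriptor
# level so that Fortran/C prints are caught as well.
import logging
import sys

logger = logging.getLogger("fv3gfs")

class _DebugWriter:
    """File-like object that forwards complete lines to logger.debug."""

    def __init__(self):
        self._buffer = ""

    def write(self, text):
        self._buffer += text
        while "\n" in self._buffer:
            line, self._buffer = self._buffer.split("\n", 1)
            if line:
                logger.debug(line)

    def flush(self):
        pass

def capture_stdout_as_debug():
    """Redirect raw model output on stdout into DEBUG-level log records."""
    sys.stdout = _DebugWriter()

# Usage: call once before stepping the model; configure the root logger at INFO
# to hide the raw model chatter, or at DEBUG to see it.
```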
Review comment on the diff line `import logging` (context: `__all__ = ["open_model", "predict", "update"]`):
This is leftover from when I was trying to debug something. I'll remove it.
Exactly. It makes the build more deterministic, but ultimately we might move towards a tool like poetry. I don't know if the prognostic run really needs …
* update history
* fix positional args
* fix function args
* update history
* linting
Approving: any fixes required if the integration test fails will be done in a new PR to master.
I agree... it seems like it should just be fv3gfs-python + scikit-learn + zarr + fv3config at the same version as the corresponding fv3net image. Hopefully we can get it closer to that.
Great to get this merged into master. I imagine there will be some more fixes to be done, but I think it's fine to do a subsequent PR to master.