Fix wetday frequency correction #174

emileten · 2022-01-25T01:57:46Z

closes wet-day frequency correction uses only one value for each 'dry' data point #173
tests added / passed
docs reflect changes
entry in CHANGELOG.md

essential changes :

in the core function, instead of using a unique replacement value, use an array of (likely) different values, in process='pre'
values strictly below 1 are modified, rather than below or equal to 1, in process='pre'.
make tests deterministic (with a seed), smaller and more precise.

secondary changes :

in the core function, directly operate on the data array object within the dataset -- which means I have to introduce a var parameter -- default value is 'pr', but in tests we have other variable names. Need that parameter in the service as well.

emileten · 2022-01-25T02:05:19Z

@dgergel I requested your review for the method check.

@brews, in the computational side. I tested this on both Jupyter and Argo, and this ran within 3 mins using around 15GB of memory (that's because of the huge numpy sample array that I have to create, which has the shape of the GCM data). That fits in ~20GB resources for that step. We might need to increase slightly this number so that the ERA-5 data fits in.

But it will fit in the node and run fast.

brews

Thanks for cleaning this up, @emileten.

My one suggestion is to rename this new var argument to variable. This makes it consistent with the other functions and methods that grab a variable name. ...I see "var" and I think "variance" but that's an aside....

emileten · 2022-01-25T22:26:36Z

Thanks for cleaning this up, @emileten.

My one suggestion is to rename this new var argument to variable. This makes it consistent with the other functions and methods that grab a variable name. ...I see "var" and I think "variance" but that's an aside....

Thanks @brews, I changed this !

dgergel

This looks good, thanks @emileten for fixing this bug!

delgadom · 2022-01-26T20:06:09Z

dodola/core.py

-            ds > threshold, np.random.uniform(low=low, high=threshold)
+        ds[variable] = ds[variable].where(
+            ds[variable] >= threshold,
+            np.random.uniform(low=low, high=threshold, size=ds[variable].shape),


the issue is here - changing from np.random.uniform(low, high) to np.random.uniform(low, high, size) changes the returned type from pure python float to ndarray[np.float64]. xarray coerces the former to ds[variable].dtype, but defers to the higher precision data type of the latter

emileten added 5 commits January 25, 2022 10:44

correct the WDF correction

82c3b12

add a var parameter to the service

f9a09d3

improve test of WDF correction

86423fd

change log entry

9ea6043

format

b8c23e9

emileten requested review from brews and dgergel January 25, 2022 01:59

emileten self-assigned this Jan 25, 2022

brews approved these changes Jan 25, 2022

View reviewed changes

emileten added 2 commits January 26, 2022 07:25

replace var by variable

f7586dd

format

58982df

dgergel approved these changes Jan 25, 2022

View reviewed changes

emileten merged commit a903591 into ClimateImpactLab:main Jan 25, 2022

brews mentioned this pull request Jan 26, 2022

correct-wetday-frequency casts all output to float64 #175

Closed

delgadom reviewed Jan 26, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix wetday frequency correction #174

Fix wetday frequency correction #174

emileten commented Jan 25, 2022 •

edited

Loading

emileten commented Jan 25, 2022 •

edited

Loading

brews left a comment

emileten commented Jan 25, 2022

dgergel left a comment

delgadom Jan 26, 2022

Fix wetday frequency correction #174

Fix wetday frequency correction #174

Conversation

emileten commented Jan 25, 2022 • edited Loading

emileten commented Jan 25, 2022 • edited Loading

brews left a comment

Choose a reason for hiding this comment

emileten commented Jan 25, 2022

dgergel left a comment

Choose a reason for hiding this comment

delgadom Jan 26, 2022

Choose a reason for hiding this comment

emileten commented Jan 25, 2022 •

edited

Loading

emileten commented Jan 25, 2022 •

edited

Loading