-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Automatic parsing of dimension names leads to duplicated 'Ti' dimensions #468
Comments
I think this is still a problem that the parsing gives the same name to 2 dimensions. I still believe that the renaming from "time" (standard dimension name in 99% of climate and weather datasets) to "Ti" is not a good practice for interconnexion with other packages that often hardcode "time" in their packages (sometimes using PyCall to use some Python libraries). |
see rafaqz/DimensionalData.jl#499. as a side note I usually try to do |
@lazarusA thanks! |
Another example of this behaviour leading to sometimes strange results (or more spcifically, leading to a bad interpretation of what the data is). Coordinates saved to disk: Coordinates:
* lat (lat) float32 1kB 10.12 10.38 10.62 ... 73.88 74.12 74.38
* lead (lead) int64 32B 1 4 7 10
* lon (lon) float32 2kB -169.9 -169.6 -169.4 ... -50.38 -50.12
* quantiles (quantiles) float64 152B 0.05 0.1 0.15 0.2 ... 0.85 0.9 0.95
* forecast_date (forecast_date) datetime64[ns] 4kB 1981-01-15 ... 2024-03-15 Coordinates read by YAXArrays: (↓ quantiles Sampled{Float64} 0.05:0.05:0.95 ForwardOrdered Regular Points,
→ lon Sampled{Float32} -169.875f0:0.25f0:-50.125f0 ForwardOrdered Regular Points,
↗ lat Sampled{Float32} 10.125f0:0.25f0:74.375f0 ForwardOrdered Regular Points,
⬔ lead Sampled{Int64} 1:3:10 ForwardOrdered Regular Points,
◩ Ti Sampled{DateTime} [1981-01-15T00:00:00, …, 2024-03-15T00:00:00] ForwardOrdered Irregular Points) Hence, |
ok, definitely a bug. Any pointers @felixcremer ? we need to update the logic for this. |
The logic is basically in this line: YAXArrays.jl/src/DatasetAPI/Datasets.jl Line 288 in cac79bf
This means, whenever something is encoded as a CF convention time dimension, it will get the label Ti .
We could change this line to DD.rebuild(DD.name2dim(axname),tsteps[offs+1:end]) and then dimension names would remain untouched. I know in the beginning of our transition to DD we were doing this but then somehow ran into problems elsewhere but I can not remember what it was, maybe @felixcremer can remember. In the light of the issues brought up here I completely agree we should change something in the logic of when and if at all to parse things as |
Ideally I think the user should be able to decide if they want the re-labelling of the dimensions so that the arrays work better with plot recipes etc or not. Maybe this could even happen in an extra step. One suggestion would be that something like this would be possible: ds = open_dataset(path) # returns lon, lat, time etc...
ds_labeled = fixspacetimedims(ds) where the (to be written) |
I think having the option to relabel is a good suggestion. It would be nice to not lose the plot recipes and still keep the original dimensions names. Perhaps, within the Alternatively, is it possible to specify, at the plot recipe level, the X, Y, Ti dimensions to use? Something like : Makie.contourf(ds, X=:longitude, Y:latitude) Not sure what is best or easiest. |
I am wondering whether we could keep the dim name but make it a subtype of Ti if we detect it as a CF time dimension |
Hi! Love the package!
leads to:
The original dataset has a
time
and aprediction_timedelta
(forecast horizon). I cannot find where this is getting automatically parsed intoTi
. Maybe it is withinDimensionalData.jl
somewhere. Any way to override this behavior?The text was updated successfully, but these errors were encountered: