Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimize polyfit #9766

Merged
merged 8 commits into from
Nov 13, 2024
Merged

Optimize polyfit #9766

merged 8 commits into from
Nov 13, 2024

Conversation

dcherian
Copy link
Contributor

@dcherian dcherian commented Nov 11, 2024

Closes #5629

  1. Use Variable instead of DataArray
  2. Use reshape_blockwise when possible following Polyfit performance on large datasets - Suboptimal dask task graph #5629 (comment)
  • User visible changes (including notable bug fixes) are documented in whats-new.rst

cc @phofl

Closes pydata#5629

1. Use Variable instead of DataArray
2. Use `reshape_blockwise` when possible following pydata#5629 (comment)
@dcherian dcherian force-pushed the polyfit-blockwise-reshape branch from 2bfd6af to ee7b6d3 Compare November 11, 2024 16:13
@dcherian dcherian force-pushed the polyfit-blockwise-reshape branch from 9295f17 to 3ec786c Compare November 11, 2024 17:31
doc/whats-new.rst Outdated Show resolved Hide resolved
@dcherian dcherian requested a review from aulemahal November 11, 2024 23:21
@dcherian
Copy link
Contributor Author

@aulemahal can you check that xclim's test suite passes with this please

@phofl
Copy link
Contributor

phofl commented Nov 12, 2024

Nice!

doc/whats-new.rst Outdated Show resolved Hide resolved
@aulemahal
Copy link
Contributor

@dcherian I can confirm the test suite of xclim passes successfully when tested against this branch!

Co-authored-by: Maximilian Roos <[email protected]>
@dcherian dcherian added the plan to merge Final call for comments label Nov 12, 2024
@dcherian dcherian merged commit 5a9ff0b into pydata:main Nov 13, 2024
34 of 35 checks passed
dcherian added a commit to dcherian/xarray that referenced this pull request Nov 16, 2024
* main:
  fix cf decoding of grid_mapping (pydata#9765)
  Allow wrapping `np.ndarray` subclasses (pydata#9760)
  Optimize polyfit (pydata#9766)
  Use `map_overlap` for rolling reductions with Dask (pydata#9770)
  fix html repr indexes section (pydata#9768)
@dcherian dcherian deleted the polyfit-blockwise-reshape branch November 17, 2024 16:15
dcherian added a commit that referenced this pull request Nov 19, 2024
* main: (24 commits)
  Bump minimum versions (#9796)
  Namespace-aware `xarray.ufuncs` (#9776)
  Add prettier and pygrep hooks to pre-commit hooks (#9644)
  `rolling.construct`: Add `sliding_window_kwargs` to pipe arguments down to `sliding_window_view` (#9720)
  Bump codecov/codecov-action from 4.6.0 to 5.0.2 in the actions group (#9793)
  Buffer types (#9787)
  Add download stats badges (#9786)
  Fix open_mfdataset for list of fsspec files (#9785)
  add 'User-Agent'-header to pooch.retrieve (#9782)
  Optimize `ffill`, `bfill` with dask when `limit` is specified (#9771)
  fix cf decoding of grid_mapping (#9765)
  Allow wrapping `np.ndarray` subclasses (#9760)
  Optimize polyfit (#9766)
  Use `map_overlap` for rolling reductions with Dask (#9770)
  fix html repr indexes section (#9768)
  Bump pypa/gh-action-pypi-publish from 1.11.0 to 1.12.2 in the actions group (#9763)
  unpin array-api-strict, as issues are resolved upstream (#9762)
  rewrite the `min_deps_check` script (#9754)
  CI runs ruff instead of pep8speaks (#9759)
  Specify copyright holders in main license file (#9756)
  ...
dcherian added a commit to dcherian/xarray that referenced this pull request Nov 19, 2024
* main:
  Bump minimum versions (pydata#9796)
  Namespace-aware `xarray.ufuncs` (pydata#9776)
  Add prettier and pygrep hooks to pre-commit hooks (pydata#9644)
  `rolling.construct`: Add `sliding_window_kwargs` to pipe arguments down to `sliding_window_view` (pydata#9720)
  Bump codecov/codecov-action from 4.6.0 to 5.0.2 in the actions group (pydata#9793)
  Buffer types (pydata#9787)
  Add download stats badges (pydata#9786)
  Fix open_mfdataset for list of fsspec files (pydata#9785)
  add 'User-Agent'-header to pooch.retrieve (pydata#9782)
  Optimize `ffill`, `bfill` with dask when `limit` is specified (pydata#9771)
  fix cf decoding of grid_mapping (pydata#9765)
  Allow wrapping `np.ndarray` subclasses (pydata#9760)
  Optimize polyfit (pydata#9766)
  Use `map_overlap` for rolling reductions with Dask (pydata#9770)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
plan to merge Final call for comments topic-dask
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Polyfit performance on large datasets - Suboptimal dask task graph
4 participants