Import Tree from dask-awkward if not in dask #164

martindurant · 2025-01-27T14:38:00Z

Following the disappearance of DataframeTreeReduction in upstream dask, it was copied to dask-awkward, making it a required dependency in the future.

cc @lgray

for more information, see https://pre-commit.ci

…-histogram into maybe_fix_tree

martindurant · 2025-01-27T14:46:15Z

Appears to fail on some dataframe-specific functions:

FAILED tests/test_boost.py::test_histogramdd_series - NotImplementedError: The legacy implementation is no longer supported
FAILED tests/test_boost.py::test_histogramdd_arrays_and_series - NotImplementedError: The legacy implementation is no longer supported
FAILED tests/test_boost.py::test_histogramdd_dataframe - NotImplementedError: The legacy implementation is no longer supported
FAILED tests/test_core.py::test_df_input[True] - NotImplementedError: The legacy implementation is no longer supported
FAILED tests/test_core.py::test_df_input[None] - NotImplementedError: The legacy implementation is no longer supported

I didn't even realise we supported this. I suppose we drop support for now, as with dak.to_dataframe?

martindurant · 2025-01-27T20:11:17Z

I haven't yet figured out why dask thinks we are trying to use pre-expr, there must be a config set somewhere.

Locally, I am still seeing some failures, but the following without dask is mystifying me:

>>> x = np.random.standard_normal(size=(3_000,))
>>> h2 = bh.Histogram(bh.axis.Regular(10, -3, 3))
>>> h2.fill(x)
>>> h2.to_numpy(dd=True, flow=True)
(array([  5.,  17.,  79., 246., 456., 711., 666., 487., 241.,  70.,  20.,
          2.]),
 [array([1.79769313e+308, 1.79769313e+308, 1.79769313e+308, 1.79769313e+308,
         1.79769313e+308, 1.79769313e+308, 1.79769313e+308, 1.79769313e+308,
         1.79769313e+308, 1.79769313e+308, 1.79769313e+308, 1.79769313e+308,
         1.79769313e+308])])

why are the edges messed up?? The values look right. Is there some copy thing with numpy 2 I should care about?

^ this only happens for flow=True

pfackeldey · 2025-01-29T15:05:23Z

I haven't yet figured out why dask thinks we are trying to use pre-expr, there must be a config set somewhere.

Locally, I am still seeing some failures, but the following without dask is mystifying me:
>>> x = np.random.standard_normal(size=(3_000,))
>>> h2 = bh.Histogram(bh.axis.Regular(10, -3, 3))
>>> h2.fill(x)
>>> h2.to_numpy(dd=True, flow=True)
(array([  5.,  17.,  79., 246., 456., 711., 666., 487., 241.,  70.,  20.,
          2.]),
 [array([1.79769313e+308, 1.79769313e+308, 1.79769313e+308, 1.79769313e+308,
         1.79769313e+308, 1.79769313e+308, 1.79769313e+308, 1.79769313e+308,
         1.79769313e+308, 1.79769313e+308, 1.79769313e+308, 1.79769313e+308,
         1.79769313e+308])])
why are the edges messed up?? The values look right. Is there some copy thing with numpy 2 I should care about?

^ this only happens for flow=True

I can not reproduce this boost-histogram (v1.5.0) issue with either numpy v2.0.0 or numpy v2.2.2 (latest). Maybe @henryiii has seen this behavior in the past?

martindurant · 2025-01-29T15:10:45Z

Updating boost-histogram to v1.5.0 fixed this.

martindurant and others added 4 commits January 27, 2025 09:37

Import Tree from dask-awkward if not in dask

8299fed

[pre-commit.ci] auto fixes from pre-commit.com hooks

19e0cd4

for more information, see https://pre-commit.ci

TEMP: install dask-awkward from main

298994e

Merge branch 'maybe_fix_tree' of https://github.com/martindurant/dask…

f4b5d93

…-histogram into maybe_fix_tree

rewrite is_dataframe/series_like

7d728fd

GaetanLepage mentioned this pull request Feb 7, 2025

python312Packages.dask: 2024.12.1 -> 2025.1.0 NixOS/nixpkgs#380052

Open

13 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Import Tree from dask-awkward if not in dask #164

Import Tree from dask-awkward if not in dask #164

martindurant commented Jan 27, 2025 •

edited

Loading

martindurant commented Jan 27, 2025

martindurant commented Jan 27, 2025 •

edited

Loading

pfackeldey commented Jan 29, 2025 •

edited

Loading

martindurant commented Jan 29, 2025

Import Tree from dask-awkward if not in dask #164

Are you sure you want to change the base?

Import Tree from dask-awkward if not in dask #164

Conversation

martindurant commented Jan 27, 2025 • edited Loading

martindurant commented Jan 27, 2025

martindurant commented Jan 27, 2025 • edited Loading

pfackeldey commented Jan 29, 2025 • edited Loading

martindurant commented Jan 29, 2025

martindurant commented Jan 27, 2025 •

edited

Loading

martindurant commented Jan 27, 2025 •

edited

Loading

pfackeldey commented Jan 29, 2025 •

edited

Loading