ENH: numba engine in df.apply #54666

lithomas1 · 2023-08-21T17:33:45Z

closes #xxxx (Replace xxxx with the GitHub issue number)
Tests added and passed if fixing a bug or adding a new feature
All code checks passed.
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.

lithomas1 · 2023-09-07T15:55:47Z

I'm planning on following this up with general support for numba in df.apply.

mroeschke · 2023-09-08T17:23:01Z

pandas/core/frame.py

@@ -9919,6 +9919,8 @@ def apply(
        result_type: Literal["expand", "reduce", "broadcast"] | None = None,
        args=(),
        by_row: Literal[False, "compat"] = "compat",
+        engine: str = "python",
+        engine_kwargs: dict = {},


I think we usually default this as engine_kwargs: dict[str, bool] | None = None

mroeschke · 2023-09-08T17:23:11Z

pandas/core/frame.py

@@ -9919,6 +9919,8 @@ def apply(
        result_type: Literal["expand", "reduce", "broadcast"] | None = None,
        args=(),
        by_row: Literal[False, "compat"] = "compat",
+        engine: str = "python",


Could you type as a Literal here?

mroeschke · 2023-09-08T17:26:24Z

pandas/core/_numba/executor.py

+        else:
+            first_elem = values[0]
+            dim0 = values.shape[0]
+        res0 = nb_compat_func(first_elem)


Is inferring the shape from the first element similar to what we do for DataFrame.apply?

It would be good to note what type of UDFs are supported in the engine docstring

np.apply_along_axis, which we use does this.

see https://github.com/numpy/numpy/blob/d676a1fe2d495f9d8a86103644bed141c2e69787/numpy/lib/_shape_base_impl.py#L373-L380.

It would be good to note what type of UDFs are supported in the engine docstring

I'll add a note linking to numba's supported Python/numpy features.

lithomas1 · 2023-09-08T20:43:23Z

pre-commit.ci autofix

for more information, see https://pre-commit.ci

lithomas1 · 2023-09-08T22:48:41Z

pre-commit.ci autofix

for more information, see https://pre-commit.ci

mroeschke · 2023-09-11T16:34:12Z

Nice! Thanks @lithomas1

* ENH: numba engine in df.apply * fixes * more fixes * try to fix * address code review * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * go for green * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update type --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

lithomas1 · 2023-09-11T19:25:46Z

Thanks for the review!

ENH: numba engine in df.apply

a6add5c

lithomas1 added Apply Apply, Aggregate, Transform, Map numba numba-accelerated operations labels Aug 21, 2023

lithomas1 added 3 commits August 22, 2023 22:03

fixes

cca4656

more fixes

83d4598

try to fix

81e85cc

lithomas1 requested a review from mroeschke August 24, 2023 15:33

lithomas1 added 2 commits September 1, 2023 17:42

Merge branch 'main' into numba-raw-apply

c249a2c

Merge branch 'main' into numba-raw-apply

839a6d9

mroeschke reviewed Sep 8, 2023

View reviewed changes

address code review

5da0723

pre-commit-ci bot and others added 2 commits September 8, 2023 20:51

[pre-commit.ci] auto fixes from pre-commit.com hooks

dae9a15

for more information, see https://pre-commit.ci

go for green

dc5a734

pre-commit-ci bot and others added 3 commits September 8, 2023 22:51

[pre-commit.ci] auto fixes from pre-commit.com hooks

8504f45

for more information, see https://pre-commit.ci

update type

4fbd6ae

Merge branch 'main' into numba-raw-apply

9465038

mroeschke approved these changes Sep 11, 2023

View reviewed changes

mroeschke added this to the 2.2 milestone Sep 11, 2023

mroeschke merged commit ce5fdf0 into pandas-dev:main Sep 11, 2023

lithomas1 deleted the numba-raw-apply branch September 11, 2023 19:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: numba engine in df.apply #54666

ENH: numba engine in df.apply #54666

lithomas1 commented Aug 21, 2023

lithomas1 commented Sep 7, 2023

mroeschke Sep 8, 2023

mroeschke Sep 8, 2023

mroeschke Sep 8, 2023

lithomas1 Sep 8, 2023

lithomas1 commented Sep 8, 2023

lithomas1 commented Sep 8, 2023

mroeschke commented Sep 11, 2023

lithomas1 commented Sep 11, 2023

ENH: numba engine in df.apply #54666

ENH: numba engine in df.apply #54666

Conversation

lithomas1 commented Aug 21, 2023

lithomas1 commented Sep 7, 2023

mroeschke Sep 8, 2023

Choose a reason for hiding this comment

mroeschke Sep 8, 2023

Choose a reason for hiding this comment

mroeschke Sep 8, 2023

Choose a reason for hiding this comment

lithomas1 Sep 8, 2023

Choose a reason for hiding this comment

lithomas1 commented Sep 8, 2023

lithomas1 commented Sep 8, 2023

mroeschke commented Sep 11, 2023

lithomas1 commented Sep 11, 2023