Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Return a Series from JIT GroupBy apply, rather than a DataFrame #13820

Merged
merged 5 commits into from
Aug 9, 2023

Conversation

brandon-b-miller
Copy link
Contributor

Closes #13809

@brandon-b-miller brandon-b-miller added bug Something isn't working Python Affects Python cuDF API. non-breaking Non-breaking change labels Aug 4, 2023
@brandon-b-miller brandon-b-miller self-assigned this Aug 4, 2023
@brandon-b-miller brandon-b-miller marked this pull request as ready for review August 4, 2023 18:32
@brandon-b-miller brandon-b-miller requested a review from a team as a code owner August 4, 2023 18:32
expect = groupby_jit_data.to_pandas().groupby("key1").apply(func)
got = groupby_jit_data.groupby("key1").apply(func, engine="jit")

if special_val is np.nan:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This sounds like a test that should be marked as xfail. What do you think?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, this is a great question. We ran into this on the original JIT groupby apply PR, it's an outstanding issue.

There aren't any obvious solutions that I can see for this problem but I've raised an issue here to track it. I've also xfailed the test and linked there to he issue.

@brandon-b-miller
Copy link
Contributor Author

/merge

@rapids-bot rapids-bot bot merged commit ba6ff60 into rapidsai:branch-23.10 Aug 9, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working non-breaking Non-breaking change Python Affects Python cuDF API.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] Groupby-apply returns DataFrame rather than Series when taking the JIT path
3 participants