Implement groupby in pylibcudf #14945

Merged: 35 commits, Feb 2, 2024

@vyasr vyasr commented Feb 1, 2024

Description

This PR implements groupby in pylibcudf, along with the minimal set of aggregation logic needed to support it. To limit scope, it does not include other aggregation logic, e.g. for non-groupby reductions and scans. Because this PR is already large, I have also omitted the changes required to use pylibcudf in the existing cudf Cython code; those will come in a follow-up. The diff is misleadingly large: much of it adds documentation and function declarations, which shouldn't impose too heavy a cognitive load in review.
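For readers unfamiliar with the general shape of a groupby driven by per-column aggregation requests, here is a minimal pure-Python sketch of the pattern. It is only an illustration of the semantics, not the pylibcudf API added in this PR: the names `AggRequest` and `groupby_aggregate` are hypothetical, and it runs on plain Python lists rather than GPU columns.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass
class AggRequest:
    """Hypothetical: one value column plus the aggregations to apply to it."""

    values: Sequence[float]
    aggregations: Sequence[Callable[[list], float]]


def groupby_aggregate(keys, requests):
    """Group rows by key, then apply every requested aggregation per group."""
    # Map each distinct key to the row indices that carry it.
    # dicts preserve insertion order, so groups appear in first-seen order.
    groups = defaultdict(list)
    for row, key in enumerate(keys):
        groups[key].append(row)

    unique_keys = list(groups)
    results = []
    for request in requests:
        # One output column per aggregation, with one entry per group.
        per_request = [
            [agg([request.values[i] for i in groups[k]]) for k in unique_keys]
            for agg in request.aggregations
        ]
        results.append(per_request)
    return unique_keys, results


keys = ["a", "b", "a", "b", "a"]
values = [1.0, 2.0, 3.0, 4.0, 5.0]
unique_keys, results = groupby_aggregate(keys, [AggRequest(values, [sum, max])])
print(unique_keys)  # ['a', 'b']
print(results)      # [[[9.0, 6.0], [5.0, 4.0]]]
```

The result is one list of output columns per request, which keeps each value column's aggregations together while sharing a single grouping pass over the keys.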

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@vyasr vyasr added the feature request (New feature or request) and non-breaking (Non-breaking change) labels Feb 1, 2024
@vyasr vyasr self-assigned this Feb 1, 2024
@vyasr vyasr requested a review from a team as a code owner February 1, 2024 00:40
@vyasr vyasr requested review from shwina and mroeschke February 1, 2024 00:40
@github-actions github-actions bot added the Python (Affects Python cuDF API) and CMake (CMake build issue) labels Feb 1, 2024

@shwina shwina left a comment

Looks good to me - just some small doc requests.

vyasr commented Feb 2, 2024

/merge

@rapids-bot rapids-bot bot merged commit b80f4e2 into rapidsai:branch-24.04 Feb 2, 2024
69 checks passed
@vyasr vyasr deleted the feat/pylibcudf_groupby branch February 2, 2024 23:55
rapids-bot bot pushed a commit that referenced this pull request Feb 5, 2024
This PR builds on #14945 to use pylibcudf's groupby in cudf's internals. It should not be merged until after that PR.

Authors:
  - Vyas Ramasubramani (https://github.com/vyasr)

Approvers:
  - Ashwin Srinath (https://github.com/shwina)

URL: #14946
@vyasr vyasr added the pylibcudf (Issues specific to the pylibcudf package) label May 28, 2024