-
Notifications
You must be signed in to change notification settings - Fork 916
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add groupby scan aggregation to cudf #7759
Add groupby scan aggregation to cudf #7759
Conversation
…nt of aggregators.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry for the delay in reviewing this, was waiting until the changes from #7818 could be merged in. I think we can simplify this code a little and make it match the C++ internals by mapping cumulative operations to their non-cumulative counterparts earlier, but I've left a couple of comments to ascertain that this won't cause significant problems when we start supporting mixed scan/aggregate operations.
# pd.groupby.cumcount returns a series. | ||
if isinstance(expect_df, pd.Series): | ||
expect_df = expect_df.to_frame("val") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@shwina did you already write up the issue? If so, this conversation can be resolved.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One or two minor comments from me, otherwise looks ready.
@gpucibot merge |
closes #1296 Groupby cumulative count
closes #1298 Groupby cumulative sum