Add scan_aggregation and reduce_aggregation derived types. #10357
Pinging @jrhemstad.
@karthikeyann, there are some comments in the existing Python aggregations that I wasn't sure about (cudf/python/cudf/cudf/_lib/aggregation.pyx, line 227 in 900d55c). If there's anything I can do in this PR to address them, let me know.
Codecov Report
@@ Coverage Diff @@
## branch-22.04 #10357 +/- ##
================================================
+ Coverage 86.13% 86.16% +0.02%
================================================
Files 139 139
Lines 22460 22457 -3
================================================
+ Hits 19347 19351 +4
+ Misses 3113 3106 -7
Comments attached. Reading issue #7106 helped me understand a lot more of this PR. A cross-link to that related issue would be nice to have in the PR description for future readers.
@nvdbaranec
In the interest of keeping this PR down in size, I'll do this work as a second PR. This PR is already a prereq for other high priority work (implementing percentile_approx as a reduction).
Added an issue for followup (assigned to me).
From the Java perspective this looks fine to me.
One fix for consistency -- otherwise LGTM. Thanks!
Looks good. 👍
Just a couple of C++ suggestions.
@gpucibot merge
Fixes benchmarks compile errors introduced by #10357

Example:
```
/cudf/cpp/benchmarks/reduction/reduce.cpp: In function ‘void BM_reduction(benchmark::State&, const std::unique_ptr<cudf::aggregation>&)’:
/cudf/cpp/benchmarks/reduction/reduce.cpp:52:46: error: invalid initialization of reference of type ‘const std::unique_ptr<cudf::reduce_aggregation>&’ from expression of type ‘const std::unique_ptr<cudf::aggregation>’
   52 |     auto result = cudf::reduce(input_column, agg, output_dtype);
```
Aggregation types for reduce and scan were modified to include template types.

Authors:
- David Wendt (https://github.com/davidwendt)

Approvers:
- Vyas Ramasubramani (https://github.com/vyasr)
- https://github.com/nvdbaranec

URL: #10428
This PR adds the `scan_aggregation` and `reduce_aggregation` derived types. With it, all concrete aggregation types are now derived from algorithm-specific subtypes.
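To illustrate the idea of algorithm-specific subtypes, here is a minimal sketch using simplified stand-in classes. These are not the actual cudf definitions: the class bodies, the `name()` method, the scan-only `rank_aggregation`, and the string-returning `reduce`/`scan` functions are all assumptions made for the example. It shows why a generic `std::unique_ptr<aggregation>` no longer binds to a `reduce` parameter (the kind of error quoted in the benchmark fix above) and how a factory templated on the desired base type avoids it.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <type_traits>

// Hypothetical stand-in for the common base type (not the real cudf class).
struct aggregation {
  virtual ~aggregation() = default;
  virtual std::string name() const = 0;
};

// Algorithm-specific subtypes: each marks "usable with this algorithm".
struct reduce_aggregation : virtual aggregation {};
struct scan_aggregation : virtual aggregation {};

// A concrete aggregation that is valid for both reduce and scan.
struct sum_aggregation final : reduce_aggregation, scan_aggregation {
  std::string name() const override { return "sum"; }
};

// A concrete aggregation that, in this sketch, is valid for scan only.
struct rank_aggregation final : scan_aggregation {
  std::string name() const override { return "rank"; }
};

// Factory templated on the base type the caller needs.
template <typename Base = aggregation>
std::unique_ptr<Base> make_sum_aggregation() {
  return std::make_unique<sum_aggregation>();
}

// Each algorithm accepts only its own subtype.
std::string reduce(std::unique_ptr<reduce_aggregation> const& agg) {
  return "reduce:" + agg->name();
}
std::string scan(std::unique_ptr<scan_aggregation> const& agg) {
  return "scan:" + agg->name();
}

// A generic aggregation pointer cannot be passed where a reduce_aggregation
// is required; this mirrors the "invalid initialization" compile error.
static_assert(!std::is_convertible<std::unique_ptr<aggregation> const&,
                                   std::unique_ptr<reduce_aggregation> const&>::value,
              "generic aggregation must not bind to a reduce parameter");

// A scan-only aggregation is likewise rejected by reduce at compile time.
static_assert(!std::is_convertible<std::unique_ptr<rank_aggregation>,
                                   std::unique_ptr<reduce_aggregation>>::value,
              "scan-only aggregation must not convert to reduce_aggregation");

// Usage: request the subtype that matches the algorithm being called.
std::string demo() {
  auto r = make_sum_aggregation<reduce_aggregation>();
  auto s = make_sum_aggregation<scan_aggregation>();
  return reduce(r) + "," + scan(s);
}
```

In this sketch, an aggregation that is unsupported by an algorithm is rejected when the code is compiled rather than when it runs, which is the main benefit of deriving concrete types from per-algorithm subtypes.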