-
Notifications
You must be signed in to change notification settings - Fork 933
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support segmented reductions and null mask reductions #9621
Conversation
I didn't read the implementation yet. But I assume that this is extremely similar to what sort-based groupby aggregate is doing. If this new implementation is more efficient than the current groupby aggregate implementation, we also need to switch to use segmented reduction for groupby aggregate too. In order to support this idea, please add a benchmark for comparison between their performance. |
I updated the docstring for |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This combined with #10302 looks to give me what I need
This PR adds an implicit conversion operator from `column_view` to `device_span<T const>`. The immediate purpose of this PR is to make it possible to use the API `segmented_reduce(column_view data, device_span<size_type> offsets, ...)` in PR #9621. This PR also resolves #9656 by adding a `column_view` constructor from `device_span<T const>`. More broadly, this PR should make it easier to refactor instances where `column.data()` is used with counting iterators to build transform iterators, or other patterns that require a length (e.g. vector factories to copy to host). Authors: - Bradley Dice (https://github.com/bdice) Approvers: - Mark Harris (https://github.com/harrism) - Jake Hemstad (https://github.com/jrhemstad) - David Wendt (https://github.com/davidwendt) URL: #10302
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have a few minor comments but I think this is sufficiently ready for merge. We have a few TODO items that we should probably track in an issue.
|
@gpucibot merge |
This adds in JNI support for #9621. It also adds in a helper API to allow us to do the processing on a list easily. Authors: - Robert (Bobby) Evans (https://github.com/revans2) Approvers: - Liangcai Li (https://github.com/firestarman) - Jason Lowe (https://github.com/jlowe) URL: #10413
closes #9135
closes #9552
This PR adds support for numeric types to
simple_op
,sum
,prod
,min
,max
,any
,all
. Also, this PR addssegmented_null_mask_reduction
to compute null mask reductions on segments.