Fix wrong output for collect_list/collect_set of lists column #15243

Merged: 10 commits merged into rapidsai:branch-24.04 from bug_collect_set on Mar 13, 2024

ttnghia (Contributor) commented Mar 6, 2024

This fixes a bug in the reduction code that appears specifically when collect_list or collect_set is applied to a lists column. The output of these reduction operations should be a list scalar holding a column with exactly the same type structure as the input. However, when every row of the input column is null, the output list scalar holds an empty column with the wrong type structure.
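The invariant described above can be illustrated with a small toy model. This is only a sketch in plain Python, not libcudf code: `DType`, `Column`, `collect_list`, and `collect_set` are hypothetical stand-ins invented for illustration, and the model says nothing about how the actual fix is implemented. It shows only the expected behavior: the collected column keeps the input's type structure even when every input row is null.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DType:
    """Hypothetical nested type descriptor: a name plus a child type for LIST."""
    name: str
    child: Optional["DType"] = None

    def __str__(self) -> str:
        return f"LIST<{self.child}>" if self.name == "LIST" else self.name


@dataclass
class Column:
    """Hypothetical column: a type plus row values, where None marks a null row."""
    dtype: DType
    values: list


def collect_list(col: Column) -> Column:
    """Toy reduction: gather all non-null rows into one column.

    The result always reuses the input's dtype, so an all-null input
    yields an empty column with the same type structure.
    """
    return Column(col.dtype, [v for v in col.values if v is not None])


def collect_set(col: Column) -> Column:
    """Toy reduction: like collect_list, but drops duplicate rows."""
    unique = []
    for v in col.values:
        if v is not None and v not in unique:
            unique.append(v)
    return Column(col.dtype, unique)


# Input type mirrors the linked issue: rows of LIST<LIST<INT32>>, all null.
row_type = DType("LIST", DType("LIST", DType("INT32")))
all_null = Column(row_type, [None, None, None])

result = collect_set(all_null)
print(len(result.values))   # 0
print(result.dtype)         # LIST<LIST<INT32>>
```

The point of the sketch is that the empty result is built from the input's type rather than from some default, which is the property the PR description says was violated.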

Closes #14924.

ttnghia added 4 commits March 6, 2024 10:42
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
ttnghia added the bug, 3 - Ready for Review, libcudf, Spark, and non-breaking labels on Mar 6, 2024
ttnghia self-assigned this on Mar 6, 2024
ttnghia requested a review from a team as a code owner on March 6, 2024 20:58
ttnghia requested a review from bdice on March 12, 2024 23:11
bdice removed the 3 - Ready for Review label on Mar 13, 2024
bdice (Contributor) commented Mar 13, 2024

/merge

rapids-bot (bot) merged commit 2020ddd into rapidsai:branch-24.04 on Mar 13, 2024
73 checks passed
ttnghia deleted the bug_collect_set branch on March 13, 2024 14:39
Labels
bug: Something isn't working
libcudf: Affects libcudf (C++/CUDA) code.
non-breaking: Non-breaking change
Spark: Functionality that helps Spark RAPIDS
Development

Successfully merging this pull request may close these issues.

[BUG] collect_set on a LIST<LIST<INT32>> with all null entries returns the wrong result type
3 participants