Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ensure columns have valid null counts. #1148

Merged
merged 1 commit into from
May 18, 2023

Conversation

mythrocks
Copy link
Collaborator

This is in prep for rapidsai/cudf#11968 and rapidsai/cudf#13372.

libcudf will soon require that all CUDF columns are created with a known null-count. UNKNOWN_NULL_COUNT will no longer be supported, or even available as a code constant.

This change replicates part of rapidsai/cudf#13355, as it applies to row_conversion.cu. The (single) reference to the unknown-null-count is replaced with a pre-calculated value.

This is in prep for rapidsai/cudf#11968 and
rapidsai/cudf#13372.

`libcudf` will soon require that all CUDF columns are created with a
known null-count. `UNKNOWN_NULL_COUNT` will no longer be supported,
or even available as a code constant.

This change replicates part of rapidsai/cudf#13355,
as it applies to `row_conversion.cu`. The (single) reference to
the unknown-null-count is replaced with a pre-calculated value.

Signed-off-by: MithunR <[email protected]>
@mythrocks mythrocks added the bug Something isn't working label May 17, 2023
@mythrocks mythrocks self-assigned this May 17, 2023
@mythrocks mythrocks requested a review from hyperbolic2346 May 17, 2023 22:21
@mythrocks
Copy link
Collaborator Author

Build

@mythrocks mythrocks merged commit 34f7f10 into NVIDIA:branch-23.06 May 18, 2023
@mythrocks
Copy link
Collaborator Author

Thank you for reviewing, @revans2. I have merged this change.

mythrocks added a commit to mythrocks/spark-rapids-jni that referenced this pull request May 19, 2023
This is a followup to NVIDIA#1148.

`row_conversion.cu` was modified in rapidsai/cudf#13372
to explicitly calculate null-counts for output columns.

This commit replicates the changes in cudf/pull/13372, and adds explicit
null-count calculation for the string offsets column.

Signed-off-by: MithunR <[email protected]>
mythrocks added a commit to mythrocks/spark-rapids-jni that referenced this pull request May 22, 2023
This is a followup to NVIDIA#1148.

`row_conversion.cu` was modified in rapidsai/cudf#13372
to explicitly calculate null-counts for output columns.

This commit replicates the changes in cudf/pull/13372, and adds explicit
null-count calculation for the string offsets column.

Signed-off-by: MithunR <[email protected]>
mythrocks added a commit that referenced this pull request May 23, 2023
* Followup for null count fixup in row_conversion.cu.

This is a followup to #1148.

`row_conversion.cu` was modified in rapidsai/cudf#13372
to explicitly calculate null-counts for output columns.

This commit replicates the changes in cudf/pull/13372, and adds explicit
null-count calculation for the string offsets column.

Signed-off-by: MithunR <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants