-
Notifications
You must be signed in to change notification settings - Fork 920
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add shallow hash function and shallow equality comparison for column_view #9312
Add shallow hash function and shallow equality comparison for column_view #9312
Conversation
Co-authored-by: Jake Hemstad <[email protected]>
…/cudf into fea-shallow_equal_columnview
Co-authored-by: Jake Hemstad <[email protected]>
…low_hash_columnview
Co-authored-by: Jake Hemstad <[email protected]>
…/cudf into fea-shallow_hash_columnview
Co-authored-by: David Wendt <[email protected]>
rerun tests |
Codecov Report
@@ Coverage Diff @@
## branch-21.12 #9312 +/- ##
================================================
+ Coverage 10.79% 10.83% +0.04%
================================================
Files 116 116
Lines 18869 19255 +386
================================================
+ Hits 2036 2087 +51
- Misses 16833 17168 +335
Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm assuming #9185 was reverted solely due to the need for SWIPAT approval, because AFAICT the contents of this PR are identical to that one and LGTM.
I confirmed that the contents are identical to #9185 with @karthikeyann offline. Since that PR received four approvals before it was merged I'm comfortable going ahead and merging this PR with a single review. |
@gpucibot merge |
Fixes #9140
Added
shallow_hash(column_view)
Added unit tests
SWIPAT approval complete
It computes hash values based on the shallow states of
column_view
:type, size, data pointer, null_mask pointer, offset, and the hash value of the children.
null_count
is not used since it is a cached value and it may vary based on contents ofnull_mask
, and may be pre-computed or not.Fixes #9139
Added
is_shallow_equivalent(column_view, column_view)
shallow_equalAdded unit tests
It compares two column_views based on the shallow states of column_view:
type, size, data pointer, null_mask pointer, offset, and the column_view of the children.
null_count is not used since it is a cached value and it may vary based on contents of null_mask, and may be pre-computed or not.