-
Notifications
You must be signed in to change notification settings - Fork 842
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Parquet] Add projection utility functions #6931
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great to me -- thank you @XiangpengHao
parquet/src/arrow/mod.rs
Outdated
/// Example: | ||
/// mask1 = [true, false, true] | ||
/// mask2 = [false, true, true] | ||
/// union(mask1, mask2) = [true, true, true] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The rendered docs look.a bit off:
I pushed a commit to make them a bit nicer
/// Example: | |
/// mask1 = [true, false, true] | |
/// mask2 = [false, true, true] | |
/// union(mask1, mask2) = [true, true, true] | |
/// Example: | |
/// mask1 = [true, false, true] | |
/// mask2 = [false, true, true] | |
/// union(mask1, mask2) = [true, true, true] |
Thank you @XiangpengHao and @tustvold |
* projection utilities * improve docs --------- Co-authored-by: Andrew Lamb <[email protected]>
* projection utilities * improve docs --------- Co-authored-by: Andrew Lamb <[email protected]>
Which issue does this PR close?
Part of #6921
Rationale for this change
To implement row level filtering for Parquet, we need to cache columns that are being filtered, i.e., the column is both in filter's predicate projection and in final output projection.
This PR adds two utility functions to union and interset two projections.
What changes are included in this PR?
Are there any user-facing changes?