-
Notifications
You must be signed in to change notification settings - Fork 915
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Include row group level stats when writing ORC files #10041
Merged
Merged
Changes from 12 commits
Commits
Show all changes
19 commits
Select commit
Hold shift + click to select a range
b8ae756
separate stats by level
vuule d6c18c8
encode rg stats
vuule cb2b972
rename putb
vuule 95f018c
add put_bytes
vuule c5b62b9
include rg stats in rg index entries
vuule f1f6958
docs
vuule c078879
fix; don't use optional
vuule 61733f9
Merge branch 'branch-22.02' of https://github.com/rapidsai/cudf into …
vuule 4bfe814
return written size from put_byte and put_bytes
vuule 2535754
reuse encode_field_number in protobuf writer
vuule 683a016
copyright year
vuule 61a8bec
comment
vuule fecf4d5
Merge branch 'branch-22.02' of https://github.com/rapidsai/cudf into …
vuule 509138b
Merge branch 'branch-22.02' of https://github.com/rapidsai/cudf into …
vuule febcd04
Merge branch 'branch-22.02' of https://github.com/rapidsai/cudf into …
vuule 4c82d89
host_span; static_assert
vuule 2afedce
ProtobufType enum
vuule 46a4a3c
style
vuule 6ea0a50
copyright year
vuule File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nit: maybe field number (2 in this case) should be an enum. I see that it's used in a lot of places though, so maybe a followup.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's doable, but there would need to be different enums for each ORC message type, since the numbers are not unique between messages (see https://orc.apache.org/specification/ORCv1/). We can have the set of enums (non-class) and still pass them as int. I would really need to do this in a follow up for this one to make it into 22.02.