Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add new native parquet footer API and deprecate the old one #362

Merged
merged 2 commits into from
Jul 11, 2022

Conversation

revans2
Copy link
Collaborator

@revans2 revans2 commented Jul 8, 2022

This adds a new API for parsing and filtering a parquet footer using native code. This will allow us to fix the issues associated with legacy parquet files. It does not fix any of those issues yet. This just adds in the API so we can start to use it without breaking anyone.

@revans2 revans2 added the enhancement New feature or request label Jul 8, 2022
@revans2 revans2 self-assigned this Jul 8, 2022
@revans2
Copy link
Collaborator Author

revans2 commented Jul 8, 2022

build

@revans2 revans2 changed the title Add new API that and deprecate the old one Add new native parquet footer API that and deprecate the old one Jul 11, 2022
andygrove
andygrove previously approved these changes Jul 11, 2022
@@ -63,6 +129,83 @@ public void close() throws Exception {
}
}

private static void depthFirstNamesHelper(SchemaElement se, String name, boolean makeLowerCase,
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

could a small comment block be added here describing why we need the helper?

for (ElementWithName child : st.children) {
depthFirstNamesHelper(child.element, child.name, makeLowerCase, names, numChildren);
}
} else if (se instanceof ListElement) {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit,

Suggested change
} else if (se instanceof ListElement) {
} else if (se instanceof ListElement) {

jlowe
jlowe previously approved these changes Jul 11, 2022
@revans2 revans2 dismissed stale reviews from jlowe and andygrove via 1396203 July 11, 2022 14:44
@revans2
Copy link
Collaborator Author

revans2 commented Jul 11, 2022

build

@revans2
Copy link
Collaborator Author

revans2 commented Jul 11, 2022

@abellina please take another look

@revans2 revans2 changed the title Add new native parquet footer API that and deprecate the old one Add new native parquet footer API and deprecate the old one Jul 11, 2022
@revans2 revans2 merged commit ba5de10 into NVIDIA:branch-22.08 Jul 11, 2022
@revans2 revans2 deleted the new_api_parquet_footer branch July 11, 2022 17:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants