Control Parquet page size through Python API #11454

Merged: 20 commits into rapidsai:branch-22.10 on Aug 16, 2022

Conversation

@etseidl etseidl commented Aug 3, 2022

Description

Closes #10988

Exposes page_size_rows and page_size_bytes properties of the Parquet writer.
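
For reviewers who want to try the new options, here is a minimal sketch of how they might be passed from Python. It assumes the keyword names on `DataFrame.to_parquet` are `max_page_size_rows` and `max_page_size_bytes`; the description above only names the properties as page_size_rows and page_size_bytes, so check the signature in your cuDF version. `write_with_page_limits` is a hypothetical helper, not part of cuDF.

```python
def write_with_page_limits(df, path, *, max_page_size_rows=None,
                           max_page_size_bytes=None):
    """Write ``df`` to Parquet, forwarding only the page limits that were set.

    ``df`` is any object with a ``to_parquet(path, **kwargs)`` method.
    The keyword names below are an assumption about the cuDF Python API.
    """
    limits = {
        "max_page_size_rows": max_page_size_rows,
        "max_page_size_bytes": max_page_size_bytes,
    }
    kwargs = {}
    for name, value in limits.items():
        if value is None:
            # Leave the option out so the writer's own default applies.
            continue
        if value <= 0:
            raise ValueError(f"{name} must be positive, got {value}")
        kwargs[name] = int(value)
    df.to_parquet(path, **kwargs)
    return kwargs
```

With cuDF installed, a call would look something like `write_with_page_limits(cudf.DataFrame({"a": range(100_000)}), "out.parquet", max_page_size_rows=10_000)`. The writer may enforce its own minimum values for these limits, so the resulting pages are not guaranteed to match the requested sizes exactly.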

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@etseidl etseidl requested a review from a team as a code owner August 3, 2022 20:38
@etseidl etseidl requested review from bdice and charlesbluca August 3, 2022 20:38
@github-actions github-actions bot added the "Python" (Affects Python cuDF API) label Aug 3, 2022
@bdice bdice left a comment

Looks flawless to me. 😄

codecov bot commented Aug 3, 2022

Codecov Report

No coverage uploaded for pull request base (branch-22.10@f31e5bd).
The diff coverage is n/a.

@@               Coverage Diff               @@
##             branch-22.10   #11454   +/-   ##
===============================================
  Coverage                ?   86.47%           
===============================================
  Files                   ?      144           
  Lines                   ?    22856           
  Branches                ?        0           
===============================================
  Hits                    ?    19764           
  Misses                  ?     3092           
  Partials                ?        0           

@bdice bdice added the "improvement" (Improvement / enhancement to an existing function) and "non-breaking" (Non-breaking change) labels Aug 16, 2022
@galipremsagar commented

@gpucibot merge

@rapids-bot rapids-bot bot merged commit 0c4b319 into rapidsai:branch-22.10 Aug 16, 2022
@etseidl etseidl deleted the feature/python_page_size branch August 24, 2022 01:16
Labels
"improvement" (Improvement / enhancement to an existing function), "non-breaking" (Non-breaking change), "Python" (Affects Python cuDF API)
Development

Successfully merging this pull request may close these issues.

[FEA] Expose options to control page size when writing Parquet files in Python API