Fix null hive-partition behavior in dask-cudf parquet #12866
Merged: rapids-bot merged 17 commits into rapidsai:branch-23.04 from rjzamora:null-hive-partition on Mar 10, 2023
rjzamora
added
2 - In Progress
Currently a work in progress
Python
Affects Python cuDF API.
dask
Dask issue
improvement
Improvement / enhancement to an existing function
non-breaking
Non-breaking change
labels
Feb 28, 2023
rjzamora
changed the title
[WIP] Fix null hive-partition behavior in dask-cudf parquet
Fix null hive-partition behavior in dask-cudf parquet
Mar 6, 2023
rjzamora
added
3 - Ready for Review
Ready for review by team
4 - Needs Dask Reviewer
and removed
2 - In Progress
Currently a work in progress
labels
Mar 7, 2023
bdice
reviewed
Mar 9, 2023
Looks good, with some minor comments.
…into null-hive-partition
bdice
approved these changes
Mar 9, 2023
Looks good to me. I don't know a lot about hive partitioning to verify the test, but the code appears fine.
rjzamora
added
5 - Ready to Merge
Testing and reviews complete, ready to merge
and removed
3 - Ready for Review
Ready for review by team
4 - Needs Dask Reviewer
labels
Mar 10, 2023
/merge
galipremsagar
approved these changes
Mar 10, 2023
rapids-bot bot
pushed a commit
that referenced
this pull request
Mar 15, 2023
…12930)
This is a follow-up "fix" for #12866. While that PR enables the writing/reading of null hive partitions using `dask_cudf`, it does not preserve the type of integer partition columns containing nulls. This PR should address the remaining issue.
Authors:
- Richard (Rick) Zamora (https://github.com/rjzamora)
Approvers:
- GALI PREM SAGAR (https://github.com/galipremsagar)
- Lawrence Mitchell (https://github.com/wence-)
URL: #12930
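For readers unfamiliar with hive partitioning, the following is a minimal pure-Python sketch of the two issues discussed in this thread: representing a null partition value as a directory name, and recovering integer values (with nulls) when reading the directory names back. It is not the `dask_cudf` implementation. The helper names `encode_partition`, `decode_partition`, and `as_nullable_ints` are hypothetical, and the sketch assumes the conventional Hive sentinel `__HIVE_DEFAULT_PARTITION__` is the null marker in use.

```python
from typing import List, Optional, Tuple

# Assumption: the conventional Hive sentinel for a null partition value.
HIVE_NULL = "__HIVE_DEFAULT_PARTITION__"


def encode_partition(column: str, value: Optional[object]) -> str:
    """Build a hive-style directory name such as 'id=1' (hypothetical helper)."""
    text = HIVE_NULL if value is None else str(value)
    return f"{column}={text}"


def decode_partition(dirname: str) -> Tuple[str, Optional[str]]:
    """Split 'column=value' and map the sentinel back to None (hypothetical helper)."""
    column, _, text = dirname.partition("=")
    return column, (None if text == HIVE_NULL else text)


def as_nullable_ints(values: List[Optional[str]]) -> List[Optional[int]]:
    """Convert decoded strings to ints while keeping None for nulls."""
    return [None if v is None else int(v) for v in values]


# Write side: one directory name per distinct partition value, including null.
dirs = [encode_partition("id", v) for v in (0, 1, None)]
print(dirs)  # ['id=0', 'id=1', 'id=__HIVE_DEFAULT_PARTITION__']

# Read side: decode the directory names, then restore the integer type.
decoded = [decode_partition(d)[1] for d in dirs]
print(as_nullable_ints(decoded))  # [0, 1, None]
```

The point of the last step is that a null entry should not force the whole partition column to fall back to strings or floats; the integer values keep their type and the null stays a null.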
Labels
5 - Ready to Merge
Testing and reviews complete, ready to merge
dask
Dask issue
improvement
Improvement / enhancement to an existing function
non-breaking
Non-breaking change
Python
Affects Python cuDF API.
Description
This PR includes a few simple changes to fix the handling of null hive partitions in `dask_cudf`.
Depends on dask/dask#10007
Checklist