-
Notifications
You must be signed in to change notification settings - Fork 917
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[REVIEW] Adding decimal32
and decimal64
support to parquet reading
#6808
[REVIEW] Adding decimal32
and decimal64
support to parquet reading
#6808
Conversation
Please update the changelog in order to start CI tests. View the gpuCI docs here. |
Codecov Report
@@ Coverage Diff @@
## branch-0.17 #6808 +/- ##
===============================================
- Coverage 82.31% 81.97% -0.34%
===============================================
Files 93 96 +3
Lines 15358 16181 +823
===============================================
+ Hits 12642 13265 +623
- Misses 2716 2916 +200
Continue to review full report at Codecov.
|
decimal32
and decimal64
support to parquet readingdecimal32
and decimal64
support to parquet reading
Python tests are blocked on #6715 |
Current plan is to check in a ~2k test file to read in cpp until python tests are available. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Mostly doc change suggestions. I think we need a cuIO reviewer for this PR.
Would this be a non-breaking API change or a breaking change? I assume the former since it's only adding functionality. We need to know to set appropriate labels for the automerger. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Adding suggestions from review for comment changes. Co-authored-by: Mark Harris <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good!
Some suggestions related to test coverage.
…ere is no exception thrown. Added some commented code to attempt testing the double reading code. Implemented the great suggestions of explicit conversion to std::string for output in tests.
@hyperbolic2346 I think the description for this PR is out of date based on the above. Can you please update it since it will be used in the merge commit message? |
I think we also need enable |
I think that needs to be a separate PR to avoid blocking this from going in. |
This PR adds support for reading decimals in parquet into decimal32 and decimal64 cudf types. A test was added to test these types by embedding a parquet data file into the cpp file. This is temporary until python supports decimal and the tests move there.
partially closes issue #6474