Allow Java bindings to use default decimal precisions when writing columns #10276

Merged
merged 2 commits into from
Feb 15, 2022

Conversation

sperlingxx
Contributor

@sperlingxx sperlingxx commented Feb 14, 2022

Closes #9851

This PR fixes the bug raised in #9851. Currently, the Java bindings set a decimal precision for every column when building OrcWriterOptions, filling in zero for non-decimal columns, which carry no precision information. In principle, precision should be set only for decimal columns.

It is not easy to write a test for this change, since neither spark-rapids nor cuDF Java tries to read non-decimal data as a decimal type.
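
To illustrate the difference described above, here is a minimal, self-contained sketch. It does not use the cuDF Java API: the `Column` class and both helper methods are hypothetical names invented for this example, and the real change may be structured differently. It only contrasts "one precision entry per column, zero for non-decimals" with "precision entries for decimal columns only".

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class PrecisionSketch {
    // Hypothetical column descriptor for illustration; not a cuDF type.
    static final class Column {
        final String name;
        final boolean isDecimal;
        final int precision; // meaningful only when isDecimal is true

        Column(String name, boolean isDecimal, int precision) {
            this.name = name;
            this.isDecimal = isDecimal;
            this.precision = precision;
        }
    }

    // Behaviour described as the bug: every column gets an entry,
    // and non-decimal columns are filled with zero.
    static List<Integer> precisionForEveryColumn(List<Column> columns) {
        List<Integer> out = new ArrayList<>();
        for (Column c : columns) {
            out.add(c.isDecimal ? c.precision : 0);
        }
        return out;
    }

    // Behaviour described as the goal: only decimal columns contribute,
    // so a writer is free to apply its defaults everywhere else.
    static Map<String, Integer> precisionForDecimalColumns(List<Column> columns) {
        Map<String, Integer> out = new LinkedHashMap<>();
        for (Column c : columns) {
            if (c.isDecimal) {
                out.put(c.name, c.precision);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<Column> schema = Arrays.asList(
            new Column("id", false, 0),
            new Column("price", true, 18),
            new Column("label", false, 0));
        System.out.println(precisionForEveryColumn(schema));    // [0, 18, 0]
        System.out.println(precisionForDecimalColumns(schema)); // {price=18}
    }
}
```

In the first form a consumer cannot tell an explicit zero from "no precision information", which is the ambiguity the description points at; the second form simply omits non-decimal columns.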

@sperlingxx sperlingxx requested a review from a team as a code owner February 14, 2022 08:41
@github-actions github-actions bot added the Java (Affects Java cuDF API) label Feb 14, 2022
@sperlingxx sperlingxx added the non-breaking (Non-breaking change) and bug (Something isn't working) labels Feb 14, 2022
Signed-off-by: sperlingxx <[email protected]>
@sperlingxx sperlingxx added the 3 - Ready for Review (Ready for review by team) label Feb 14, 2022
@codecov

codecov bot commented Feb 14, 2022

Codecov Report

Merging #10276 (fd2aa6d) into branch-22.04 (7f2a16f) will not change coverage.
The diff coverage is n/a.


@@              Coverage Diff              @@
##           branch-22.04   #10276   +/-   ##
=============================================
  Coverage         10.67%   10.67%           
=============================================
  Files               122      122           
  Lines             20873    20873           
=============================================
  Hits               2228     2228           
  Misses            18645    18645           


@jlowe jlowe changed the title from "Stop setting decimal precision for non-decimal columns when building OrcWriterOptions" to "Allow Java bindings to use default decimal precisions when writing columns" Feb 14, 2022
@sperlingxx
Contributor Author

@gpucibot merge

@rapids-bot rapids-bot bot merged commit f5ae74f into rapidsai:branch-22.04 Feb 15, 2022
@sperlingxx sperlingxx deleted the fix_io_decimal_precision branch February 15, 2022 01:30
Labels
3 - Ready for Review: Ready for review by team
bug: Something isn't working
Java: Affects Java cuDF API.
non-breaking: Non-breaking change
Development

Successfully merging this pull request may close these issues.

[BUG] Java bindings for ORC/Parquet writers always set decimal precision
3 participants