Fixes CSV-reader type inference for thousands separator and decimal point #8261

elstehle · 2021-05-17T15:48:52Z

This PR fixes #6655
This PR also makes sure to respect a user-specified decimal point during type inference. I.e., when the decimal point is not '.', types are now correctly inferred.
Plus some minor doxygen fixes and style changes from camelCase to snake_case.

rgsl888prabhu

minor comment, other than that looks good.

cpp/tests/io/csv_test.cpp

vuule

Looks good except for the existing comment on testing 👍

vuule · 2021-05-19T16:59:25Z

@elstehle please don't force-push. If I remember correctly, it messes with the comment traceability.

codecov · 2021-05-19T18:29:47Z

Codecov Report

❗ No coverage uploaded for pull request base (branch-21.06@59d8d5e). Click here to learn what that means.
The diff coverage is n/a.

❗ Current head 8252777 differs from pull request most recent head a7e2715. Consider uploading reports for the commit a7e2715 to get more accurate results

@@               Coverage Diff               @@
##             branch-21.06    #8261   +/-   ##
===============================================
  Coverage                ?   82.84%           
===============================================
  Files                   ?      105           
  Lines                   ?    17865           
  Branches                ?        0           
===============================================
  Hits                    ?    14800           
  Misses                  ?     3065           
  Partials                ?        0

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 59d8d5e...a7e2715. Read the comment docs.

vuule · 2021-05-19T19:36:17Z

@gpucibot merge

elstehle requested a review from a team as a code owner May 17, 2021 15:48

elstehle requested review from devavret and rgsl888prabhu May 17, 2021 15:48

github-actions bot added the libcudf label May 17, 2021

elstehle added 3 - Ready for Review bug non-breaking labels May 17, 2021

rgsl888prabhu reviewed May 17, 2021

View reviewed changes

cpp/tests/io/csv_test.cpp Outdated Show resolved Hide resolved

elstehle requested a review from vuule May 17, 2021 16:32

vuule approved these changes May 17, 2021

View reviewed changes

elstehle added 4 commits May 18, 2021 12:47

fixes some doxygen and camelCase style

3fe9d02

fixes csv type inference for thousands separator and decimal point

ca61bd5

added tests for csv reader type inference

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23
Expired

Verified
Learn about vigilant mode

a65bfaa

added tests for comparing parsed values

a7e2715

elstehle force-pushed the fix/read-csv-auto-detect-types branch from 79c2e55 to a7e2715 Compare May 19, 2021 15:14

rgsl888prabhu approved these changes May 19, 2021

View reviewed changes

vuule added 5 - Ready to Merge and removed 3 - Ready for Review labels May 19, 2021

rapids-bot bot merged commit 2b9fc62 into rapidsai:branch-21.06 May 19, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixes CSV-reader type inference for thousands separator and decimal point #8261

Fixes CSV-reader type inference for thousands separator and decimal point #8261

elstehle commented May 17, 2021 •

edited by rgsl888prabhu

Loading

rgsl888prabhu left a comment

vuule left a comment

vuule commented May 19, 2021

codecov bot commented May 19, 2021 •

edited

Loading

vuule commented May 19, 2021

Fixes CSV-reader type inference for thousands separator and decimal point #8261

Fixes CSV-reader type inference for thousands separator and decimal point #8261

Conversation

elstehle commented May 17, 2021 • edited by rgsl888prabhu Loading

rgsl888prabhu left a comment

Choose a reason for hiding this comment

vuule left a comment

Choose a reason for hiding this comment

vuule commented May 19, 2021

codecov bot commented May 19, 2021 • edited Loading

Codecov Report

vuule commented May 19, 2021

elstehle commented May 17, 2021 •

edited by rgsl888prabhu

Loading

codecov bot commented May 19, 2021 •

edited

Loading