Fixes bug in csv_reader_options construction in cython #12021
Codecov Report
Base: 87.47% // Head: 88.11% // Increases project coverage by +0.63%
Additional details and impacted files
@@            Coverage Diff             @@
##    branch-22.12    #12021    +/-    ##
=========================================
+ Coverage    87.47%    88.11%    +0.63%
=========================================
  Files          133       133
  Lines        21826     22003     +177
=========================================
+ Hits         19093     19388     +295
+ Misses        2733      2615     -118
The fix here looks fine. I'll let the other reviewers follow up on the question of testing.
Co-authored-by: GALI PREM SAGAR <[email protected]>
bool literals give parsing errors as int; "0" and "1" give parsing errors as bool in pandas
…reader_options-cython-bug
Pandas behaviour:
Our CSV parser:
These are differences between our parsers and pandas. We should decide where we should deviate from pandas parsers.
The pandas behaviour seems too restrictive here. I'm okay with leaving our behaviour as is, and I think it's fine for our CSV reader to have some flexibility.
@gpucibot merge
This PR will clean up the common parsing code shared by the nested JSON reader and the CSV reader.
- Uses `std::optional` for indicating parsing failure in `parse_numeric`
- Cleanup
  - Removed `decode_value`, as it only provides specializations for timestamp and duration types; the rest of the types are passthrough.
  - Unified `decode_digit`

Depends on #11898 and #12021

Authors:
- Karthikeyan (https://github.com/karthikeyann)

Approvers:
- Vyas Ramasubramani (https://github.com/vyasr)
- GALI PREM SAGAR (https://github.com/galipremsagar)
- Vukasin Milovanovic (https://github.com/vuule)

URL: #12022
Description
Fixes bug in csv_reader_options construction in cython
The false values for CSV were not passed to `csv_reader_options` during construction in the Cython code. This PR fixes that and adds a unit test.
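For readers unfamiliar with this class of bug, here is a minimal pure-Python sketch of what "an option is accepted but never forwarded" looks like, and of the kind of regression check a unit test might make. Everything in it is hypothetical and for illustration only: `ReaderOptions`, `make_options_buggy`, `make_options_fixed`, and `read_bool_column` are not cuDF's classes or functions, and the real fix lives in the Cython bindings rather than in code like this.

```python
import csv
import io


class ReaderOptions:
    """Hypothetical stand-in for a reader options object (not cuDF's class)."""

    def __init__(self, true_values=(), false_values=()):
        self.true_values = set(true_values)
        self.false_values = set(false_values)


def make_options_buggy(true_values=None, false_values=None):
    # Mimics the reported bug: false_values is accepted but never forwarded.
    return ReaderOptions(true_values=true_values or ())


def make_options_fixed(true_values=None, false_values=None):
    # Forwards both lists to the options object.
    return ReaderOptions(
        true_values=true_values or (),
        false_values=false_values or (),
    )


def read_bool_column(text, options):
    """Parse a one-column CSV (with header) into True / False / None."""
    rows = csv.reader(io.StringIO(text))
    next(rows)  # skip the header row
    parsed = []
    for row in rows:
        cell = row[0]
        if cell in options.true_values:
            parsed.append(True)
        elif cell in options.false_values:
            parsed.append(False)
        else:
            parsed.append(None)  # literal not recognized as a boolean
    return parsed


data = "flag\nyes\nno\nyes\n"
buggy = read_bool_column(data, make_options_buggy(["yes"], ["no"]))
fixed = read_bool_column(data, make_options_fixed(["yes"], ["no"]))
```

With the buggy builder, the `no` cell is not recognized as a false literal; with the fixed builder it parses as `False`. A regression test for this kind of issue would typically pass custom false values through the public reader and assert on the resulting column.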