-
Notifications
You must be signed in to change notification settings - Fork 915
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix ORC reader for empty DataFrame/Table #7624
Fix ORC reader for empty DataFrame/Table #7624
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🔥
Just a minor suggestion and a question.
Codecov Report
@@ Coverage Diff @@
## branch-0.19 #7624 +/- ##
===============================================
+ Coverage 81.86% 82.49% +0.62%
===============================================
Files 101 101
Lines 16884 17400 +516
===============================================
+ Hits 13822 14354 +532
+ Misses 3062 3046 -16
Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One concern but approve otherwise
@gpucibot merge |
…rc_empty_dataframe
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Python looks ✔️
ff.types
by default will have a main type as struct under which all other columns will originate. So, we need to skip first which is not a column and start with 1st index.(Look for
Type Information
in ORC Specification)Along with that, we should also take care of the scenario where user would specify specific column name to retrieve, but it doesn't exist in case of empty data frame/table.
Added test case to validate both scenario.
closes #7356