-
Notifications
You must be signed in to change notification settings - Fork 916
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix skiprows issue with ORC Reader #7359
Fix skiprows issue with ORC Reader #7359
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Thanks @rgsl888prabhu for the fix!
@@ -8,6 +8,7 @@ | |||
import pandas as pd | |||
import pyarrow as pa | |||
import pyarrow.orc | |||
import pyorc |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
CI fails because of this import
ModuleNotFoundError: No module named 'pyorc'
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Created a PR for it rapidsai/integration#215
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
approved once more, for good measure :)
rerun tests |
Codecov Report
@@ Coverage Diff @@
## branch-0.19 #7359 +/- ##
==============================================
Coverage ? 82.19%
==============================================
Files ? 100
Lines ? 16968
Branches ? 0
==============================================
Hits ? 13947
Misses ? 3021
Partials ? 0 Continue to review full report at Codecov.
|
@gpucibot merge |
closes #7343
The validity bits in streams are placed msb to lsb in a byte, [True, False, True. False. True, True, True, False] -> 10101110.
So, when it is being analyzed as 32 bit chunk, we can't apply mask directly, which caused this issue.
__brev(__byte_perm(bits, 0, 0x0123))
takes care of that issue and rearranges the bits as per the expectation.