-
Notifications
You must be signed in to change notification settings - Fork 240
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Decimal128 support for Parquet #4362
Conversation
Signed-off-by: Kuhu Shukla <[email protected]>
3289e24
to
8d00323
Compare
It would be nice to add some tests for nested types of decimal128. |
Oops. I think I missed a commit. I'm in the process of rebasing. Will update tomorrow with more tests |
Looks good is there any way to get some parquet files with unsigned longs in them to test that code? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It looks good generally.
NIT: Can you add some comments about unsigned 64 conversion?
@@ -796,7 +797,8 @@ trait ParquetPartitionReaderBase extends Logging with Arm with ScanWithMetrics | |||
} | |||
|
|||
def needDecimalCast(cv: ColumnView, dt: DataType): Boolean = { | |||
cv.getType.isDecimalType && !GpuColumnVector.getNonNestedRapidsType(dt).equals(cv.getType()) | |||
cv.getType.isDecimalType && !GpuColumnVector.getNonNestedRapidsType(dt).equals(cv.getType()) || | |||
cv.getType.equals(DType.UINT64) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you add some comments to elaborate why UINT64
needs to be casted to decimal here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes I can :) Coming up shortly.
Yes that would be nice, isn't it? I will try and make that work. |
…' into parq_dec128
I cant find a simple way to get uint64s on the fly for testing. Can use any ideas I might have missed? I see we had to check in files earlier as well possibly for the same limitation. |
build |
FYI:
|
@revans2 Pushed commit to fix the missing piece from verify. |
build |
Hi @kuhushukla, I found this PR also addressed #3475. So, could you add a test case to verify that we are already able to read |
Signed-off-by: Kuhu Shukla <[email protected]>
Adding a parquet file to test UINT64.
I created the file using |
@sperlingxx Please take a look. |
Signed-off-by: Kuhu Shukla <[email protected]>
build |
Looking at the build failure with 32x shims now. |
…' into parq_dec128
The build failure should be fixed by #4427. |
LGTM |
build |
…' into parq_dec128
build |
…' into parq_dec128
build |
Running the normal - non databricks premerge build for sanity. Also pinging @sperlingxx for the additional look over and approval. |
build |
This is not ready for complete review as we need more tests and support for uint64.