Fixes creation of invalid DecimalType in GpuDivide.tagExprForGpu #1991
Conversation
Signed-off-by: Raza Jafri <[email protected]>
@andygrove thank you for finding this. Please take a look at this when you get a chance. This will basically behave the same as before, which is to preclude the expression from running on the GPU.
build
@razajafri I tested locally and can confirm that it fixed the issue when running with NDS query 58.
// To find out if outputType.precision < outputType.scale simplifies to p1 < 0
// which is never possible
// Case 1: OutputType.precision doesn't get truncated
// We will never hit a case where outputType.precision < outputType.scale + r.scale.
I think it would be good (and fairly easy) to add a unit test for this. I had a go and found some values that do hit this case, but it is possible that I am using values that couldn't happen in real life. I am not sure.
Here is one example:
l = DecimalType(3,0)
r = DecimalType(21,17)
outputType = DecimalType(38,23)
38 < 23 + 17
Perhaps I am hitting case 2 here where precision is getting truncated?
It does look like I am hitting case 2 here so I think the logic in the comments is sound, but a unit test would make me more confident and would protect against future regressions if any of this code gets updated.
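To make the "case 2" conclusion concrete, here is a rough Python sketch of Spark's decimal division typing rule (scale = max(6, s1 + p2 + 1), precision = p1 - s1 + s2 + scale, per its DecimalPrecision logic as understood here). It is an illustration only, not code from this PR, and the exact adjusted type Spark settles on may differ from the DecimalType(38,23) quoted in the example above.

```python
# Sketch of Spark's result-type rule for decimal division, used only to show
# why l = DecimalType(3,0) and r = DecimalType(21,17) overflow 38 digits and
# therefore force a truncation ("case 2" in the comments under review).
MAX_PRECISION = 38

def division_result_type(p1, s1, p2, s2):
    scale = max(6, s1 + p2 + 1)
    precision = p1 - s1 + s2 + scale
    return precision, scale

precision, scale = division_result_type(3, 0, 21, 17)
print(precision, scale)              # 42, 22 -> unadjusted precision exceeds 38
print(precision > MAX_PRECISION)     # True, so the precision/scale must be truncated
```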
+1 for the unit test to verify that this is working. The logic looks good to me. I am a little concerned about the coupling between this code and the code in GpuDivideUtil, but that is minor for now.
Yes, I wrote a unit test locally to verify this. I don't know why I didn't check it in. Let me do that.
l = DecimalType(3,0)
r = DecimalType(21,17)
outputType = DecimalType(38,23)
38 < 23 + 17
So, like you said, this won't happen in reality because by the time we get the call both l and r will have the same precision and scale; I think Spark will upcast the Decimal(3,0) to Decimal(21,17).
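For what it's worth, a small sketch of the widening rule Spark applies to the operands before the divide (per its widerDecimalType logic, as understood here; not code from this PR) agrees with that: Decimal(3,0) gets upcast to Decimal(21,17).

```python
# Rough sketch of how Spark widens both operands of a decimal binary
# expression to a common type before the division itself runs.
def wider_decimal_type(p1, s1, p2, s2, max_precision=38):
    scale = max(s1, s2)                    # keep the larger scale
    int_digits = max(p1 - s1, p2 - s2)     # keep the larger integer part
    return min(int_digits + scale, max_precision), scale

print(wider_decimal_type(3, 0, 21, 17))    # (21, 17): Decimal(3,0) is upcast to Decimal(21,17)
```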
Signed-off-by: Raza Jafri <[email protected]>
@andygrove @revans2 I have added a unit test. PTAL
LGTM. Thanks @razajafri
build
@pytest.mark.parametrize('data_gen', [double_gen, decimal_gen_neg_scale, DecimalGen(6, 3),
    DecimalGen(5, 5), DecimalGen(6, 0),
    pytest.param(DecimalGen(38, 21), marks=pytest.mark.xfail(reason="The precision is too large to be supported on the GPU")),
    pytest.param(DecimalGen(21, 17), marks=pytest.mark.xfail(reason="The precision is too large to be supported on the GPU"))], ids=idfn)
If I remove your patch and keep the test, there will be no real difference in the results, unless someone goes through the logs manually and sees that in one case we failed by falling back to the CPU, and in the other case we failed by throwing an exception about going over the limit. In both cases the xfail ignored the exception that was thrown.
I have updated the test to check if the test raises IllegalArgumentException. Let me know if you want me to tighten it down even more. Ideally we should be checking the message to make sure it's failing because it's not columnar, but I am not sure how to accomplish that using the pytest xfail.
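For reference, pytest can pin a failure to a specific message with pytest.raises(..., match=...), which is stricter than an xfail that only names the exception type. The sketch below is a self-contained illustration only: the raising function, exception class, and message are stand-ins, not what the plugin or Spark actually raises in this test.

```python
import pytest

def run_unsupported_division():
    # Stand-in for the Spark action that is expected to fail; in the real test
    # this would be the collect() whose plan would build an invalid DecimalType.
    raise ValueError("DecimalType can only support precision up to 38")

def test_unsupported_division_message():
    # match= is a regex searched in str(exc), so this pins the failure to the
    # precision error rather than to any arbitrary exception.
    with pytest.raises(ValueError, match="precision up to 38"):
        run_unsupported_division()
```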
Signed-off-by: Raza Jafri <[email protected]>
@revans2 I tested the new test on top of branch-0.5 and it fails without my fix. PTAL
build
@revans2 do you have any more concerns?
The CI has been waiting, complaining about insufficient CPU.
…DIA#1991)

* fixes NVIDIA#1984
  Signed-off-by: Raza Jafri <[email protected]>
* added unit tests
  Signed-off-by: Raza Jafri <[email protected]>
* make sure the exception is not AnalysisException
  Signed-off-by: Raza Jafri <[email protected]>

Co-authored-by: Raza Jafri <[email protected]>
While supporting DIV changes, I completely overlooked the fact that the precision and scale can be truncated.
This addresses that case.
fixes #1984
Signed-off-by: Raza Jafri [email protected]