
[BUG] Handle Decimal128 computation with overflow of Remainder on Spark 3.4 #8330

Closed
NVnavkumar opened this issue May 19, 2023 · 0 comments · Fixed by #8414
Labels: bug (Something isn't working), Spark 3.4+ (Spark 3.4+ issues)

NVnavkumar (Collaborator) commented:

Describe the bug
When computing the remainder for decimal types, the existing algorithm can only handle the same Decimal128 overflow cases that previous versions of Spark could handle: if the operands cannot be upcast to a common type because of precision limits, the plugin either throws an exception (ANSI mode) or returns null. Spark 3.4, however, now computes this remainder, because it falls back to Java's arbitrary-precision BigDecimal and rounds the result from there.
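As a rough illustration (not the plugin's implementation), arbitrary-precision decimal arithmetic, analogous to java.math.BigDecimal on the Spark side, can produce a finite remainder even though aligning the two operands to a common scale needs more digits than the 38 a Decimal128 can hold:

from decimal import Decimal, getcontext

# Give ourselves more working digits than Decimal128's 38-digit limit.
getcontext().prec = 50

a = Decimal('5776949384953805890688943467625198736')  # declared DecimalType(38, 0)
b = Decimal('-67337920196996830.354487679299')        # declared DecimalType(27, 7)

# The remainder itself is small (its magnitude is below |b|), so it is
# representable; only the intermediate common representation overflows.
print(a % b)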

Steps/Code to reproduce bug
Try to compute the remainder between two large Decimal128 values:

PySpark example:

from decimal import Decimal
from pyspark.sql.types import StructType, StructField, DecimalType

data = [[Decimal('5776949384953805890688943467625198736'), Decimal('-67337920196996830.354487679299')]]
schema = StructType([
    StructField("a", DecimalType(38, 0), True),
    StructField("b", DecimalType(27, 7), True)])
df = spark.createDataFrame(data=data, schema=schema)

out = df.selectExpr("a", "b", r"a % b").collect()
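As the description notes, on versions that cannot compute this remainder the null result becomes an overflow error under ANSI mode; that path can be exercised by enabling ANSI SQL mode before running the same query:

spark.conf.set("spark.sql.ansi.enabled", "true")
df.selectExpr("a", "b", r"a % b").collect()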

Expected behavior
When running on Spark 3.4, this should return the correct value for a % b (on earlier versions of Spark, the result is null).
