
[BUG] HALF_UP and HALF_EVEN rounding can produce the wrong result for decimal128 #14210

Closed · revans2 opened this issue Sep 27, 2023 · 5 comments

Labels: 1 - On Deck (to be worked on next), bug (something isn't working), libcudf (affects libcudf C++/CUDA code), Spark (functionality that helps Spark RAPIDS)
revans2 commented Sep 27, 2023

Describe the bug
I did some extra testing in Spark of half-even rounding (bround) and half-up rounding (round), specifically for decimal values. Decimal128 values showed a number of differences between the GPU and the CPU at various rounding levels.

+-----+---------------------------------------+---+---+----+-----+------+---+---+----+-----+------+-----+----------+
|id   |div_num                                |b0 |b1 |b2  |b3   |b4    |r0 |r1 |r2  |r3   |r4    |count|_in_column|
+-----+---------------------------------------+---+---+----+-----+------+---+---+----+-----+------+-----+----------+
|2    |0.5000000000000000000000000000000000000|0  |0.5|0.50|0.500|0.5000|1  |0.5|0.50|0.500|0.5000|1    |CPU       |
|2    |0.5000000000000000000000000000000000000|1  |0.5|0.50|0.500|0.5000|1  |0.5|0.50|0.500|0.5000|1    |GPU       |
|4    |0.2500000000000000000000000000000000000|0  |0.2|0.25|0.250|0.2500|0  |0.3|0.25|0.250|0.2500|1    |CPU       |
|4    |0.2500000000000000000000000000000000000|0  |0.2|0.25|0.250|0.2500|0  |0.2|0.25|0.250|0.2500|1    |GPU       |
|8    |0.1250000000000000000000000000000000000|0  |0.1|0.12|0.125|0.1250|0  |0.1|0.13|0.125|0.1250|1    |CPU       |
|8    |0.1250000000000000000000000000000000000|0  |0.1|0.13|0.125|0.1250|0  |0.1|0.13|0.125|0.1250|1    |GPU       |
|16   |0.0625000000000000000000000000000000000|0  |0.1|0.06|0.062|0.0625|0  |0.1|0.06|0.063|0.0625|1    |CPU       |
|16   |0.0625000000000000000000000000000000000|0  |0.1|0.06|0.063|0.0625|0  |0.1|0.06|0.063|0.0625|1    |GPU       |
|20   |0.0500000000000000000000000000000000000|0  |0.0|0.05|0.050|0.0500|0  |0.1|0.05|0.050|0.0500|1    |CPU       |
|20   |0.0500000000000000000000000000000000000|0  |0.0|0.05|0.050|0.0500|0  |0.0|0.05|0.050|0.0500|1    |GPU       |
|32   |0.0312500000000000000000000000000000000|0  |0.0|0.03|0.031|0.0312|0  |0.0|0.03|0.031|0.0313|1    |CPU       |
|32   |0.0312500000000000000000000000000000000|0  |0.0|0.03|0.031|0.0313|0  |0.0|0.03|0.031|0.0313|1    |GPU       |
|40   |0.0250000000000000000000000000000000000|0  |0.0|0.02|0.025|0.0250|0  |0.0|0.03|0.025|0.0250|1    |CPU       |
|40   |0.0250000000000000000000000000000000000|0  |0.0|0.03|0.025|0.0250|0  |0.0|0.03|0.025|0.0250|1    |GPU       |
|80   |0.0125000000000000000000000000000000000|0  |0.0|0.01|0.012|0.0125|0  |0.0|0.01|0.013|0.0125|1    |CPU       |
|80   |0.0125000000000000000000000000000000000|0  |0.0|0.01|0.013|0.0125|0  |0.0|0.01|0.013|0.0125|1    |GPU       |
|160  |0.0062500000000000000000000000000000000|0  |0.0|0.01|0.006|0.0062|0  |0.0|0.01|0.006|0.0063|1    |CPU       |
|160  |0.0062500000000000000000000000000000000|0  |0.0|0.01|0.006|0.0063|0  |0.0|0.01|0.006|0.0063|1    |GPU       |
|200  |0.0050000000000000000000000000000000000|0  |0.0|0.00|0.005|0.0050|0  |0.0|0.01|0.005|0.0050|1    |CPU       |
|200  |0.0050000000000000000000000000000000000|0  |0.0|0.01|0.005|0.0050|0  |0.0|0.01|0.005|0.0050|1    |GPU       |
|400  |0.0025000000000000000000000000000000000|0  |0.0|0.00|0.002|0.0025|0  |0.0|0.00|0.003|0.0025|1    |CPU       |
|400  |0.0025000000000000000000000000000000000|0  |0.0|0.00|0.003|0.0025|0  |0.0|0.00|0.003|0.0025|1    |GPU       |
|800  |0.0012500000000000000000000000000000000|0  |0.0|0.00|0.001|0.0012|0  |0.0|0.00|0.001|0.0013|1    |CPU       |
|800  |0.0012500000000000000000000000000000000|0  |0.0|0.00|0.001|0.0013|0  |0.0|0.00|0.001|0.0013|1    |GPU       |
|2000 |0.0005000000000000000000000000000000000|0  |0.0|0.00|0.000|0.0005|0  |0.0|0.00|0.001|0.0005|1    |CPU       |
|2000 |0.0005000000000000000000000000000000000|0  |0.0|0.00|0.001|0.0005|0  |0.0|0.00|0.001|0.0005|1    |GPU       |
|4000 |0.0002500000000000000000000000000000000|0  |0.0|0.00|0.000|0.0002|0  |0.0|0.00|0.000|0.0003|1    |CPU       |
|4000 |0.0002500000000000000000000000000000000|0  |0.0|0.00|0.000|0.0003|0  |0.0|0.00|0.000|0.0003|1    |GPU       |
|20000|0.0000500000000000000000000000000000000|0  |0.0|0.00|0.000|0.0000|0  |0.0|0.00|0.000|0.0001|1    |CPU       |
|20000|0.0000500000000000000000000000000000000|0  |0.0|0.00|0.000|0.0001|0  |0.0|0.00|0.000|0.0001|1    |GPU       |
+-----+---------------------------------------+---+---+----+-----+------+---+---+----+-----+------+-----+----------+

Here we are rounding 1/id, where the result is a decimal with a scale of -37. The b columns are the results of half-even rounding (bround) and the r columns are the results of half-up rounding (round); the digit in the column name is the number of decimal places rounded to.

Steps/Code to reproduce bug
I wrote some unit tests:

diff --git a/cpp/tests/round/round_tests.cpp b/cpp/tests/round/round_tests.cpp
index d802c0c270..ece7abc45d 100644
--- a/cpp/tests/round/round_tests.cpp
+++ b/cpp/tests/round/round_tests.cpp
@@ -703,4 +703,84 @@ TEST_F(RoundTests, BoolTestHalfUp)
   EXPECT_THROW(cudf::round(input, -2, cudf::rounding_method::HALF_UP), cudf::logic_error);
 }
 
+// Use __uint128_t for demonstration.
+constexpr __uint128_t operator""_uint128_t(const char* s)
+{
+  __uint128_t ret = 0;
+  for (int i = 0; s[i] != '\0'; ++i)
+  {
+    ret *= 10;
+    if ('0' <= s[i] && s[i] <= '9') {
+      ret += s[i] - '0';
+    }
+  }
+  return ret;
+}
+
+TEST_F(RoundTests, HalfEvenErrorsA)
+{
+  using namespace numeric;
+  using RepType    = cudf::device_storage_type_t<decimal128>;
+  using fp_wrapper = cudf::test::fixed_point_column_wrapper<RepType>;
+
+  {
+    // 0.5 at scale -37 should round HALF_EVEN to 0, because 0 is an even number
+    auto const input    = fp_wrapper{{5000000000000000000000000000000000000_uint128_t}, scale_type{-37}};
+    auto const expected = fp_wrapper{{0}, scale_type{0}};
+    auto const result   = cudf::round(input, 0, cudf::rounding_method::HALF_EVEN);
+
+    CUDF_TEST_EXPECT_COLUMNS_EQUAL(expected, result->view());
+  }
+}
+
+TEST_F(RoundTests, HalfEvenErrorsB)
+{
+  using namespace numeric;
+  using RepType    = cudf::device_storage_type_t<decimal128>;
+  using fp_wrapper = cudf::test::fixed_point_column_wrapper<RepType>;
+
+  {
+    // 0.125 at scale -37 should round HALF_EVEN to 0.12, because 2 is an even number
+    auto const input    = fp_wrapper{{1250000000000000000000000000000000000_uint128_t}, scale_type{-37}};
+    auto const expected = fp_wrapper{{12}, scale_type{-2}};
+    auto const result   = cudf::round(input, 2, cudf::rounding_method::HALF_EVEN);
+
+    CUDF_TEST_EXPECT_COLUMNS_EQUAL(expected, result->view());
+  }
+}
+
+TEST_F(RoundTests, HalfEvenErrorsC)
+{
+  using namespace numeric;
+  using RepType    = cudf::device_storage_type_t<decimal128>;
+  using fp_wrapper = cudf::test::fixed_point_column_wrapper<RepType>;
+
+  {
+    // 0.0625 at scale -37 should round HALF_EVEN to 0.062, because 2 is an even number
+    auto const input    = fp_wrapper{{0625000000000000000000000000000000000_uint128_t}, scale_type{-37}};
+    auto const expected = fp_wrapper{{62}, scale_type{-3}};
+    auto const result   = cudf::round(input, 3, cudf::rounding_method::HALF_EVEN);
+
+    CUDF_TEST_EXPECT_COLUMNS_EQUAL(expected, result->view());
+  }
+}
+
+TEST_F(RoundTests, HalfUpErrorsA)
+{
+  using namespace numeric;
+  using RepType    = cudf::device_storage_type_t<decimal128>;
+  using fp_wrapper = cudf::test::fixed_point_column_wrapper<RepType>;
+
+  {
+    // 0.25 at scale -37 should round HALF_UP to 0.3
+    auto const input    = fp_wrapper{{2500000000000000000000000000000000000_uint128_t}, scale_type{-37}};
+    auto const expected = fp_wrapper{{3}, scale_type{-1}};
+    auto const result   = cudf::round(input, 1, cudf::rounding_method::HALF_UP);
+
+    CUDF_TEST_EXPECT_COLUMNS_EQUAL(expected, result->view());
+  }
+}
+
+
+
 CUDF_TEST_PROGRAM_MAIN()

But they don't cover all of the error cases from the earlier testing I did in Spark.

Expected behavior
We get the correct answer when rounding decimal128 numbers.


GregoryKimball commented Sep 27, 2023

Thanks @revans2 for raising this. Is it fair to say that cudf::rounding_method::HALF_EVEN is not working correctly for some decimal values? Is this only an issue with scale_type{-37}?

> But they don't cover the error cases from the previous test I did in Spark.

Do you think the unit tests you've proposed cover the root cause of all the error cases above?


revans2 commented Sep 27, 2023

@GregoryKimball I am running tests right now to get a better idea. I have not exhaustively tested all combinations of decimal scale. I have seen it happen with scale -37 and scale -36. I will let you know when my tests are done if there are more that fail.


revans2 commented Sep 28, 2023

I ran through a fairly large, exhaustive set of tests for various scales at different rounding indexes. For HALF_UP, the largest scale at which I saw a wrong answer was -21, rounding to 0 decimal places:

+---+-------------------------+-------+-----+----------+
|id |div_num                  |rounded|count|_in_column|
+---+-------------------------+-------+-----+----------+
|2  |0.50000000000000000000000|1      |1    |CPU       |
|2  |0.50000000000000000000000|0      |1    |GPU       |
+---+-------------------------+-------+-----+----------+

For HALF_EVEN it was scale -22, rounding to 0 decimal places:

+---+--------------------------+-------+-----+----------+
|id |div_num                   |rounded|count|_in_column|
+---+--------------------------+-------+-----+----------+
|2  |0.500000000000000000000000|0      |1    |CPU       |
|2  |0.500000000000000000000000|1      |1    |GPU       |
+---+--------------------------+-------+-----+----------+

Beyond those, I have a list of 743 values at different scales and rounding targets that produce incorrect answers. I suspect it has something to do either with overflow, or with the fact that a double is being used to calculate powers of 10:

T const n = std::pow(10, std::abs(decimal_places));

Type const n = std::pow(10, scale_movement);

I had to write some custom code to do 256-bit decimal math, because Spark/Java will go over 128 bits for some operations, and I ran into issues with pow being inaccurate, so I wrote a lookup table, which was simple enough since it only had to cover the powers of 10 we could run into. Not that you have to do the same; it's just a point of reference.

https://github.com/NVIDIA/spark-rapids-jni/blob/f32fe74027b25fdb64c997ca7515490a8c210072/src/main/cpp/src/decimal_utils.cu#L247-L510

I'll try to clean up the log file I have with the 743 failures for reference.


revans2 commented Sep 28, 2023

out.tsv.gz is a tab-separated values file containing the failed test cases.


bdice commented Sep 28, 2023

I looked briefly at the rounding functors themselves, but nothing jumped out at me. Based on the variation with precision (it seems that larger scale values work but -21 to -37 fail), my working hypothesis is that this is due to some stage of the process not being done in integral math. There is a reference here to std::pow, which could introduce a floating-point component:

Type const n = std::pow(10, scale_movement);
