Better float/double cases for casting tests #1781

sperlingxx · 2021-02-21T09:53:01Z

This pull request is to address issue #1764. And current PR also includes some code clean work .

Signed-off-by: sperlingxx <[email protected]>

sperlingxx · 2021-02-21T10:38:32Z

build

sperlingxx · 2021-02-21T10:42:45Z

build

revans2 · 2021-02-22T16:11:02Z

tests/src/test/scala/com/nvidia/spark/rapids/CastOpSuite.scala

@@ -425,6 +383,14 @@ class CastOpSuite extends GpuExpressionTestSuite {
      col("c1").cast(FloatType))
  }

+  ignore("overflow double strings") {
+    testSparkResultsAreEqual("overflow double strings", badDoubleStringsDf,


badDoubleStringsDf is not as comprehensive as I would like it to be. The values in it are big enough to force an overflow, but there are still a number of corner cases where according to rapidsai/cudf#5225 are not being covered. The patch to fix it rapidsai/cudf#7410 is not in and not working 100% either in these same overflow cases. I am happy to have these go in as it, but I would like to see a follow on issue to cover the corner cases that are missing, and have a comment in badDoubleStringsDf pointing to it. Would be good to look at the float one too and make sure it is also not suffering from the same issue.

Hi @revans2, I tried with many other similar (slightly overflow) cases. And I found GPU can produce correct results for almost all of them, except some special cases included in badDoubleStringsDf.
For float type, I failed to find any mismatching case.
For double type, I only found numbers within range [1.79769313486231581E308, 1.797693134862316E308) produce inconsistent results with spark.

sperlingxx · 2021-02-23T08:55:36Z

build

sperlingxx · 2021-02-23T10:38:24Z

build

revans2 · 2021-02-23T14:25:17Z

@razajafri you filed the original bug about casting from string to double. Could you take a look at this PR too?

razajafri · 2021-02-23T17:16:05Z

tests/src/test/scala/com/nvidia/spark/rapids/SparkQueryCompareTestSuite.scala

@@ -1131,14 +1131,16 @@ trait SparkQueryCompareTestSuite extends FunSuite with Arm {
      "9.8e5").toDF("c0")
  }

-  def badFloatStringsDf(session: SparkSession): DataFrame = {
+  def invalidFloatStringsDf(session: SparkSession): DataFrame = {
    import session.sqlContext.implicits._
    Seq(("A", "null"), ("1.3", "43.54")).toDF("c0", "c1")
  }

  def badDoubleStringsDf(session: SparkSession): DataFrame = {


nit: for consistency may be this should be renamed to invalidDoubleStringsDf

razajafri · 2021-02-23T17:16:51Z

@razajafri you filed the original bug about casting from string to double. Could you take a look at this PR too?

Just one nit otherwise LGTM

* enhance float/double cases for casting tests Signed-off-by: sperlingxx <[email protected]> * continue * code clean * code clean * fix typo * fix typo * some updates * fix typo

sperlingxx added 4 commits February 21, 2021 17:41

enhance float/double cases for casting tests

ea3766a

Signed-off-by: sperlingxx <[email protected]>

continue

5c11d5a

code clean

6d308ef

code clean

ff2fa8d

sperlingxx requested a review from revans2 February 21, 2021 10:38

sperlingxx added 2 commits February 21, 2021 18:40

fix typo

e33e0c4

fix typo

1c569cc

sameerz added the test Only impacts tests label Feb 22, 2021

revans2 reviewed Feb 22, 2021

View reviewed changes

some updates

15c2fa5

fix typo

d612e86

revans2 approved these changes Feb 23, 2021

View reviewed changes

razajafri reviewed Feb 23, 2021

View reviewed changes

razajafri merged commit 52d95b1 into NVIDIA:branch-0.4 Feb 23, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better float/double cases for casting tests #1781

Better float/double cases for casting tests #1781

sperlingxx commented Feb 21, 2021 •

edited

Loading

sperlingxx commented Feb 21, 2021

sperlingxx commented Feb 21, 2021

revans2 Feb 22, 2021

sperlingxx Feb 23, 2021

sperlingxx commented Feb 23, 2021

sperlingxx commented Feb 23, 2021

revans2 commented Feb 23, 2021

razajafri Feb 23, 2021

razajafri commented Feb 23, 2021

Better float/double cases for casting tests #1781

Better float/double cases for casting tests #1781

Conversation

sperlingxx commented Feb 21, 2021 • edited Loading

sperlingxx commented Feb 21, 2021

sperlingxx commented Feb 21, 2021

revans2 Feb 22, 2021

Choose a reason for hiding this comment

sperlingxx Feb 23, 2021

Choose a reason for hiding this comment

sperlingxx commented Feb 23, 2021

sperlingxx commented Feb 23, 2021

revans2 commented Feb 23, 2021

razajafri Feb 23, 2021

Choose a reason for hiding this comment

razajafri commented Feb 23, 2021

sperlingxx commented Feb 21, 2021 •

edited

Loading