You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
Max() on a column with NaN values , with or without a grouping key which has NaNs generates wrong results on the GPU. Steps/Code to reproduce bug
//GPU plugin on
val rdd = sc.parallelize(Seq(Float.NaN, 1.0, 2.0), 2)
val df = rdd.toDF("c0")
val res= df.agg(max("c0"))
res.collect
res.explain
This looks like an issue that we cannot work around without the help of cudf. Because float aggregations are off by default I think in the short term if we just update the documentation we can live with this. But we should file a cudf specific issue to try and address this properly.
Describe the bug
Max() on a column with NaN values , with or without a grouping key which has NaNs generates wrong results on the GPU.
Steps/Code to reproduce bug
Output:
Output:
Expected behavior
CPU and GPU answers should match.
Additional context
rapidsai/cudf#4753 is related on cudf side.
The text was updated successfully, but these errors were encountered: