[BUG] NPE during serialization for shuffle in array-aggregation-with-limit query #5469
Labels
bug: Something isn't working
cudf_dependency: An issue or PR with this label depends on a new feature in cudf
Describe the bug
Similar to #5140, when the column partition contains only null data, we hit an NPE with a query like this:
throws
If there is a non-null row, then the result is correct:
Steps/Code to reproduce bug
Start a pyspark 3.2.1 REPL with master local[1] to execute the steps above.
Conf:
Expected behavior
The failing query should return NULL for a NULL-valued array-typed row:
[Row(array_max(a)=None)]
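The expected null handling can be modeled in plain Python. This is only an illustrative sketch, not the plugin's or Spark's implementation: the helper name mirrors the array_max seen in the expected output above, the sample partition is hypothetical, and the treatment of arrays with no non-null elements is an assumption.

```python
from typing import Optional, Sequence


def array_max(values: Optional[Sequence[Optional[int]]]) -> Optional[int]:
    """Toy stand-in for array_max applied to one row (hypothetical helper)."""
    if values is None:
        # NULL array in, NULL out: the behavior this issue expects
        # instead of an NPE.
        return None
    non_null = [v for v in values if v is not None]
    # Assumption: an array with no non-null elements also yields NULL.
    return max(non_null) if non_null else None


# Hypothetical partition whose only row holds a NULL array,
# matching the failing case described in this issue.
all_null_partition = [None]
result = [array_max(row) for row in all_null_partition]
print(result)
```

The point of the sketch is that a row holding a NULL array is a valid input that maps to a NULL result, so an all-null partition should flow through serialization for the shuffle without raising.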
Environment details (please complete the following information)
Additional context
N/A