SELECT
    t.partner_id,
    t.qualifi_name AS name,
    t.qualifi_type AS type_id,
    row_number() OVER (
        PARTITION BY t.partner_id, t.qualifi_type
        ORDER BY t.modify_time, t.create_time, t.end_date DESC
    ) AS row_id
FROM
    tbl t
WHERE t.partition_date = '2024-08-02'
    AND t.status = 1
    AND t.qualifi_type IN (201, 202, 1);
It seems that row_number() over(...) assigned different row numbers to rows that tie in the ordering (the columns used to sort the two rows are equal).
Here is the Gluten physical plan:
Gluten version: 1.2-rc
Spark version: 3.0
It looks like Velox sort doesn't preserve the input order when some rows have the same values for the sort keys. I agree with @kecookier; maybe we can ignore this issue.
This inconsistent behavior seems acceptable, since vanilla Spark sort is not deterministic for tied rows either. Therefore, I will close this issue. Thanks for your help, @kecookier @PHILO-HE.
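For anyone landing here later, the behavior discussed above can be reproduced without Spark or Velox. The following is only a minimal sketch in plain Python, not the actual Gluten or Velox code path: the rows, the `row_numbers` helper, and the `cert_a`/`cert_b` names are hypothetical, and the sort key is reduced to `modify_time` for brevity. It shows that when two rows tie on every sort key, the row number each one gets depends on the order in which they reach the sort, and that adding a column that is unique within the partition as a final tie-breaker makes the result stable.

```python
from itertools import permutations

# Hypothetical rows: (partner_id, qualifi_type, modify_time, name).
# Both rows share the same partition and the same sort key value, so they tie.
rows = [
    (1, 201, "2024-08-01", "cert_a"),
    (1, 201, "2024-08-01", "cert_b"),
]

def row_numbers(input_rows, sort_key):
    """Assign 1-based row numbers per (partner_id, qualifi_type) partition."""
    numbered = {}
    counters = {}
    # sorted() is stable, so tied rows keep whatever order the input had.
    ordered = sorted(input_rows, key=lambda r: (r[0], r[1]) + sort_key(r))
    for row in ordered:
        partition = (row[0], row[1])
        counters[partition] = counters.get(partition, 0) + 1
        numbered[row[3]] = counters[partition]
    return numbered

def distinct_results(sort_key):
    """Collect the distinct numberings seen across all input orders."""
    return {
        tuple(sorted(row_numbers(p, sort_key).items()))
        for p in permutations(rows)
    }

# Ordering by modify_time only: the numbering changes with the input order.
by_time = distinct_results(lambda r: (r[2],))

# Ordering by modify_time, then name (unique here): one numbering only.
by_time_and_name = distinct_results(lambda r: (r[2], r[3]))

print(len(by_time))
print(len(by_time_and_name))
```

In the query from this issue, the analogous change would be to append a column that is unique within each partition to the ORDER BY of the window, assuming the table has one. Without such a column, neither engine's numbering of tied rows should be relied on.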
Backend
VL (Velox)
Bug description
SQL:
Gluten results that mismatch with vanilla:
Vanilla results:
The original rows of the mismatched results: