-
Notifications
You must be signed in to change notification settings - Fork 453
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[GLUTEN-4652][VL] Fix min_by/max_by result mismatch when RDD partition num > 1 #5711
Conversation
…D partition num > 1
I think I used to test the |
@yma11 Did you reproduced the problem I mentioned above based on the master gluten branch? |
@yma11 I guess you didn't reproduce the problem because there was only 1 parquet file under test table dir. |
I mean which part of your code changes in this PR is the fix? Seems only some code refactor? |
Before this PR, In this PR, I take function name as a member of |
That's interesting. Actually, I verified the previous code using real workload but with single parquet file so may not trigger your scenario. Thanks for your fix! |
@yma11 CI |
===== Performance report for TPCDS SF2000 with Velox backend, for reference only ====
|
===== Performance report for TPCH SF2000 with Velox backend, for reference only ====
|
What changes were proposed in this pull request?
A follow-up of PR #5544 to fix min_by/max_by result mismatch when RDD partition num > 1.
Run SQL:
Current result is
Expected result is
How was this patch tested?
Modified test case
VeloxAggregateFunctionsSuite - min_by/max_by