-
Notifications
You must be signed in to change notification settings - Fork 3.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GH-43382: [C++][Parquet] min-max Statistics doesn't work well when one of min-max is truncated #43383
Conversation
|
This is just a try, I can also extract has-min and has-max |
I'm fine to be conservative. So this change looks good. However, the encoded_min and encoded_max are still set. I'd propose to deprecate |
Updated: I'll separate the implemetation into two separate patch:
Anyway this is a bug rather than feature so we should fix it first |
@kou @emkornfield @wgtmac would you mind take a look? This patch only change |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1
Could you uncomment <!-- **This PR includes breaking changes to public APIs.** -->
in the PR description?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1
After merging your PR, Conbench analyzed the 4 benchmarking runs that have been run so far on merge-commit 66cb749. There were no benchmark performance regressions. 🎉 The full Conbench report has more details. It also includes information about 3 possible false positives for unstable benchmarks that are known to sometimes produce them. |
Rationale for this change
See #43382
What changes are included in this PR?
Change stats has min-max from min || max to &&
Are these changes tested?
Are there any user-facing changes?
Might affect interface using HasMinMax
This PR includes breaking changes to public APIs.