-
Notifications
You must be signed in to change notification settings - Fork 915
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Address poor performance of Parquet string decoding #15304
Conversation
/ok to test |
Thank you @etseidl for looking into this performance case! |
@etseidl would you please let me know if there are more changes you would like to make - or is this ready for review? |
@GregoryKimball If a quick fix is wanted for 24.04, then I think this is ready. It will take longer to evaluate more complicated solutions. My first attempt at going block wide didn't pan out :( |
/ok to test |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
❤️
/merge |
Description
See #15297. The Parquet string decoder can become a bottleneck in the presence of strings of widely varying sizes. This PR is an attempt to address this, at least as a stop gap solution. A more complete solution may be to rework the string decoder to work in a block-wide fashion, such as the new micro-kernels added in #15159.
Checklist