Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEA] Parallelize gpuInitStringDescriptors when Parquet input type is FIXED_LEN_BYTE_ARRAY #14113

Closed
etseidl opened this issue Sep 14, 2023 · 0 comments · Fixed by #16109
Closed
Assignees
Labels
2 - In Progress Currently a work in progress cuIO cuIO issue feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. Performance Performance related issue

Comments

@etseidl
Copy link
Contributor

etseidl commented Sep 14, 2023

As part of the preprocessing of PLAIN encoded string data in the parquet reader, a pass through the page data is performed to either gather string sizes, or initialize {ptr, length} tuples for use by the decoder. For variable width string data, this pass must be performed by a single thread. But in the case of fixed width data, all threads in the warp should be able to participate.

__device__ size_type gpuInitStringDescriptors(page_state_s volatile* s,

@etseidl etseidl added Needs Triage Need team to review and classify feature request New feature or request labels Sep 14, 2023
@GregoryKimball GregoryKimball added 0 - Backlog In queue waiting for assignment libcudf Affects libcudf (C++/CUDA) code. cuIO cuIO issue Performance Performance related issue and removed Needs Triage Need team to review and classify labels Sep 27, 2023
@GregoryKimball GregoryKimball removed this from libcudf Oct 26, 2023
@mhaseeb123 mhaseeb123 self-assigned this Jun 27, 2024
@mhaseeb123 mhaseeb123 added 2 - In Progress Currently a work in progress and removed 0 - Backlog In queue waiting for assignment labels Jun 28, 2024
@rapids-bot rapids-bot bot closed this as completed in 7cc01be Jul 9, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2 - In Progress Currently a work in progress cuIO cuIO issue feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. Performance Performance related issue
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants