[FEA] Parallelize gpuInitStringDescriptors when Parquet input type is FIXED_LEN_BYTE_ARRAY #14113
Labels
2 - In Progress
Currently a work in progress
cuIO
cuIO issue
feature request
New feature or request
libcudf
Affects libcudf (C++/CUDA) code.
Performance
Performance related issue
Milestone
As part of the preprocessing of PLAIN encoded string data in the parquet reader, a pass through the page data is performed to either gather string sizes, or initialize
{ptr, length}
tuples for use by the decoder. For variable width string data, this pass must be performed by a single thread. But in the case of fixed width data, all threads in the warp should be able to participate.cudf/cpp/src/io/parquet/page_decode.cuh
Line 409 in 1bfeee7
The text was updated successfully, but these errors were encountered: