[FEA] Improve performance of copy_if_else for long strings #15014

revans2 · 2024-02-09T15:52:22Z

Is your feature request related to a problem? Please describe.
We have a number of use cases where we do a copy_if_else on long strings. This can end up taking a long time, like in the case of from_json, when the input strings are large it ends up being the biggest kernel by total time. Larger than unsnap to decompress the original input string data. Larger that the tokenization kernels to tokenize the JSON data.

That first line is the string copy_if_else kernel that took 36.4% of the total kernel time.

When I look at the strings copy_if_else code I see a single thread per string and it ends up doing a memcpy.

cudf/cpp/include/cudf/strings/detail/copy_if_else.cuh

Line 107 in 6638b52

memcpy(d_chars + d_offsets[idx], d_str.data(), d_str.size_bytes());

I am not CUDA expert so I could be wrong about all of this, but I think we should be able to detect if the average string size is larger than a specific amount, and do a string per warp or something to help coalesce the memory read and write performance.

The text was updated successfully, but these errors were encountered:

Reworks the `cudf::strings::detail::copy_if_else()` to improve performance for long strings. The rework builds a vector of rows to pass to the `make_strings_column` factory that uses the optimized `gather_chars` function. Also includes a benchmark for copy_if_else specifically for strings columns. Closes #15014 Authors: - David Wendt (https://github.com/davidwendt) Approvers: - Bradley Dice (https://github.com/bdice) - Vukasin Milovanovic (https://github.com/vuule) URL: #15017

revans2 added feature request New feature or request Performance Performance related issue Spark Functionality that helps Spark RAPIDS labels Feb 9, 2024

davidwendt self-assigned this Feb 9, 2024

davidwendt mentioned this issue Feb 9, 2024

Improve performance of copy_if_else for long strings #15017

Merged

3 tasks

rapids-bot bot closed this as completed in #15017 Feb 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Improve performance of copy_if_else for long strings #15014

[FEA] Improve performance of copy_if_else for long strings #15014

revans2 commented Feb 9, 2024

[FEA] Improve performance of copy_if_else for long strings #15014

[FEA] Improve performance of copy_if_else for long strings #15014

Comments

revans2 commented Feb 9, 2024