Don't unnecessarily read string offsets when doing concatenate overflow checking. #8968

nvdbaranec · 2021-08-05T16:31:18Z

We were always reading string offsets (device->gpu memcpy) during the concatenation overflow checking which was unnecessary when dealing with an unsliced column, resulting in a performance degradation. This fixes that.

…n't read the offsets to compute size.

abellina

q4, q47, q57 are back to wall clock times seen in 21.06 with this change as is (with the last commit). Thanks @nvdbaranec!

nvdbaranec · 2021-08-05T23:40:19Z

rerun tests

robertmaynard · 2021-08-06T13:27:46Z

cpp/src/copying/concatenate.cu

-                    : cudf::detail::get_value<offset_type>(
-                        scv.offsets(), scv.offset() + b.size(), stream) -
-                        cudf::detail::get_value<offset_type>(scv.offsets(), scv.offset(), stream));
+      return a + (scv.is_empty() ? 0


Not urgent but this feels ripe for refactoring to a lambda given we have dual nested ternary statements.

something like

auto computed_length = [](...) { .... }; return a + (scv.is_empty() ? 0 : computed_length);

During overflow checking, if we are dealing with unsliced columns, do…

73ebadc

…n't read the offsets to compute size.

nvdbaranec added libcudf Affects libcudf (C++/CUDA) code. improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Aug 5, 2021

nvdbaranec requested a review from a team as a code owner August 5, 2021 16:31

nvdbaranec requested review from robertmaynard and codereport and removed request for a team August 5, 2021 16:31

jlowe linked an issue Aug 5, 2021 that may be closed by this pull request

[BUG] 50% performance regression in concatenate for string columns observed in 21.08 #8960

Closed

davidwendt approved these changes Aug 5, 2021

View reviewed changes

ttnghia approved these changes Aug 5, 2021

View reviewed changes

nvdbaranec added the 5 - DO NOT MERGE Hold off on merging; see PR for details label Aug 5, 2021

Flipped the logic on the is-unsliced check.

124272d

abellina approved these changes Aug 5, 2021

View reviewed changes

nvdbaranec removed the 5 - DO NOT MERGE Hold off on merging; see PR for details label Aug 5, 2021

robertmaynard reviewed Aug 6, 2021

View reviewed changes

raydouglass approved these changes Aug 6, 2021

View reviewed changes

raydouglass merged commit 29302e0 into rapidsai:branch-21.08 Aug 6, 2021

pxLi mentioned this pull request Aug 9, 2021

[BUG] Regression seen in concatenate in NDS with RAPIDS Shuffle Manager enabled NVIDIA/spark-rapids#3135

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't unnecessarily read string offsets when doing concatenate overflow checking. #8968

Don't unnecessarily read string offsets when doing concatenate overflow checking. #8968

nvdbaranec commented Aug 5, 2021 •

edited

Loading

abellina left a comment

nvdbaranec commented Aug 5, 2021

robertmaynard Aug 6, 2021

Don't unnecessarily read string offsets when doing concatenate overflow checking. #8968

Don't unnecessarily read string offsets when doing concatenate overflow checking. #8968

Conversation

nvdbaranec commented Aug 5, 2021 • edited Loading

abellina left a comment

Choose a reason for hiding this comment

nvdbaranec commented Aug 5, 2021

robertmaynard Aug 6, 2021

Choose a reason for hiding this comment

nvdbaranec commented Aug 5, 2021 •

edited

Loading