Rework logic in cudf::strings::split_record to improve performance #12729

davidwendt · 2023-02-08T00:43:53Z

Description

Updates the cudf::strings::split_record logic to match the more optimized code in cudf::strings::split.
The optimized code performs much better for longer strings (>64 bytes) by parallelizing over the character bytes to find delimiters before determining split tokens.
This led to refactoring the code so that both APIs can share the optimized code.
Also fixes a bug found when using overlapping delimiters.
Additional tests were added for multi-byte delimiters, which can overlap and span multiple adjacent strings.
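
For readers unfamiliar with the approach, here is a minimal CPU-only sketch of the two-phase idea described above. It is not the libcudf implementation: the function names `find_delimiters` and `split_rows`, and the flat `chars` plus `offsets` layout used here, are assumptions made for illustration, and null rows are not modeled. Phase 1 tests every byte position independently (the part that maps to one GPU thread per byte), and phase 2 walks the candidates of each row, dropping any delimiter that overlaps a previously accepted one or that spans into the next row.

```cpp
#include <cstddef>
#include <string>
#include <string_view>
#include <utility>
#include <vector>

// Phase 1 (hypothetical helper): record every byte position where the
// delimiter starts. Each position is tested independently of the others,
// which is what makes this step parallelizable over the character bytes.
std::vector<std::size_t> find_delimiters(std::string_view chars, std::string_view delim)
{
  std::vector<std::size_t> positions;
  if (delim.empty() || chars.size() < delim.size()) { return positions; }
  for (std::size_t i = 0; i + delim.size() <= chars.size(); ++i) {
    if (chars.compare(i, delim.size(), delim) == 0) { positions.push_back(i); }
  }
  return positions;
}

// Phase 2 (hypothetical helper): turn the candidate positions into tokens,
// one list per row. Row r occupies bytes [offsets[r], offsets[r + 1]).
std::vector<std::vector<std::string>> split_rows(std::string_view chars,
                                                 std::vector<std::size_t> const& offsets,
                                                 std::string_view delim)
{
  auto const candidates = find_delimiters(chars, delim);
  std::vector<std::vector<std::string>> result;
  std::size_t c = 0;  // index into candidates, which are in ascending order
  for (std::size_t row = 0; row + 1 < offsets.size(); ++row) {
    auto const begin = offsets[row];
    auto const end   = offsets[row + 1];
    std::vector<std::string> tokens;
    std::size_t token_start = begin;
    // Skip any candidates that start before this row.
    while (c < candidates.size() && candidates[c] < begin) { ++c; }
    for (; c < candidates.size() && candidates[c] < end; ++c) {
      auto const pos = candidates[c];
      if (pos + delim.size() > end) { continue; }  // delimiter spans into the next row
      if (pos < token_start) { continue; }         // overlaps the previously accepted delimiter
      tokens.emplace_back(chars.substr(token_start, pos - token_start));
      token_start = pos + delim.size();
    }
    // The remainder of the row is the final token.
    tokens.emplace_back(chars.substr(token_start, end - token_start));
    result.push_back(std::move(tokens));
  }
  return result;
}
```

With the delimiter `::`, calling `split_rows("a::b:::c", {0, 8}, "::")` sees candidates at bytes 1, 4 and 5; the candidate at byte 5 overlaps the delimiter accepted at byte 4 and is dropped. For the two rows `ab:` and `:cd` stored back to back, the `::` that straddles the row boundary is not treated as a delimiter for either row.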

Closes #12694

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

codecov · 2023-02-08T02:06:14Z

Codecov Report

❗ No coverage uploaded for pull request base (branch-23.04@ec8704a).
Patch has no changes to coverable lines.

❗ Current head d7dcb2a differs from pull request most recent head 524e038. Consider uploading reports for the commit 524e038 to get more accurate results

Additional details and impacted files

@@               Coverage Diff               @@
##             branch-23.04   #12729   +/-   ##
===============================================
  Coverage                ?   85.85%           
===============================================
  Files                   ?      158           
  Lines                   ?    25204           
  Branches                ?        0           
===============================================
  Hits                    ?    21638           
  Misses                  ?     3566           
  Partials                ?        0



davidwendt · 2023-02-13T21:27:58Z

Performance numbers for cudf::strings::split_record (columns: benchmark, time before this change, time after this change, speedup = before / after)

/record/4096/32          0.186         0.262      0.71x
/record/4096/64          0.254         0.270      0.94x
/record/4096/128         0.408         0.281      1.45x
/record/4096/256         0.845         0.307      2.75x
/record/4096/512          2.25         0.280      8.04x
/record/4096/1024         7.89         0.307     25.70x
/record/4096/2048         29.3         0.376     77.93x
/record/4096/4096          115         0.498    230.92x
/record/4096/8192          464         0.752    617.02x
/record/32768/32         0.204         0.274      0.74x
/record/32768/64         0.279         0.297      0.94x
/record/32768/128        0.506         0.349      1.45x
/record/32768/256         1.05         0.444      2.36x
/record/32768/512         2.71         0.413      6.56x
/record/32768/1024        8.36         0.561     14.90x
/record/32768/2048        36.5         0.861     42.39x
/record/32768/4096         164          1.46    112.33x
/record/32768/8192         706          2.70    261.48x
/record/262144/32        0.328         0.419      0.78x
/record/262144/64        0.576         0.581      0.99x
/record/262144/128        1.47          1.16      1.27x
/record/262144/256        5.65          3.62      1.56x
/record/262144/512        12.3          1.51      8.15x
/record/262144/1024       48.3          2.71     17.82x
/record/262144/2048        216          5.59     38.64x
/record/262144/4096       1066          11.8     90.34x
/record/2097152/32        1.24          1.47      0.84x
/record/2097152/64        2.85          2.73      1.04x
/record/2097152/128       8.27          6.51      1.27x
/record/2097152/256       31.9          23.1      1.38x
/record/2097152/512       55.6          10.5      5.30x
/record/16777216/32       8.79          10.1      0.87x
/record/16777216/64       21.3          20.2      1.05x


PointKernel

LGTM with some nits (not necessarily a change request though)


davidwendt · 2023-02-21T12:48:26Z

/merge

Rework logic in cudf::strings::split_record to improve performance

a8f4509

davidwendt added the bug, 2 - In Progress, libcudf, strings, and non-breaking labels Feb 8, 2023

davidwendt self-assigned this Feb 8, 2023

Merge branch 'branch-23.04' into perf-split-record

596eb86

davidwendt added 20 commits February 8, 2023 10:42

Merge branch 'branch-23.04' into perf-split-record

0d8c67a

improve gpu utilization

436cd58

Merge branch 'branch-23.04' into perf-split-record

d7dcb2a

Merge branch 'branch-23.04' into perf-split-record

f5d1bb0

Merge branch 'perf-split-record' of github.com:davidwendt/cudf into p…

8132d44

…erf-split-record

Merge branch 'branch-23.04' into perf-split-record

cae6bb0

use CRTP for split/rsplit functors

498f697

Merge branch 'branch-23.04' into perf-split-record

7a55f0f

Merge branch 'branch-23.04' into perf-split-record

d37c0e5

Merge branch 'perf-split-record' of github.com:davidwendt/cudf into p…

e49eb02

…erf-split-record

refactor common code to split.cuh

5f61d59

Merge branch 'branch-23.04' into perf-split-record

6315e17

fix style check

a68cb97

fix overlapping delimiter logic in forward split

b7c89bb

Merge branch 'branch-23.04' into perf-split-record

9637d13

Merge branch 'branch-23.04' into perf-split-record

3476581

Merge branch 'perf-split-record' of github.com:davidwendt/cudf into p…

309c23d

…erf-split-record

fix overlapped delimiters logic

ce1dae1

null rows should not create tokens

cd041a5

Merge branch 'branch-23.04' into perf-split-record

9bfb9aa

davidwendt removed the 2 - In Progress label Feb 13, 2023

davidwendt added the 3 - Ready for Review label Feb 13, 2023

davidwendt marked this pull request as ready for review February 13, 2023 21:31

davidwendt requested a review from a team as a code owner February 13, 2023 21:31

davidwendt requested review from ttnghia and nvdbaranec February 13, 2023 21:31

ttnghia reviewed Feb 13, 2023

cpp/src/strings/split/split_record.cu (outdated, resolved)

ttnghia reviewed Feb 13, 2023

cpp/tests/strings/split_tests.cpp (outdated, resolved)

add multi-byte delim gtests for split/rsplit

cec08f2

ttnghia mentioned this pull request Feb 14, 2023

[BUG] cudf::strings::split_record can be over 15x slower than a single thread on the CPU for some cases #12694

Closed

ttnghia reviewed Feb 14, 2023

cpp/src/strings/split/split_record.cu (resolved)

add CUDF_CUDA_TRY to cudaMemSetAsync call

f98fb29

ttnghia approved these changes Feb 15, 2023

davidwendt and others added 2 commits February 15, 2023 17:24

Merge branch 'branch-23.04' into perf-split-record

df4fa9e

Merge branch 'branch-23.04' into perf-split-record

06222ca

nvdbaranec reviewed Feb 16, 2023

cpp/src/strings/split/split.cuh (resolved)

nvdbaranec requested changes Feb 16, 2023

cpp/src/strings/split/split.cuh (outdated, resolved)

cpp/src/strings/split/split.cuh (resolved)

cpp/src/strings/split/split.cuh (outdated, resolved)

davidwendt added 3 commits February 16, 2023 15:47

Merge branch 'perf-split-record' of github.com:davidwendt/cudf into p…

23ec34a

…erf-split-record

Merge branch 'branch-23.04' into perf-split-record

795841e

fix comments per review

c73f2dd

PointKernel approved these changes Feb 16, 2023

cpp/benchmarks/string/split.cpp (outdated, resolved)

cpp/src/strings/split/split.cu (outdated, resolved)

cpp/src/strings/split/split.cuh (outdated, resolved)

cpp/src/strings/split/split.cuh (outdated, resolved)

make some const vars constexpr

efa0490

davidwendt requested a review from nvdbaranec February 16, 2023 21:53

nvdbaranec approved these changes Feb 16, 2023

Merge branch 'branch-23.04' into perf-split-record

524e038

rapids-bot bot merged commit 7da233b into rapidsai:branch-23.04 Feb 21, 2023

davidwendt deleted the perf-split-record branch February 21, 2023 16:36

GregoryKimball mentioned this pull request Apr 3, 2023

[FEA] Story - Improve performance with long strings #13048

Open
