Compile-time ipow computation with array lookup #15110

pmattione-nvidia · 2024-02-21T19:06:07Z

Description

Compile-time ipow() computation with array lookup. Results in up to 8% speed improvement for decimal64 -> double conversions. Improvement is negligible for other conversions but is not worse. New benchmark test will be in a separate PR. Fix fixed_point -> string conversion test. Also fix rounding comments. Closes #9346

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

…mments.

copy-pr-bot · 2024-02-21T19:06:12Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

davidwendt · 2024-02-21T19:19:48Z

/ok to test

davidwendt · 2024-02-21T22:21:44Z

/ok to test

cpp/include/cudf/fixed_point/fixed_point.hpp

…into issue_9346 Update local branch with changes from remote

davidwendt · 2024-02-21T23:04:58Z

/ok to test

PointKernel

LGTM

I've updated the PR description a bit so the corresponding issue can be closed once this PR gets merged. Thanks!

cpp/include/cudf/fixed_point/fixed_point.hpp

@note

Use doxygen @note Co-authored-by: Yunsong Wang <[email protected]>

davidwendt · 2024-02-22T23:42:12Z

/ok to test

pmattione-nvidia · 2024-02-28T20:12:03Z

Note that it's unclear (to me) what the optimal algorithm is for get_power(). It was previously the (logarithmic) squaring algorithm instead of this recursive one. However since we have to compute every power to fill the array, the compiler may be smart enough to optimize that, or benefit from caching effects. Either way this call is only performed at compile time (good candidate for consteval in C++20), and the compile time is dominated by the type dispatcher anyway. So we'll just use the recursive algorithm for now (simplest, perhaps easiest for compiler to optimize).

shrshi

Looks good to me, thank you for adding the comment!

PointKernel · 2024-02-28T20:15:38Z

Either way this call is only performed at compile time (good candidate for consteval in C++20), and the compile time is dominated by the type dispatcher anyway.

Valid point. I won't worry much about a build-time recursive call.

pmattione-nvidia · 2024-02-28T20:20:18Z

/merge

The addition of an array of integers in this function placed too much register pressure on our code base. This function is used by the fixed_point constructor and cast operators, so it potentially affects every kernel. Too many unrelated kernels were impacted and suffered performance degradations to justify this change. This reverts the algorithm introduced in #15110 to what it was previously, with some very minor tweaks. Authors: - Paul Mattione (https://github.com/pmattione-nvidia) Approvers: - Yunsong Wang (https://github.com/PointKernel) - Mike Wilson (https://github.com/hyperbolic2346) - Shruti Shivakumar (https://github.com/shrshi) - MithunR (https://github.com/mythrocks) URL: #15242

pmattione-nvidia added 2 commits February 21, 2024 12:15

Compile-time ipow computation with array lookup. Also fix rounding co…

4bb5ff9

…mments.

Fix strings convert text, add some comments.

5b9f82c

pmattione-nvidia requested a review from a team as a code owner February 21, 2024 19:06

pmattione-nvidia requested review from shrshi and PointKernel February 21, 2024 19:06

github-actions bot added the libcudf Affects libcudf (C++/CUDA) code. label Feb 21, 2024

davidwendt added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Feb 21, 2024

pmattione-nvidia and others added 2 commits February 21, 2024 16:56

Fix style

62f19b4

Merge branch 'rapidsai:branch-24.04' into issue_9346

7b12cf9

davidwendt reviewed Feb 21, 2024

View reviewed changes

cpp/include/cudf/fixed_point/fixed_point.hpp Outdated Show resolved Hide resolved

pmattione-nvidia added 2 commits February 21, 2024 17:31

Yet another style fix

ba2d186

Merge branch 'issue_9346' of https://github.com/pmattione-nvidia/cudf …

9df42e9

…into issue_9346 Update local branch with changes from remote

PointKernel approved these changes Feb 22, 2024

View reviewed changes

cpp/include/cudf/fixed_point/fixed_point.hpp Outdated Show resolved Hide resolved

Update cpp/include/cudf/fixed_point/fixed_point.hpp

5dccd25

Use doxygen @note Co-authored-by: Yunsong Wang <[email protected]>

mattahrens assigned pmattione-nvidia Feb 27, 2024

shrshi approved these changes Feb 28, 2024

View reviewed changes

rapids-bot bot merged commit 896b5bc into rapidsai:branch-24.04 Feb 28, 2024
69 checks passed

GregoryKimball mentioned this pull request Mar 6, 2024

Roll back ipow changes due to register pressure. #15242

Merged

3 tasks

pmattione-nvidia mentioned this pull request Apr 3, 2024

For powers of 10, replace ipow with switch #15353

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Compile-time ipow computation with array lookup #15110

Compile-time ipow computation with array lookup #15110

pmattione-nvidia commented Feb 21, 2024 •

edited by PointKernel

Loading

copy-pr-bot bot commented Feb 21, 2024

davidwendt commented Feb 21, 2024

davidwendt commented Feb 21, 2024

davidwendt commented Feb 21, 2024

PointKernel left a comment

davidwendt commented Feb 22, 2024

pmattione-nvidia commented Feb 28, 2024

shrshi left a comment

PointKernel commented Feb 28, 2024

pmattione-nvidia commented Feb 28, 2024

Compile-time ipow computation with array lookup #15110

Compile-time ipow computation with array lookup #15110

Conversation

pmattione-nvidia commented Feb 21, 2024 • edited by PointKernel Loading

Description

Checklist

copy-pr-bot bot commented Feb 21, 2024

davidwendt commented Feb 21, 2024

davidwendt commented Feb 21, 2024

davidwendt commented Feb 21, 2024

PointKernel left a comment

Choose a reason for hiding this comment

davidwendt commented Feb 22, 2024

pmattione-nvidia commented Feb 28, 2024

shrshi left a comment

Choose a reason for hiding this comment

PointKernel commented Feb 28, 2024

pmattione-nvidia commented Feb 28, 2024

pmattione-nvidia commented Feb 21, 2024 •

edited by PointKernel

Loading