Add CUDF_UNREACHABLE macro. #9727

bdice · 2021-11-19T01:29:41Z

Resolves #7753. I replaced all instances of cudf_assert(false && "message"); with CUDF_UNREACHABLE("message");. There are a few instances where the condition of the assertion is not always false, and thus the code following it may still be reachable. I did not change those cases.

cpp/include/cudf/detail/utilities/assert.cuh

cpp/include/cudf/utilities/type_dispatcher.hpp

…reachable

codecov · 2021-11-20T00:08:44Z

Codecov Report

Merging #9727 (3d3000c) into branch-22.04 (4596244) will decrease coverage by 0.12%.
The diff coverage is 93.70%.

@@               Coverage Diff                @@
##           branch-22.04    #9727      +/-   ##
================================================
- Coverage         86.13%   86.01%   -0.13%     
================================================
  Files               139      139              
  Lines             22438    22426      -12     
================================================
- Hits              19328    19290      -38     
- Misses             3110     3136      +26

Impacted Files	Coverage Δ
python/cudf/cudf/core/dataframe.py	`93.57% <ø> (ø)`
python/cudf/cudf/core/series.py	`95.16% <ø> (ø)`
python/dask_cudf/dask_cudf/tests/test_accessor.py	`98.41% <ø> (ø)`
python/cudf/cudf/core/column/decimal.py	`91.30% <73.68%> (-1.01%)`	⬇️
python/cudf/cudf/core/column/categorical.py	`89.63% <84.61%> (-0.29%)`	⬇️
python/cudf/cudf/core/column/string.py	`88.91% <94.44%> (+0.64%)`	⬆️
python/cudf/cudf/core/column/numerical.py	`95.62% <95.83%> (+0.64%)`	⬆️
python/cudf/cudf/api/types.py	`89.79% <100.00%> (ø)`
python/cudf/cudf/core/column/column.py	`89.27% <100.00%> (+0.10%)`	⬆️
python/cudf/cudf/core/column/datetime.py	`88.55% <100.00%> (-0.52%)`	⬇️
... and 15 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 621d26f...3d3000c. Read the comment docs.

…reachable

cpp/include/cudf/ast/detail/operators.hpp

…reachable

…ail rather than be unreachable).

github-actions · 2022-03-02T06:08:01Z

This PR has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this PR if it is no longer required. Otherwise, please respond with a comment indicating any updates. This PR will be labeled inactive-90d if there is no activity in the next 60 days.

jrhemstad · 2022-03-02T15:11:03Z

@bdice can we get this wrapped up?

bdice · 2022-03-02T15:16:55Z

@jrhemstad Yes! I might need guidance on a few cases where I am unsure about whether the path should actually be marked as unreachable. I think the current state of the PR may be too aggressive in applying that.

…reachable

cpp/src/hash/hashing.cu

bdice · 2022-03-15T21:11:03Z

rerun tests

cpp/include/cudf/ast/detail/expression_evaluator.cuh

nvdbaranec · 2022-03-16T15:30:52Z

rerun tests

cpp/src/io/parquet/writer_impl.cu

cpp/include/cudf/detail/utilities/assert.cuh

…reachable

vuule · 2022-03-17T19:18:45Z

cpp/include/cudf/utilities/type_dispatcher.hpp

 #ifndef __CUDA_ARCH__
-      CUDF_FAIL("Unsupported type_id.");
+      CUDF_FAIL("Invalid type_id.");
 #else
-      cudf_assert(false && "Unsupported type_id.");
-
-      // The following code will never be reached, but the compiler generates a
-      // warning if there isn't a return value.
-
-      // Need to find out what the return type is in order to have a default
-      // return value and solve the compiler warning for lack of a default
-      // return
-      using return_type = decltype(f.template operator()<int8_t>(std::forward<Ts>(args)...));
-      return return_type();
+      CUDF_UNREACHABLE("Invalid type_id.");
 #endif


Would it be useful to make this into a single macro (maybe this should be CUDF_UNREACHABLE, so it covers both host and device code)? I see the pattern in a few places in the PR.

I considered that, but I didn't want to hide the dependence on #ifndef __CUDA_ARCH__. Failure/raising an error and unreachable code mean very different things in my opinion, and I didn't want to conflate them by replacing this with an idiom that has potential for misuse. What do you think?

I'm not sure. It's weird because we do have the uneven handling between host and device as it is. Maybe it should be the other way around, and CUDF_FAIL can call CUDF_UNREACHABLE if in device code. As in - "we failed on the device, here's an assert if debug and don't expect a return".

Tagging @jrhemstad for thoughts on this. I would defer that change to a later PR if possible.

I think I'm still in favor of keeping these macros separate. Letting CUDF_FAIL defer to an unreachable path seems dangerous. Developers that see CUDF_FAIL should be able to reasonably expect an error, and should not use it to signify branches that can be optimized out as impossible to reach. A macro named something like CUDF_IMPOSSIBLE might be a compromise, but I think a combined macro like that would obscure the intention (in harmful ways) more than it helps with cleanliness/brevity.

Yeah, obscuring the intention is the main issue I can see.

Here's what bugs me: we are using CUDF_UNREACHABLE both for truly unreachable code and failure. Ideally, CUDF_UNREACHABLE macro would call GCC's __builtin_unreachable() if in host code. But we call CUDF_FAIL instead in such cases.
Feels like code that should not be executed should use CUDF_FAIL (both host and device) and truly unreachable code should use CUDF_UNREACHABLE (both host and device). I understand that this may do more hard than good, just bringing it up for consideration.

I believe all the cases handled in this way are actually unreachable (by enum exhaustion, in most cases). We’re just taking the opportunity to raise an error on the host because we can do that without any significant performance or compile time penalty.

vuule

couple more nitpicks, looks 🔥 otherwise

cpp/src/io/parquet/chunk_dict.cu

bdice · 2022-03-17T23:34:10Z

rerun tests

jrhemstad · 2022-03-18T13:21:47Z

rerun tests

…reachable

bdice · 2022-03-18T21:27:20Z

@gpucibot merge

This reverts commit 48cebf7.

@nvdbaranec

…seen in NDS q72 in Spark (#10534) The following change addresses a performance degradation we noticed in the `mixed_join` and `compute_mixed_join_output_size` that looks to be tied to the theoretical occupancy of these kernels, as limited by the number of registers used. The regression is triggered by this patch: #9727, which improves handling of unreachable code paths. That said, somehow, this change is altering the number of registers these kernels need. Both `mixed_join` and `compute_mixed_join_output_size` are very sensitive to the register count, per NSight compute. With the patch, the register required changed from 92 to 102, and 118 to 141 respectively. The fix here hints the compiler what our block size is (128 threads). This, from our testing, allows the compiler to reduce the number of registers required to 128 for `compute_mixed_join_output_size` and 96 for `mixed_join`. This lead to better occupancy (I think @nvdbaranec measured it going from 30% to 50%) and I saw the wall clock time of q72 (which started all this) to go from 133s to 121s, which is within the ballpark I'd expect. Authors: - Alessandro Bellina (https://github.com/abellina) Approvers: - Mike Wilson (https://github.com/hyperbolic2346)

This reverts commit 48cebf7.

bdice added 3 commits November 18, 2021 15:51

Add CUDF_UNREACHABLE macro.

937a512

Add unreachable macro to type dispatcher.

106b708

Add unreachable macro to AST operator dispatcher.

f9894d1

github-actions bot added the libcudf Affects libcudf (C++/CUDA) code. label Nov 19, 2021

bdice changed the title ~~Add CUDF_UNREACHABLE~~ Add CUDF_UNREACHABLE macro. Nov 19, 2021

bdice added non-breaking Non-breaking change improvement Improvement / enhancement to an existing function labels Nov 19, 2021

jrhemstad reviewed Nov 19, 2021

View reviewed changes

cpp/include/cudf/detail/utilities/assert.cuh Outdated Show resolved Hide resolved

jrhemstad reviewed Nov 19, 2021

View reviewed changes

cpp/include/cudf/utilities/type_dispatcher.hpp Outdated Show resolved Hide resolved

bdice self-assigned this Nov 19, 2021

bdice added 3 commits November 19, 2021 11:30

Merge remote-tracking branch 'upstream/branch-22.02' into add-cudf-un…

8f41a74

…reachable

Update CUDF_UNREACHABLE docs.

ab1cdb9

Throw error in host-side type dispatcher if invalid type is passed.

77c97ae

jrhemstad approved these changes Dec 13, 2021

View reviewed changes

bdice changed the base branch from branch-22.02 to branch-22.04 January 20, 2022 04:41

Merge remote-tracking branch 'upstream/branch-22.04' into add-cudf-un…

2c39210

…reachable

bdice commented Jan 24, 2022

View reviewed changes

cpp/include/cudf/ast/detail/operators.hpp Outdated Show resolved Hide resolved

bdice added 4 commits January 24, 2022 14:03

Use CUDF_UNREACHABLE only on device in AST operator dispatch.

2d61ea9

Merge remote-tracking branch 'upstream/branch-22.04' into add-cudf-un…

a0d9baa

…reachable

Merge remote-tracking branch 'upstream/branch-22.04' into add-cudf-un…

8c7d978

…reachable

Use CUDF_UNREACHABLE (may be too aggressive, some cases may need to f…

52ad3a0

…ail rather than be unreachable).

github-actions bot added the inactive-30d label Mar 2, 2022

Merge remote-tracking branch 'upstream/branch-22.04' into add-cudf-un…

c320c1f

…reachable

bdice marked this pull request as ready for review March 15, 2022 21:02

bdice requested a review from a team as a code owner March 15, 2022 21:02

bdice requested review from vuule and nvdbaranec March 15, 2022 21:02

bdice commented Mar 15, 2022

View reviewed changes

cpp/src/hash/hashing.cu Outdated Show resolved Hide resolved

jrhemstad reviewed Mar 15, 2022

View reviewed changes

cpp/include/cudf/ast/detail/expression_evaluator.cuh Show resolved Hide resolved

nvdbaranec approved these changes Mar 16, 2022

View reviewed changes

vuule reviewed Mar 16, 2022

View reviewed changes

cpp/src/io/parquet/writer_impl.cu Outdated Show resolved Hide resolved

cpp/include/cudf/detail/utilities/assert.cuh Show resolved Hide resolved

vyasr mentioned this pull request Mar 16, 2022

Faster struct row comparator #10164

Merged

bdice added 3 commits March 17, 2022 12:11

Merge remote-tracking branch 'upstream/branch-22.04' into add-cudf-un…

6d8d498

…reachable

Add comment about assert in debug mode.

8c2b1dc

Remove unnecessary returns.

1407315

vuule reviewed Mar 17, 2022

View reviewed changes

cpp/src/io/parquet/chunk_dict.cu Outdated Show resolved Hide resolved

cpp/src/io/parquet/chunk_dict.cu Outdated Show resolved Hide resolved

Add explicit type since auto deduction fails with CUDF_UNREACHABLE.

b66c4c4

Merge remote-tracking branch 'upstream/branch-22.04' into add-cudf-un…

3d3000c

…reachable

bdice added the 5 - Ready to Merge Testing and reviews complete, ready to merge label Mar 18, 2022

rapids-bot bot merged commit 48cebf7 into rapidsai:branch-22.04 Mar 18, 2022

abellina added a commit to abellina/cudf that referenced this pull request Mar 28, 2022

Revert "Add CUDF_UNREACHABLE macro. (rapidsai#9727)"

ce08bfd

This reverts commit 48cebf7.

abellina mentioned this pull request Mar 29, 2022

Adds launch bounds hints to mixed join kernels to address regression seen in NDS q72 in Spark #10534

Merged

abellina added a commit to abellina/cudf that referenced this pull request Apr 14, 2022

Revert "Add CUDF_UNREACHABLE macro. (rapidsai#9727)"

bc6d800

This reverts commit 48cebf7.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add CUDF_UNREACHABLE macro. #9727

Add CUDF_UNREACHABLE macro. #9727

bdice commented Nov 19, 2021 •

edited

Loading

codecov bot commented Nov 20, 2021 •

edited

Loading

github-actions bot commented Mar 2, 2022

jrhemstad commented Mar 2, 2022

bdice commented Mar 2, 2022

bdice commented Mar 15, 2022

nvdbaranec commented Mar 16, 2022

vuule Mar 17, 2022

bdice Mar 17, 2022 •

edited

Loading

vuule Mar 17, 2022

bdice Mar 17, 2022

bdice Mar 17, 2022 •

edited

Loading

vuule Mar 17, 2022

bdice Mar 17, 2022 •

edited

Loading

vuule left a comment

bdice commented Mar 17, 2022

jrhemstad commented Mar 18, 2022

bdice commented Mar 18, 2022

Add CUDF_UNREACHABLE macro. #9727

Add CUDF_UNREACHABLE macro. #9727

Conversation

bdice commented Nov 19, 2021 • edited Loading

codecov bot commented Nov 20, 2021 • edited Loading

Codecov Report

github-actions bot commented Mar 2, 2022

jrhemstad commented Mar 2, 2022

bdice commented Mar 2, 2022

bdice commented Mar 15, 2022

nvdbaranec commented Mar 16, 2022

vuule Mar 17, 2022

Choose a reason for hiding this comment

bdice Mar 17, 2022 • edited Loading

Choose a reason for hiding this comment

vuule Mar 17, 2022

Choose a reason for hiding this comment

bdice Mar 17, 2022

Choose a reason for hiding this comment

bdice Mar 17, 2022 • edited Loading

Choose a reason for hiding this comment

vuule Mar 17, 2022

Choose a reason for hiding this comment

bdice Mar 17, 2022 • edited Loading

Choose a reason for hiding this comment

vuule left a comment

Choose a reason for hiding this comment

bdice commented Mar 17, 2022

jrhemstad commented Mar 18, 2022

bdice commented Mar 18, 2022

bdice commented Nov 19, 2021 •

edited

Loading

codecov bot commented Nov 20, 2021 •

edited

Loading

bdice Mar 17, 2022 •

edited

Loading

bdice Mar 17, 2022 •

edited

Loading

bdice Mar 17, 2022 •

edited

Loading