This repository has been archived by the owner on Mar 21, 2024. It is now read-only.

Consider always discarding __device__ lambdas' results #779

Closed
jaredhoberock opened this issue Apr 20, 2016 · 10 comments
Assignees: brycelelbach
Labels: thrust, triage (Needs investigation and classification), type: enhancement (New feature or request)

Comments

@jaredhoberock
Contributor

jaredhoberock commented Apr 20, 2016

Host functions can't rely on the result of std::result_of when it is instantiated with a __device__ lambda.

If we have the means to detect their presence at compile time, then one option is to ban them entirely from bulk_invoke and friends. A more relaxed policy would be to always treat their result as void. That would still allow them to be used, but it would not be possible to return results from them.

@jaredhoberock jaredhoberock added the type: enhancement New feature or request. label Apr 20, 2016
@brycelelbach
Collaborator

I don't think there's a way to detect that it's a lambda.

I'm not sure I understand why result_of doesn't work in this instance?

Would decltype work?

@brycelelbach brycelelbach added the triage Needs investigation and classification. label Oct 13, 2017
@brycelelbach brycelelbach self-assigned this Oct 13, 2017
@jaredhoberock
Contributor Author

The issue is that __device__ lambdas always return int! Even if there is no return statement inside at all. Ugh! That means utilities like result_of, invoke_result, or decltype can't reliably determine what __device__ lambdas return. :-(

Fortunately, we are able to distinguish __device__ lambdas from other types.

Agency uses this inside its implementation of result_of to treat all __device__ lambdas as if they return void.
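A minimal sketch of that approach (not Agency's actual code), assuming nvcc with --extended-lambda; the trait name result_of_or_void is hypothetical. It uses nvcc's documented __nv_is_extended_device_lambda_closure_type built-in to detect extended __device__ lambdas and forces their result type to void:

```c++
#include <type_traits>

#if defined(__CUDACC__) && defined(__CUDACC_EXTENDED_LAMBDA__)
// Under nvcc with --extended-lambda, use the compiler built-in to detect
// closure types that were created for extended __device__ lambdas.
template <class T>
struct is_extended_device_lambda
  : std::integral_constant<bool, __nv_is_extended_device_lambda_closure_type(T)> {};
#else
template <class T>
struct is_extended_device_lambda : std::false_type {};
#endif

// Primary template: defer to std::result_of for ordinary callables.
template <class Signature, class Enable = void>
struct result_of_or_void : std::result_of<Signature> {};

// Specialization: treat every extended __device__ lambda as returning void,
// because its host-side operator() cannot be trusted.
template <class F, class... Args>
struct result_of_or_void<F(Args...),
  typename std::enable_if<
    is_extended_device_lambda<typename std::decay<F>::type>::value>::type>
{
  using type = void;
};
```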

@alliepiper
Collaborator

I'm going to start using this issue to track the various "__device__ lambdas aren't working" issues.

The issues Jared mentioned above are documented here in the current toolkit documentation.

Until we have a better system in place, users should stick with __host__ __device__ lambdas or explicit __device__ functor objects instead.
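A sketch of those workarounds, assuming Thrust and nvcc with --extended-lambda: a __host__ __device__ lambda or a plain functor exposes its real return type to the host compiler, unlike a pure __device__ lambda.

```c++
#include <thrust/device_vector.h>
#include <thrust/transform.h>

struct square
{
  __host__ __device__ float operator()(float x) const { return x * x; }
};

int main()
{
  thrust::device_vector<float> in(10, 2.0f);
  thrust::device_vector<float> out(10);

  // Option 1: __host__ __device__ lambda (return type visible to host code).
  thrust::transform(in.begin(), in.end(), out.begin(),
                    [] __host__ __device__ (float x) { return x * x; });

  // Option 2: explicit functor object.
  thrust::transform(in.begin(), in.end(), out.begin(), square{});
  return 0;
}
```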

@jrhemstad
Collaborator

jrhemstad commented Oct 1, 2020

Just for completeness, the exact restriction is G.6.2p13:

As described above, the CUDA compiler replaces a device extended lambda defined in a host function with a placeholder type defined in namespace scope. This placeholder type does not define a operator() function equivalent to the original lambda declaration. An attempt to determine the return type or parameter types of the operator() function may therefore work incorrectly in host code, as the code processed by the host compiler will be semantically different than the input code processed by the CUDA compiler.

I've stopped using __device__ lambdas altogether with Thrust after I learned about this restriction. Too great a chance of silent or non-obvious failures.

@harrism
Contributor

harrism commented Oct 2, 2020

@jaredhoberock said "device lambdas always return int". That can't be true; I have used device lambdas that return non-int before. I guess what Jared means is that result_of on a device lambda always evaluates to int?

@alliepiper
Collaborator

Correct. The device lambda object is replaced with a placeholder object in host code, and that placeholder's API does not accurately represent the device lambda's actual call signature.

@jrhemstad
Collaborator

jrhemstad commented Oct 2, 2020

I guess what Jared means is that result_of on a device lambda always evaluates to int

https://godbolt.org/z/6ob7q5

Not necessarily, no. It's more that the return type of the callable (as determined from host code) cannot be relied upon to be correct.

@harrism
Contributor

harrism commented Oct 2, 2020

I just wanted reassurance that if I call a device lambda that returns float from a kernel, it doesn't return int.
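A sketch illustrating that distinction (assuming nvcc with --extended-lambda): calling the __device__ lambda on the device uses its real return type; only host-side introspection of the placeholder type is unreliable.

```c++
#include <cstdio>

template <class F>
__global__ void call_it(F f)
{
  float r = f(3.0f);        // genuinely returns float on the device
  printf("r = %f\n", r);    // prints 9.000000
}

int main()
{
  auto square = [] __device__ (float x) { return x * x; };

  // Querying the lambda's return type here, in host code, is what may go
  // wrong; the device-side call above is fine.
  call_it<<<1, 1>>>(square);
  cudaDeviceSynchronize();
  return 0;
}
```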

@pauleonix

I'm going to start using this issue to track the various "__device__ lambdas aren't working" issues.

#1650 (comment) has a reproducer for an issue with both __device__ and __host__ __device__ lambdas causing thrust::transform_output_iterator< ... >::operator= to be deleted. Further information in #1650 (comment)

@jrhemstad
Collaborator

Thanks to NVIDIA/libcudacxx#284, we will automatically error at compile time in instances where we attempt to query the return type of an extended lambda in a context where that is not supported.

@github-project-automation github-project-automation bot moved this to Done in CCCL Mar 7, 2023
pauleonix added a commit to pauleonix/thrustshift that referenced this issue May 2, 2023
Use __host__ __device__ lambdas instead of pure __device__ lambdas to avoid problems with their return type.
See NVIDIA/thrust#779 (comment)
In CUDA 12 one could use cuda::proclaim_return_type from libcu++ instead, but this won't work for earlier CUDA versions.

Add -DCUDA_FORCE_CDP1_IF_SUPPORTED to allow the usage of legacy CUDA Dynamic Parallelism
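A sketch of the CUDA 12 / libcu++ alternative mentioned in that commit message, assuming nvcc with --extended-lambda and a CUDA 12 toolkit: cuda::proclaim_return_type wraps a __device__ lambda and records its return type so host code never has to inspect the placeholder.

```c++
#include <cuda/functional>           // cuda::proclaim_return_type
#include <thrust/device_vector.h>
#include <thrust/transform.h>

int main()
{
  thrust::device_vector<float> in(10, 2.0f);
  thrust::device_vector<float> out(10);

  // The proclaimed return type (float) is what Thrust sees on the host.
  auto square = cuda::proclaim_return_type<float>(
      [] __device__ (float x) { return x * x; });

  thrust::transform(in.begin(), in.end(), out.begin(), square);
  return 0;
}
```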