-
Notifications
You must be signed in to change notification settings - Fork 184
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Build alpaka code in GPU-only mode [14.0.x] #9125
Build alpaka code in GPU-only mode [14.0.x] #9125
Conversation
Build alpaka device code for the CUDA and ROCm back-ends in "GPU only" mode. In this mode, functions marked as ALPAKA_FN_ACC are marked as __device__ functions, and are compiled only for the corresponding GPU device back-ends. Currently, functions marked as ALPAKA_FN_ACC are marked as __host__ __device__ functions, and may be compiled for both device and host back-ends. The latter leads to linker errors in kernels that use device symbols like threadIdx, blockIdx, etc. that are compiled for the ROCm back-end.
backport #9121 |
A new Pull Request was created by @fwyzard for branch IB/CMSSW_14_0_X/master. @aandvalenzuela, @smuzaffar, @cmsbuild, @iarspider can you please review it and eventually sign? Thanks.
|
cms-bot internal usage |
enable gpu |
please test with cms-sw/cmssw#44650 |
+1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-70672a/38660/summary.html Comparison SummarySummary:
GPU Comparison SummarySummary:
|
+externals @cms-sw/orp-l2 , feel free to include it for next 14.0.X IB/release |
This pull request is fully signed and it will be integrated in one of the next IB/CMSSW_14_0_X/master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @rappoccio, @sextonkennedy, @antoniovilela (and backports should be raised in the release meeting by the corresponding L2) |
However, cms-sw/cmssw#44650 needs to be merged before (or at the same time as) this PR. |
Will merge it with cms-sw/cmssw#44650, after passing an IB in master. |
+1 |
Build alpaka device code for the CUDA and ROCm back-ends in "GPU only" mode. In this mode, functions marked as ALPAKA_FN_ACC are marked as
__device__
functions, and are compiled only for the corresponding GPU device back-ends.Currently, functions marked as ALPAKA_FN_ACC are marked as
__host__ __device__
functions, and may be compiled for both device and host back-ends. The latter leads to linker errors in kernels that use device symbols likethreadIdx
,blockIdx
, etc. that are compiled for the ROCm back-end.No impact on the HLT performance, as expected.
CMSSW_14_0_4
:CMSSW_14_0_4
with cms-sw/cmssw#44650 and #9125:Backport #9121 to CMSSW 14.0.x for data taking.