Rebuild for CUDA 12 #148

regro-cf-autotick-bot · 2024-06-14T23:53:57Z

This PR has been triggered in an effort to update cuda120.

Notes and instructions for merging this PR:

Please merge the PR only after the tests have passed.
Feel free to push to the bot's branch to update this PR if needed.

Please note that if you close this PR we presume that the feedstock has been rebuilt, so if you are going to perform the rebuild yourself, don't close this PR until your rebuild has been merged.

Here are some more details about this specific migrator:

The transition to CUDA 12 SDK includes new packages for all CUDA libraries and
build tools. Notably, the cudatoolkit package no longer exists, and packages
should depend directly on the specific CUDA libraries (libcublas, libcusolver,
etc.) as needed. For an in-depth overview of the changes and to report problems,
[see this issue](conda-forge/conda-forge.github.io#1963).
Please feel free to raise any issues encountered there. Thank you! 🙏

If this PR was opened in error or needs to be updated please add the bot-rerun label to this PR. The bot will close this PR and schedule another one. If you do not have permissions to add this label, you can use the phrase @conda-forge-admin, please rerun bot in a PR comment to have the conda-forge-admin add it for you.

This PR was created by the regro-cf-autotick-bot. The regro-cf-autotick-bot is a service to automatically track the dependency graph, migrate packages, and propose package version updates for conda-forge. Feel free to drop us a line if there are any issues! This PR was generated by - please use this URL for debugging.

conda-forge-webservices · 2024-06-14T23:54:05Z

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe) and found it was in an excellent condition.

h-vetinari · 2024-08-09T04:59:39Z

@conda-forge-admin, please rerender

conda-forge-webservices · 2024-08-09T05:02:17Z

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe/meta.yaml) and found it was in an excellent condition.

h-vetinari · 2024-08-09T05:31:37Z

@conda-forge-admin, please rerender

h-vetinari · 2024-08-09T08:05:49Z

Haven't seen such a failure before:

[ 48%] Building NVCC (Device) object AmberTools/src/quick/src/libxc/maple2c_device/CMakeFiles/xc_cuda.dir/__/__/cuda/xc_cuda_generated_gpu_getxc.cu.o
sh: cicc: command not found
CMake Error at xc_cuda_generated_gpu_getxc.cu.o.Release.cmake:278 (message):
  Error generating file
  /home/conda/feedstock_root/build_artifacts/ambertools_1723188819308/work/build/AmberTools/src/quick/src/libxc/maple2c_device/CMakeFiles/xc_cuda.dir/__/__/cuda/./xc_cuda_generated_gpu_getxc.cu.o

Looks like this is a known issue and we need to point to $PREFIX/nvvm/bin/cicc
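A minimal sketch of the workaround described above, assuming it is applied from the recipe's build script and that cicc really sits under nvvm/bin of the prefix, as the comment suggests. The helper name prepend_cicc_dir and the fallback path are hypothetical, used only for illustration:

```shell
# Hypothetical helper: put the directory that should contain cicc at the front of PATH.
# Assumption: conda-build exports PREFIX; the fallback path exists only so the
# sketch can be run outside of a build.
prepend_cicc_dir() {
    # $1: prefix expected to contain nvvm/bin/cicc
    cicc_dir="$1/nvvm/bin"
    case ":${PATH}:" in
        *":${cicc_dir}:"*) ;;                 # already on PATH, nothing to do
        *) PATH="${cicc_dir}:${PATH}" ;;      # otherwise prepend it
    esac
    export PATH
}

prepend_cicc_dir "${PREFIX:-/tmp/example-prefix}"
```

Calling the helper a second time leaves PATH unchanged, so it is safe to run from more than one place in a build script.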

mattwthompson · 2024-08-09T14:54:47Z

I gather from the diff that the changes here cover Windows but Windows support isn't added?

h-vetinari · 2024-08-09T19:48:38Z

> I gather from the diff that the changes here cover Windows but Windows support isn't added?

The migrator would be adding CUDA 12.0 builds on Windows, if Windows weren't skipped completely here. That's okay though, it's just the default title of PRs opened by this migrator. Actual Windows enablement should be done independently from this PR.

h-vetinari · 2024-08-14T10:27:08Z

OK, moved past the cicc issue, now getting:

[ 58%] Building NVCC intermediate link file AmberTools/src/quick/src/libxc/maple2c_device/CMakeFiles/xc_cuda.dir/xc_cuda_intermediate_link.o
cc1plus: fatal error: $BUILD_PREFIX/targets/x86_64-linux/bin/crt/link.stub: No such file or directory
compilation terminated.

I cannot tell from the recipe where things would refer to link.stub; I'm presuming this should be part of the cuda-nvcc setup, but there, that stub is under $BUILD_PREFIX/bin/crt/link.stub. Could something still be configured incorrectly @conda-forge/cuda?

jakirkham · 2024-08-27T20:49:30Z

Sorry for the slow reply here Axel

Discussed this with my colleagues today

When we have seen similar issues before, they have tended to trace back to using the legacy CMake find_package(CUDA), which is deprecated. In these cases, libraries are recommended to move to adding the CUDA language where appropriate and start using find_package(CUDAToolkit REQUIRED) to pick up any CTK contents for linking into relevant artifacts. It's possible that other steps may need to be taken as well ( scopetools/cudadecon#29 (comment) )

Am not entirely sure the right place to look at the source code for ambertools, but was able to find the Amber-MD GitHub org, which references the webpage ( https://ambermd.org/ ) used in downloads here

ambertools-feedstock/recipe/meta.yaml

Line 15 in 725d666

url: https://ambermd.org/downloads/AmberTools23_rc6.tar.bz2

Looking in that org do see usage of find_package(CUDA). So think the first step would be for ambertools to complete this upgrade

As an interesting note did see this comment in that ambertools code:

# With CMake 3.7, FindCUDA.cmake crashes when crosscompiling.

if(CROSSCOMPILE)
	message(STATUS "CUDA disabled when crosscompiling.")
	set(CUDA FALSE)
else()

One of the things the CMake team solved by adding the CUDA language and find_package(CUDAToolkit) was cross-compilation support with CUDA. In fact this was done with an eye toward working in Conda with the Conda compilers

Think to move this forward, would recommend working with upstream to adopt these changes. Possibly the build here can be patched to use those upstream changes (though it may be simpler to update to a new release with the build fixes)
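As a rough illustration of the migration recommended above, the following sketch writes a minimal CMakeLists.txt that enables the CUDA language and uses find_package(CUDAToolkit REQUIRED) in place of the legacy module. The project name, target name, and kernels.cu are hypothetical and not taken from the AmberTools sources; this is not the actual patch:

```shell
# Write a minimal, hypothetical CMakeLists.txt into a scratch directory.
demo_dir=$(mktemp -d)

cat > "${demo_dir}/CMakeLists.txt" <<'EOF'
cmake_minimum_required(VERSION 3.18)
project(cuda_migration_example LANGUAGES CXX)

# Legacy approach (deprecated FindCUDA module):
#   find_package(CUDA REQUIRED)
#   cuda_add_library(example_kernels kernels.cu)

# Replacement: treat CUDA as a first-class language and locate the toolkit.
enable_language(CUDA)
find_package(CUDAToolkit REQUIRED)

# .cu sources are now compiled by the CUDA language support directly.
add_library(example_kernels STATIC kernels.cu)
target_link_libraries(example_kernels PRIVATE CUDA::cudart)
EOF

echo "wrote ${demo_dir}/CMakeLists.txt"
```

CUDA::cudart is one of the imported targets provided by find_package(CUDAToolkit); other toolkit libraries would be linked the same way through their own imported targets.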

cc @robertmaynard @bdice (for awareness & in case revisions to the above are needed)

mattwthompson · 2024-08-27T20:58:12Z

As far as I recall, the canonical source code is non-public and on GitLab. But it's several projects stapled together, including cpptraj which is hosted here, so what you found there occurs at least once (probably several times).

cc: @dacase who is likely the best person to coordinate making any needed changes away from deprecated calls

h-vetinari · 2024-08-27T23:24:39Z

Thanks for the analysis John!

h-vetinari · 2024-08-27T23:43:23Z

So I downloaded the tarball (man there's a lot of stuff in there; a cool 3GB when unpacked, and a mass of vendored bits), and searched for the occurrences of find_package(CUDA:

>findstr /L /S /N /C:"find_package(CUDA" *.*
AmberTools\src\cpptraj\cmake-cpptraj\CudaConfig.cmake:11:       find_package(CUDA)
AmberTools\src\quick\cmake\CudaConfig.cmake:11: find_package(CUDA)
AmberTools\src\quick\quick-cmake\QUICKCudaConfig.cmake:11:    find_package(CUDA REQUIRED)
cmake\CudaConfig.cmake:11:      find_package(CUDA)

Given that there are only 4, this sounds quite patchable.
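The search above uses the Windows findstr tool. On Linux or macOS, roughly the same search can be sketched with grep. The helper name find_legacy_cuda_calls is hypothetical:

```shell
# Hypothetical helper: list every occurrence of "find_package(CUDA" under a tree.
#   -r  recurse into subdirectories
#   -n  print line numbers
#   -F  treat the pattern as a fixed string, so "(" needs no escaping
find_legacy_cuda_calls() {
    grep -rnF 'find_package(CUDA' "${1:-.}"
}
```

Run from the unpacked source directory as `find_legacy_cuda_calls .`. Like the findstr call, the pattern has no closing parenthesis, so it would also match find_package(CUDAToolkit ...) once files have been migrated.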

dacase · 2024-08-28T03:22:10Z

The "canonical source code" is actually public, and available here:
https://ambermd.org/downloads/AmberTools24_rc5.tar.bz2

[It has an odd file name just because we encourage most folks to download via a web link, so that we can monitor total usage. But it's all open source, and the link above was actually made to be used by conda builds, among other purposes.]

I'm not sure how @jakirkham (above) found the corresponding link to AmberTools23, but that is outdated.

If it is indeed the case that find_package(CUDAToolkit REQUIRED) helps, we can easily make that change to the source code. I'm completely out of bounds in thinking about what is required for Windows support.

jakirkham · 2024-08-28T03:42:59Z

Thanks David! 🙏

Does this live in a source controlled repo somewhere (GitHub, GitLab, Bitbucket, etc.)?

The version of the file is just coming from the recipe. It looks like Matt started a PR to update ( #141 ). Though appears that ran into build issues

AIUI this feedstock doesn't build Windows ( #148 (comment) ). So that is not relevant

In any event, yes, migrating to find_package(CUDAToolkit REQUIRED) would be quite helpful for just getting the build with CUDA 12+ working here

It would also help when building via cross-compilation for other Linux architectures (like ARM)

jakirkham · 2024-08-28T04:17:52Z

Am renaming the PR to avoid further confusion. Hope that is ok

h-vetinari · 2024-08-28T07:44:56Z

Well, I'm several patches deep into trying to make this work, and I think I'm hitting a CMake bug.

Surely it would be better to do less hacky changes in AmberTools upstream; I was mainly trying to see what would be necessary to unblock the build and tried to keep patching ~minimal, at least conceptually (feel free to pick up anything, though these were not really written with being upstreamed in mind - not least because there's no public repo to contribute to - but rather as the most immediately necessary fixes to overcome the failures here).

mattwthompson · 2024-08-28T13:32:49Z

Just to avoid any confusion - this PR would have to be for AmberTools 23 until #141 or a similar build is complete, so using the AmberTools23_rc6.tar.bz2 blob is the only option. Since building AmberTools 24 is stalled, updating it for these CUDA changes can't happen with that version.

jakirkham · 2024-08-28T21:21:00Z

Am deeply impressed by the amount of effort you spent patching here Axel! 🙏

Subscribed to that issue. Though it looks like my colleague Rob already replied to you over there. Agree with him we likely need enable_language

That all being said, agree this is work probably best taken on upstream. Think the other pieces you included here are a good starting point for anyone wanting to push this forward

Agreed Matt. Was trying to capture that in my comments above. Apologies if that was too muddled with other details

h-vetinari · 2024-08-29T02:34:07Z

> Just to avoid any confusion - this PR would have to be for AmberTools 23 until #141 or a similar build is complete, so using the AmberTools23_rc6.tar.bz2 blob is the only option.

This is what I've been doing, the sources are unchanged in this PR.

mattwthompson · 2024-08-29T14:37:27Z

👍 yep just wanted to be sure we were all on the same page, that comment was mostly to explain to David why this is being applied to 23, not 24

h-vetinari · 2024-08-30T00:29:27Z

Well, I got things to build, but then ran into:

$SRC_DIR/AmberTools/src/quick/src/cuda/gpu_getxc.h: error: no instance of overloaded function "atomicAdd" matches the argument list

jakirkham · 2024-08-30T00:58:43Z

Does the header in question have #include <cooperative_groups.h>?

That seems like the kind of thing we would need. It is also covered in this blogpost

Also worth noting this header lives in cuda-cudart-dev_{{ target_platform }}, which {{ compiler("cuda") }} pulls in as a dependency. So it should be available

h-vetinari

> Does the header in question have #include <cooperative_groups.h>?

Actually, looking at the source code, it does something like this:

#ifdef USE_LEGACY_ATOMICS
      QUICKULL val1 = (QUICKULL) (fabs( _tmp * OSCALE) + (QUICKDouble)0.5);
      if ( _tmp * weight < (QUICKDouble)0.0)
          val1 = 0ull - val1;
      QUICKADD(devSim_dft.DFT_calculated[0].Eelxc, val1);
#else
      atomicAdd(&devSim_dft.DFT_calculated[0].Eelxc, _tmp);
#endif

The header is missing though, so realistically only the USE_LEGACY_ATOMICS branch has a chance of working.

recipe/patches/0002-rely-on-DCMAKE_CUDA_ARCHITECTURES.patch

h-vetinari · 2024-10-10T08:10:37Z

@conda-forge-admin, please rerender

mikemhenry · 2024-10-11T23:25:30Z

[ 57%] Building CUDA object AmberTools/src/quick/src/libxc/maple2c_device/CMakeFiles/xc_cuda.dir/gga_c_am05.cu.o
/bin/sh: -c: line 0: syntax error near unexpected token `;'
/bin/sh: -c: line 0: `cd /home/conda/feedstock_root/build_artifacts/ambertools_1728550220697/work/build/AmberTools/src/quick/src/libxc/maple2c_device && /home/conda/feedstock_root/build_artifacts/ambertools_1728550220697/_build_env/bin/nvcc -forward-unknown-to-host-compiler -DCEW -DCEW_USE_DLL -DCUDA -DGNU --options-file CMakeFiles/xc_cuda.dir/includes_CUDA.rsp -Wno-deprecated-gpu-targets;-Wno-deprecated-declarations;-DUSE_LEGACY_ATOMICS;-O2;AmberTools/src/quick/src/libxc/maple2c_device/CMakeFiles/xc_cuda.dir/compiler_depend.tsAmberTools/src/quick/src/libxc/maple2c_device/CMakeFiles/xc_cuda.dir/compiler_depend.tsCONFIG:Debug>:-g>;-use_fast_math;--compiler-options;-fPIC -O3 -DNDEBUG -std=c++11 "--generate-code=arch=compute_35,code=[sm_35]" "--generate-code=arch=compute_53,code=[sm_53]" "--generate-code=arch=compute_62,code=[sm_62]" "--generate-code=arch=compute_72,code=[sm_72]" "--generate-code=arch=compute_75,code=[sm_75]" "--generate-code=arch=compute_80,code=[sm_80]" "--generate-code=arch=compute_86,code=[sm_86]" "--generate-code=arch=compute_89,code=[compute_89,sm_89]" -Xptxas --disable-optimizer-constants -I/home/conda/feedstock_root/build_artifacts/ambertools_1728550220697/work/AmberTools/src/quick/src/libxc/maple2c_device/.. -MD -MT AmberTools/src/quick/src/libxc/maple2c_device/CMakeFiles/xc_cuda.dir/gga_c_am05.cu.o -MF CMakeFiles/xc_cuda.dir/gga_c_am05.cu.o.d -x cu -c /home/conda/feedstock_root/build_artifacts/ambertools_1728550220697/work/AmberTools/src/quick/src/libxc/maple2c_device/gga_c_am05.cu -o CMakeFiles/xc_cuda.dir/gga_c_am05.cu.o'
make[2]: *** [AmberTools/src/quick/src/libxc/maple2c_device/CMakeFiles/xc_cuda.dir/build.make:77: AmberTools/src/quick/src/libxc/maple2c_device/CMakeFiles/xc_cuda.dir/gga_c_am05.cu.o] Error 1
make[1]: *** [CMakeFiles/Makefile2:7267: AmberTools/src/quick/src/libxc/maple2c_device/CMakeFiles/xc_cuda.dir/all] Error 2
make: *** [Makefile:156: all] Error 2

looks like there is an extra or missing ; somewhere
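One possible reading of the log above, offered as an assumption rather than a diagnosis: the flags are joined with ';' the way a CMake list is, and the shell treats each ';' as a command terminator. The sketch below shows the failure mode in isolation by turning such a list into a space-separated string. The variable names are hypothetical, and the flags are copied from the failing command line:

```shell
# Flags as they appear in the failing command: joined by ';' like a CMake list.
flags_list='-Wno-deprecated-gpu-targets;-Wno-deprecated-declarations;-DUSE_LEGACY_ATOMICS;-O2'

# Replace every ';' with a space so the shell sees ordinary arguments
# instead of several commands separated by ';'.
flags_string=$(printf '%s' "$flags_list" | tr ';' ' ')

echo "$flags_string"
```

Inside CMake itself, the analogous conversion can be done with string(REPLACE ";" " " ...) before the value is stored in a string-valued flags variable.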

regro-cf-autotick-bot requested review from dacase, j-wags, jaimergp, mattwthompson, mikemhenry, simonbray and swails as code owners June 14, 2024 23:53

h-vetinari added 3 commits August 9, 2024 17:35

add CUDA 12 deps

356e5b2

don't store build artefacts

9f0e1f3

MNT: Re-rendered with conda-build 24.7.1, conda-smithy 3.37.2, and conda-forge-pinning 2024.08.09.05.46.50

46c3957

h-vetinari force-pushed the rebuild-cuda120-0-3_h86dae8 branch from cd7f4fc to 46c3957 Compare August 9, 2024 06:38

CUDA 12 also needs cuda-nvtx-dev

e9c0768

h-vetinari mentioned this pull request Aug 9, 2024

Close CUDA 12 migration (redux) conda-forge/conda-forge-pinning-feedstock#6263

Merged

17 tasks

add cicc to PATH on CUDA builds

7f37f2f

h-vetinari force-pushed the rebuild-cuda120-0-3_h86dae8 branch from ad43279 to 7f37f2f Compare August 14, 2024 08:42

h-vetinari mentioned this pull request Aug 22, 2024

BUG: compilation tries to link with link.stub in wrong location conda-forge/cuda-nvcc-feedstock#51

Open

switch to find_package(CUDAToolkit)

41b266e

h-vetinari force-pushed the rebuild-cuda120-0-3_h86dae8 branch from 6f070d5 to 0f5e3cb Compare August 28, 2024 03:11

replace functions that only work with find_package(CUDA)

504b0a8

h-vetinari force-pushed the rebuild-cuda120-0-3_h86dae8 branch from 0f5e3cb to 504b0a8 Compare August 28, 2024 03:43

jakirkham changed the title ~~Rebuild for CUDA 12 w/arch + Windows support~~ Rebuild for CUDA 12 Aug 28, 2024

of course the signature changes too...

3b4686d

h-vetinari force-pushed the rebuild-cuda120-0-3_h86dae8 branch from 95560e0 to 3b4686d Compare August 28, 2024 04:31

h-vetinari added 2 commits August 28, 2024 15:52

avoid CMake error due to lack of language

402e82a

handle linker_language differently

c01fe85

h-vetinari force-pushed the rebuild-cuda120-0-3_h86dae8 branch from 1e0c112 to c01fe85 Compare August 28, 2024 07:06

use enable_language(CUDA) instead of messing with LINKER_LANGUAGE

be1d139

h-vetinari added 4 commits August 30, 2024 09:02

move enable_language(CUDA) to project top-level

54570ab

set enable_language(CUDA) in top-level CMakeLists.txt

6a2f6a3

while languages usually get defined around where the (sub)project has its own CMakeLists.txt, this still doesn't work, so move it to the very top

add it back to subprojects after all?

b8408f1

remove all doubts

ee9bf42

h-vetinari reviewed Aug 30, 2024

recipe/patches/0002-rely-on-DCMAKE_CUDA_ARCHITECTURES.patch (outdated)

handle CUDA_NVCC_FLAGS -> CMAKE_CUDA_FLAGS

6044e7c

h-vetinari mentioned this pull request Oct 10, 2024

Rebuild for libboost 1.86 #150

Open

MNT: Re-rendered with conda-build 24.9.0, conda-smithy 3.42.0, and conda-forge-pinning 2024.10.10.05.31.44

28e9d23
