-
-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Initial CUDA 11.3 Conda Packages #62
Comments
Just to add the ARM packages are for SBSA. So will not work on Jetson for example |
Nice, I'll have a look now. First thing I notice, the version strings are sometimes different across components:
I see 11.1, 11.3, 11.4, 10.4 (?), 10.2... It'd be nice to see the underlying recipe(s). I guess they are included in the metadata, but it would be more convenient to have a Gist or something like that, if possible. |
The dependency tree, if anybody is curious:
|
Thanks Jaime! Would be great to have your feedback 😄 Also cc-ing @h-vetinari who may also be interested in these and have thoughts 🙂 |
These are multiple separate recipes. So it makes sense that these differ. These would probably live as different feedstocks unless there is a reason to do something different |
How many feedstocks would that be? This includes several packages and a handful of metapackages as well, which might be difficult to maintain and keep in sync. I don't know if it would be better to have them all in the same recipe, with multiple outputs, or if that would get extra complicated (I need to take a look at the recipes to see how bad the situation would get in a single meta.yaml). The current split makes sense to me. I like that users can choose different types of granularity. It'd be nice to have the Also, is |
AIUI combining them into a single recipe would be complicated. Is that still needed with CUDA Enhanced Compatibility? My understanding is all of this would be available to conda-forge. I don't know about general redistributability. |
Having I guess we'll have to do with several feedstocks, and have |
I am trying out the packages to build OpenMM locally (maybe I'll put a PR to showcase how it's working once I get there). One thing I noticed is that |
Ok, so I have this working with OpenMM / 11.3 / Linux-64! Check conda-forge/openmm-feedstock#60. It's failing only because I am using A few notes:
So, in short, the feedback for now is super positive (at least for OpenMM), but:
I'd recommend that CUDA-heavier projects take this example and adapt it in their feedstocks in a WIP PR to confirm my findings. |
Trying out Windows now. Initial feedback below:
|
One more thought, looks like CMake needs both |
It might make sense to do away with building against every minor version at the same time we move to these packages and use CUDA Enhanced Compatibility instead. This will cutdown the amount of rebuilding we need to do while still supporting CUDA 11.0+ |
cc @seibert |
@jakirkham Is there anything else you want me test/try here? |
This comment has been minimized.
This comment has been minimized.
Ah, it's |
Hi @jakirkham, looks like the entire folder |
It seems CuPy can be compiled with the new package layout on linux64 and windows (conda-forge/cupy-feedstock#143). The steps I took is similar to @jaimergp's tests for OpenMM but with some differences. Here it is: For CBC:
For meta.yaml:
[1]: On Windows, nvcc for some reason picks up the host compiler correctly, but I think it's a bit suspicious considering the CI setup might do something when installing CUDA in the CI runner... [2]: I had a setup like this build:
- ...
- cuda-nvcc
host:
- cuda-nvcc
- cuda-cudart
- ... but apparently the nvcc from build's [3]: Currently there's no version pinning of each CUDA components to its parent CTK version. So, for example, when I request Note: I haven't checked if all CUDA components have the correct cc: @kmaehashi @jakirkham |
Confirmed |
So to summarize what are the changes we would like to see here? Could we come up with a bullet point list? |
Hey all, is there any updates or plans on upstreaming the new packages that are being produced in the |
@jaimergp would you have time to look at the 11.6 packages on the |
I'll try next week! |
@jaimergp have you had a chance to look? If not, no worries 🙂 |
I opened a PR on the openmm-feedstock as usual: conda-forge/openmm-feedstock#69 It's just the same we had for 11.5, but using 11.6. Let's see if we can clean it up! |
@jakirkham, I have just randomly bounced into this thread. I have been using the cuda packages from the nvidia channel for a little bit but recently (yesterday) the 11.6.0 subchannel (label) packages broke. I wrote this post on the nvidia developer forum: https://forums.developer.nvidia.com/t/conda-packages-from-the-nvidia-label-cuda-11-6-0-failing-to-install/204281 but I am not actually sure anybody is picking it up. Where is the right place to provide feedback on these packages? |
@hansenms Keeping that discussion in that developer forum seems like the right choice atm This thread is about integrating those packages into conda-forge as well as periodic testing to confirm issues raised previously have been addressed |
@jakirkham I will see if somebody has thoughts or a response on the developer forum. If you have any contacts you can direct there, it would be much appreciated. I would at least like to understand it a bit better. Thanks. |
I've already asked someone internally to take a look 🙂 |
@jakirkham, I hate to ping you on this again, but nobody has responded on that thread on the nvidia developer forum. Is there a better way to reach the team that make the conda packages in the nvidia channel? |
@jakirkham not sure if there is something that you could do, but I would really like to know where to post an issue/bug/problem with the official nvidia conda packages if it is not on the developer forum (where there is no response) and not here. Any good suggestions for where to raise the issue would be much appreciated. |
@jakirkham the problem mentioned above just happened overnight again for the cuda 11.6.1 packages. It is just massively disruptive that these packages keep getting broken overnight and nobody is responding to the issue raised on the nvidia developer forum. Where can one go to raise an issue about these packages? |
Looks like the old packages are removed whenever a new package is uploaded. |
@leofang yes, but that is massively disruptive. And nvidia's own instructions are broken, so for instance:
Will not work because they removed the samples package and other packages in that label. |
@jakirkham let's get this done 🚀 🌕 🙌 💎 boom boom, buzz buzz |
Is there anything non-nvidia people can do to help get this done? Are there any blockers or can we get an update on the status of this? Thanks all for the great work! |
Another gentle nudge to get this going. The JAX package requires ptxas and without it, our hard work and efforts to have cuda-enabled builds are incomplete. (Likewise for newer optimizations in tensorflow btw.) It would be really good to push this through. Please let us know how we can help. Or if there is anyway we can do to get ptxas added to the cudatoolkit package. @conda-forge/core, is there anything we can do to push this forward? |
As we are now doing this work and it is being tracked in issue ( conda-forge/staged-recipes#21382 ), closing this issue in favor of discussion there and reviews of PRs linked from that staged-recipes issue. Thanks everyone! 🙏 |
We have published some new Conda packages for public consumption. These contain the redistributable libraries, compilers, profiling tools, etc. Also they are currently in the
nvidia
channel ( https://anaconda.org/nvidia ). To get started one can just runconda install -c nvidia cuda=11.3
. This would include everything thatcudatoolkit
contains today.We would like to collect some feedback from the community here to inform how we package these going forward. Once we are more sure these fill the needs here, we can follow up on integrating them into conda-forge.
The text was updated successfully, but these errors were encountered: