Building cuda aware openMPI does not seem to work #12334
Comments
Adding this is not mentioned in the docs at https://www.open-mpi.org/faq/?category=buildcuda, but it is supposedly necessary according to #12264 |
Check out https://docs.open-mpi.org/en/v5.0.x/tuning-apps/networking/cuda.html#how-do-i-build-open-mpi-with-cuda-aware-support. These docs aren't the greatest, but I think they should solve your issue. FWIW, @janjust is looking at improving the situation so that |
In this linked article there seem to be some conflicts:
and
show different methods. For me, the first is what I see when I build, but not the second, so I will just test the binaries to see whether it works or not |
@janjust @hppritcha FYI -- might want to make these docs better / more clear. |
Related #12137 |
Testing out the attempted CUDA-aware libmpi, I get a segfault
This same code does work with the Open MPI ROCm build produced by the build script here: https://github.com/PHARCHIVE/phare-mpi/blob/ompi/phare-mpi-cuda/build.sh
|
OK, I got it working. It appears that the must be and not |
Thanks. Passing the flag The macro
The impl in C++:
But when I run the command to check: |
maybe have a look at my script https://github.com/PHARCHIVE/phare-mpi/blob/master/phare-mpi-cuda/build.sh |
@shoveller86 what if you can you also compress and share your |
Building Open MPI 5.0.2 from https://download.open-mpi.org/release/open-mpi/v5.0/openmpi-5.0.2.tar.bz2
with UCX v1.15.0 from https://github.com/openucx/ucx,
following the guide at https://www.open-mpi.org/faq/?category=buildcuda
but without gdrcopy,
always results in CUDA support showing as false, yet the list of extension modules includes cuda
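For anyone who wants to script the check described above, here is a minimal sketch in Python. It assumes the `ompi_info --parsable --all` invocation and the `mca:mpi:base:param:mpi_built_with_cuda_support:value:true` output line shown in the Open MPI CUDA FAQ; the exact output may differ between Open MPI versions. The helper names `parse_cuda_support` and `query_ompi_info` are hypothetical and are not part of Open MPI.

```python
import subprocess

# Substring that identifies the relevant line in `ompi_info --parsable --all`
# output (format assumed from the Open MPI CUDA FAQ).
CUDA_PARAM = "mpi_built_with_cuda_support:value:"


def parse_cuda_support(parsable_output):
    """Return True or False from the mpi_built_with_cuda_support line,
    or None if no such line is present in the given text."""
    for line in parsable_output.splitlines():
        if CUDA_PARAM in line:
            # The value is the last colon-separated field on the line.
            value = line.rsplit(":", 1)[1].strip().lower()
            return value == "true"
    return None


def query_ompi_info(ompi_info="ompi_info"):
    """Run ompi_info (assumed to be on PATH) and parse its output.
    Raises FileNotFoundError if the binary cannot be found."""
    result = subprocess.run(
        [ompi_info, "--parsable", "--all"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_cuda_support(result.stdout)
```

Calling `query_ompi_info("/path/to/install/bin/ompi_info")` against a specific install prefix helps rule out picking up a different, non-CUDA `ompi_info` from `PATH`. A `None` result means the parameter line was not found at all, which is a different situation from a build that reports `false`.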