You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Given the option to enable the ASYNC allocator (#3447), UCX in this mode will currently fail when attempting to use nv_peer_mem. The reason is that memory allocated with cudaMallocAsync does not support GPUDirect RDMA, and the pool we go to allocate memory from would be using this allocator exclusively.
When rapidsai/cudf#9270 goes in, we should be able to reserve the bounce buffer amount from the RMM pool, and allocate directly, bypassing RMM. This would allow us to use nv_peer_mem and UCX as we do today with other RMM allocators.
The text was updated successfully, but these errors were encountered:
Depends on: rapidsai/cudf#9270
Given the option to enable the ASYNC allocator (#3447), UCX in this mode will currently fail when attempting to use
nv_peer_mem
. The reason is that memory allocated withcudaMallocAsync
does not support GPUDirect RDMA, and the pool we go to allocate memory from would be using this allocator exclusively.When rapidsai/cudf#9270 goes in, we should be able to reserve the bounce buffer amount from the RMM pool, and allocate directly, bypassing RMM. This would allow us to use
nv_peer_mem
and UCX as we do today with other RMM allocators.The text was updated successfully, but these errors were encountered: