With the advent of NVLink, getting the correct node topology is now more complicated than just ensuring correct NUMA placement of processes for their respective GPUs. For dense GPU systems with multiple NVLink connections, we need a method to easily map the local process ID to the GPU device ordinal. Failing to do so results in sub-optimal use of peer-to-peer bandwidth and in messages being unnecessarily routed through CPU memory. A naive mapping looks something like the sketch below.
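For illustration, here is a minimal sketch of the usual topology-unaware approach: each local MPI rank picks a device by round-robin on its local rank. It assumes Open MPI's `OMPI_COMM_WORLD_LOCAL_RANK` environment variable (other launchers expose equivalents such as `SLURM_LOCALID`); the point is that the resulting ordinal need not match the NUMA/NVLink topology.

```c
/* Sketch: bind each local MPI rank to a GPU by ordinal.
 * Assumes Open MPI's OMPI_COMM_WORLD_LOCAL_RANK; other launchers
 * provide equivalents (e.g. SLURM_LOCALID). This round-robin
 * mapping is topology-unaware, which is the problem raised here. */
#include <stdio.h>
#include <stdlib.h>
#include <cuda_runtime.h>

int main(void)
{
    const char *s = getenv("OMPI_COMM_WORLD_LOCAL_RANK");
    int local_rank = s ? atoi(s) : 0;

    int ndev = 0;
    cudaGetDeviceCount(&ndev);

    /* Local rank i takes device i % ndev, regardless of which
     * GPUs actually share a NUMA node or NVLink with this rank. */
    cudaSetDevice(local_rank % ndev);

    printf("local rank %d -> device %d of %d\n",
           local_rank, local_rank % ndev, ndev);
    return 0;
}
```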
Closing this issue, as the required functionality is trivially obtained from the `CUDA_VISIBLE_DEVICES` environment variable (the order in which devices are listed in this variable corresponds to the order in which the CUDA runtime enumerates them).
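To make the remapping concrete, a small sketch using only standard CUDA runtime calls (`cudaGetDeviceCount`, `cudaDeviceGetPCIBusId`) can print which physical GPU each visible ordinal refers to. Running it as, say, `CUDA_VISIBLE_DEVICES=2,0 ./a.out` should show visible device 0 reporting the PCI bus ID of physical GPU 2.

```c
/* Sketch: print each visible device's PCI bus ID to confirm that
 * device ordinals follow the order given in CUDA_VISIBLE_DEVICES. */
#include <stdio.h>
#include <cuda_runtime.h>

int main(void)
{
    int ndev = 0;
    cudaGetDeviceCount(&ndev);
    for (int i = 0; i < ndev; ++i) {
        char bus[32];
        cudaDeviceGetPCIBusId(bus, sizeof bus, i);
        printf("visible device %d -> PCI %s\n", i, bus);
    }
    return 0;
}
```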