You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This failed for both a smaller Graph500 graph and even Karate Club on a MG system with 80+GB per GPU. A minimal repro script is attached to be run on a >1 GPU machine. The output is shown below for Karate Club:
The above was also attempted using the MG APIs in a single GPU environment (using CUDA_VISIBLE_DEVICES=0) but that did not reproduce the issue, so it seems to require >1 GPU.
The text was updated successfully, but these errors were encountered:
A customer identified an issue trying to run Jaccard. In MG calls they were seeing failed memory allocation calls.
Vertices were being shuffled incorrectly in the C API, so we were getting vertices processed on the wrong GPU, resulting in out-of-bounds memory references.
Moved the shuffle before renumbering, which puts vertices to be on proper GPU
Closes#3746
Authors:
- Chuck Hastings (https://github.com/ChuckHastings)
Approvers:
- Seunghwa Kang (https://github.com/seunghwak)
URL: #3758
This failed for both a smaller Graph500 graph and even Karate Club on a MG system with 80+GB per GPU. A minimal repro script is attached to be run on a >1 GPU machine. The output is shown below for Karate Club:
repro py script
The above was also attempted using the MG APIs in a single GPU environment (using
CUDA_VISIBLE_DEVICES=0
) but that did not reproduce the issue, so it seems to require >1 GPU.The text was updated successfully, but these errors were encountered: