Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

gpu-margo-p2p-bw assertion on thetagpu #59

Open
carns opened this issue Nov 27, 2022 · 2 comments
Open

gpu-margo-p2p-bw assertion on thetagpu #59

carns opened this issue Nov 27, 2022 · 2 comments

Comments

@carns
Copy link
Collaborator

carns commented Nov 27, 2022

I can compile and run the thetagpu regression test with the latest version of spack after the updates in #58 and mochi-hpc-experiments/platform-configurations#18, but the test fails at runtime with the following:

gpu-margo-p2p-bw: ../perf-regression/gpu-margo-p2p-bw.cu:275: int main(int, char**): Assertion `ret == 0' failed.

(that indicates a failure of margo_bulk_create_attr()). I assume this is the libfabric problem where it looks like the libfabric library itself needs to be explicitly configured with gpu memory support?

@vchoi-hdfgroup @jhendersonHDF

@vchoi-hdfgroup
Copy link
Contributor

vchoi-hdfgroup commented Nov 27, 2022 via email

@carns
Copy link
Collaborator Author

carns commented Nov 28, 2022

Great, thanks! I'll leave this issue open for now just to track this topic; we can close it once the fix is in place here. I put a comment on that other PR; it would be great if you could contribute the new variant directly to the mochi-spack-packages repo.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants