-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
EC(wISC)30to60 performance tests are failing on Perlmutter and Chicoma #497
Comments
Note: On perlmutter use the head of compass. On chicoma, use the xylar/add_chicoma-cpu branch |
Let's see if E3SM-Project/mache#100 happens to fix this as a first change. We should be able to test this by just adding:
manually to the load script. |
At this point, I'm not seeing the PIO error but the EC test is jub hanging on Chicoma. |
@mark-petersen, as I test #555, this and the probably related issue #500 are really giving me trouble. I could use some help debugging them. In every case that I'm seeing these issues, it's with Gnu compilers (not sure if that's a coincidence or not). It shows up in PIO in some cases and just as hanging in others. |
This issue makes the |
The latest example of this on Perlmutter can be found at:
|
I think this issue is the same as #500. I just fixed the hang with E3SM-Project/E3SM#5575. We can retest the |
As I commented here E3SM-Project/E3SM#5575 (comment), unfortunately, I don't think that branch has fixed this problem, although it does seem to have fixed #500. |
The |
After the recent module changes on Perlmutter and Chicoma, I'm seeing PIO errors but only for the EC performance tests:
This is on all cores except 0000.
See:
I tried changing the PIO layout but that didn't make a difference. More debugging is needed.
The text was updated successfully, but these errors were encountered: