-
Notifications
You must be signed in to change notification settings - Fork 139
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The stress calculation is OOM when set kpar > 1 #4432
Comments
As we can see from the results, even using kpar can speed up the calculation of cg with less memory, the time cost is still much longer than that by dav. The speed up is about 10% for kpar is from 1 to 2 in cg. |
I also do the test on QE for example mp-1067451. CG:
DAV:
|
I set the pw_daig_ndim to 2 in ABAUCS and do the kpar test on mp-1067451 with a larger machine c64_m520_cpu (mpi parallel with 32 cores). It doesn't seem like the larger the kpar, the higher the efficiency. As the kpar larger, the memory cost is larger. For this case, kpar=1/2 can finish the SCF/FORCE/STRESS calculation, but kpar=4/6 can only finish SCF/FORCE calculation, and the memory for STRESS calculation is larger than 520G. The memory need by STRESS seems about 1.5 times to SCF/FORCE calculation.
|
The performance may not be tested by one example, and the performance of c64_m520_cpu is unstable. I have rerun the mp-1067451-new/00002, and this time the time cost of first two SCF steps are 3427.07 and 764.52 s, which is faster than previous test. |
@pxlxingliang plz double-check it after PR #4047. |
Describe the Testing Issue
I have tested two alloy cases with different kpar (1 2 4 8) on 16 cores 256 G cpu machine.
Both cg and dav methods are tested.
For dav, because of the large memory of sub-space matrix, the calculation of SCF is OOM when kpar > 1.
For cg, when kpar > 4, the SCF calculation is OOM, and when kpar is 1/2/4, the SCF is calculated normal but the calculation of stress is OOM.
The OOM in stress calculation when kpar > 2 seems abnormal.
Additional Context
No response
Task list for Issue attackers (only for developers)
The text was updated successfully, but these errors were encountered: