Bug: several tests failed in code converge test #5192

Closed
16 tasks
kirk0830 opened this issue Sep 29, 2024 · 4 comments · Fixed by #5219 or #5469
Assignees
Labels
Compile & CICD & Docs & Dependencies Issues related to compiling ABACUS

Comments

@kirk0830
Collaborator

kirk0830 commented Sep 29, 2024

Describe the bug

Please see the action run https://github.com/deepmodeling/abacus-develop/actions/runs/11089849200/job/30811481597, in which several tests failed:

Warning: NG]    2 test cases out of 1499 failed.
1: 102_PW_PINT_UKS
1: 392_NO_GO_LR_HF
1: 
1: [ERROR     ] 21 test cases out of 1499 produced fatal error.
1: 101_PW_15_paw
1: 207_NO_KP_OXC
1: 260_NO_DJ_PK_PU_FM
1: 260_NO_DJ_PK_PU_SO
1: 260_NO_DJ_PK_PU_S1
1: 260_NO_DJ_PK_PU_AFM_URAMPING
1: 281_NO_KP_HSE
1: 281_NO_KP_HSE_symmetry
1: 282_NO_KP_HSE_complex
1: 283_NO_KP_HF
1: 284_NO_KP_PBE0
1: 285_NO_KP_RE_HSE
1: 286_NO_KP_CR_HSE
1: 304_NO_GO_AF_atommag
1: 381_NO_GO_S1_HSE
1: 382_NO_GO_S2_HSE
1: 383_NO_GO_SO_HSE
1: 384_NO_GO_S1_HSE_loop0_PU
1: 385_NO_GO_RE_S1_HSE
1: 386_NO_GO_MD_S1_HSE
1: 392_NO_GO_LR_HF

The following tests FAILED:
	  1 - integrated_test (Failed)
	114 - XCTest_PBE (Failed)
	115 - XCTest_HSE (Failed)
	116 - XCTest_PZ_SPN (Failed)
	157 - HSolver_bpcg (Failed)

This may be due to

1: -----AUTO TESTS OF ABACUS ------
1: ABACUS path: /__w/abacus-develop/abacus-develop/build/abacus
1: Number of processes: 4
1: Number of threads: 1
1: Test accuracy totenergy: 0.0000001 eV
1: Test accuracy force: 0.0001
1: Test accuracy stress: 0.001
1: Check accuaracy: 8
1: Test cases file: CASES_CPU.txt
1: Test cases regex: ^[^#].*_.*$
1: Generate reference: 
1: --------------------------------

which is a different run mode from the one used by the tests designed to be triggered by PRs or commits.
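To make the `Test cases regex` setting in the log above concrete, here is a minimal sketch of how such a pattern would filter a cases file. It assumes the pattern is applied line by line, which this issue does not show; `select_cases` and the sample lines are hypothetical, and only the pattern and the case names come from the log.

```python
import re

# Pattern copied from the "Test cases regex" line of the log.
CASE_PATTERN = re.compile(r"^[^#].*_.*$")


def select_cases(lines):
    """Return the lines the pattern accepts (hypothetical helper)."""
    stripped = (raw.strip() for raw in lines)
    return [line for line in stripped if CASE_PATTERN.match(line)]


# Hypothetical excerpt of a cases file: one case is commented out with '#'.
sample = [
    "102_PW_PINT_UKS",
    "#281_NO_KP_HSE",
    "",
    "392_NO_GO_LR_HF",
]

selected = select_cases(sample)
print(selected)
```

Under that assumption, commented-out and empty lines are skipped, and every other line containing an underscore is treated as a case to run.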

Expected behavior

No response

To Reproduce

No response

Environment

No response

Additional Context

No response

Task list for Issue attackers (only for developers)

  • Verify the issue is not a duplicate.
  • Describe the bug.
  • Steps to reproduce.
  • Expected behavior.
  • Error message.
  • Environment details.
  • Additional context.
  • Assign a priority level (low, medium, high, urgent).
  • Assign the issue to a team member.
  • Label the issue with relevant tags.
  • Identify possible related issues.
  • Create a unit test or automated test to reproduce the bug (if applicable).
  • Fix the bug.
  • Test the fix.
  • Update documentation (if necessary).
  • Close the issue and inform the reporter (if applicable).
@WHUweiqingzhou
Collaborator

WHUweiqingzhou commented Oct 24, 2024

As of https://github.com/deepmodeling/abacus-develop/actions/runs/11436455697/job/31813916721, the converge tests still fail.

Warning: NG]    1 test cases out of 1547 failed.
1: 102_PW_PINT_UKS
1: 
1: [ERROR     ] 5 test cases out of 1547 produced fatal error.
1: 260_NO_DJ_PK_PU_FM
1: 260_NO_DJ_PK_PU_SO
1: 260_NO_DJ_PK_PU_S1
1: 260_NO_DJ_PK_PU_AFM_URAMPING
1: 304_NO_GO_AF_atommag

The following tests FAILED:
1 - integrated_test (Failed)
165 - HSolver_bpcg (Failed)
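To see what changed between the two runs, the case names in the two log excerpts can be compared as sets. This is a minimal sketch: `failing_cases` is a hypothetical helper, and the two strings only reproduce the case-name lines quoted in this thread (failed and fatal-error cases merged), not the full CI logs.

```python
import re

# Matches log lines of the form "1: 260_NO_DJ_PK_PU_FM".
CASE_LINE = re.compile(r"^1: (\d{3}_\w+)$")


def failing_cases(log_text):
    """Collect the unique case names listed in a log excerpt."""
    names = set()
    for line in log_text.splitlines():
        match = CASE_LINE.match(line.strip())
        if match:
            names.add(match.group(1))
    return names


# Case lines from the Sep 29 comment.
RUN_SEP_29 = """
1: 102_PW_PINT_UKS
1: 392_NO_GO_LR_HF
1: 101_PW_15_paw
1: 207_NO_KP_OXC
1: 260_NO_DJ_PK_PU_FM
1: 260_NO_DJ_PK_PU_SO
1: 260_NO_DJ_PK_PU_S1
1: 260_NO_DJ_PK_PU_AFM_URAMPING
1: 281_NO_KP_HSE
1: 281_NO_KP_HSE_symmetry
1: 282_NO_KP_HSE_complex
1: 283_NO_KP_HF
1: 284_NO_KP_PBE0
1: 285_NO_KP_RE_HSE
1: 286_NO_KP_CR_HSE
1: 304_NO_GO_AF_atommag
1: 381_NO_GO_S1_HSE
1: 382_NO_GO_S2_HSE
1: 383_NO_GO_SO_HSE
1: 384_NO_GO_S1_HSE_loop0_PU
1: 385_NO_GO_RE_S1_HSE
1: 386_NO_GO_MD_S1_HSE
1: 392_NO_GO_LR_HF
"""

# Case lines from the Oct 24 comment.
RUN_OCT_24 = """
1: 102_PW_PINT_UKS
1: 260_NO_DJ_PK_PU_FM
1: 260_NO_DJ_PK_PU_SO
1: 260_NO_DJ_PK_PU_S1
1: 260_NO_DJ_PK_PU_AFM_URAMPING
1: 304_NO_GO_AF_atommag
"""

before = failing_cases(RUN_SEP_29)
after = failing_cases(RUN_OCT_24)

resolved = sorted(before - after)        # listed on Sep 29, gone on Oct 24
still_failing = sorted(before & after)   # listed in both runs
new_failures = sorted(after - before)    # listed only on Oct 24

print("resolved:", resolved)
print("still failing:", still_failing)
print("new:", new_failures)
```

The comparison only covers the integrated test cases; the unit tests in the "The following tests FAILED" lists (for example HSolver_bpcg) are numbered differently in each run and are not parsed here.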

@kirk0830
Collaborator Author

kirk0830 commented Oct 24, 2024

@WHUweiqingzhou 102_PW_PINT_UKS fails because nthreads != 2, following the introduction of OpenMP parallelization in the psi initializer, so this is expected behavior.

@WHUweiqingzhou
Collaborator

I re-ran the converge test on a new branch, converge, but it also fails. I could not get any useful information from the log. Could you have a look, @caic99?
https://github.com/deepmodeling/abacus-develop/actions/runs/11771801075/job/32786252540

@caic99
Member

caic99 commented Nov 12, 2024

@WHUweiqingzhou The test hit the timeout limit of 6 hours.
You can modify here.

3 participants