-
Notifications
You must be signed in to change notification settings - Fork 213
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
To deal with system issues, automatically retry tests #1865
Comments
It seems to me that this would only be useful (in speeding the process) if you always used an |
A switch to create_test that would refer to an existing test set and only run new tests that had a failed status in the existing test suite seems like a worthy option. |
Personally, I would always do this. I was just recording the earlier conversation, where it sounded like some people wouldn't want it.
This would be a good incremental step towards what I want. Long-term, I still think it would be good to have an option to do this automatically. The problem is this: I often kick off CLM test suites overnight with the intention of making a CLM tag first thing in the morning. (The test suites currently take about 4-6 hours to turn around.) When there are system issues like this, it has these consequences:
That said, I'd be happy with your solution as an incremental step, since it would presumably be easier to implement, and would be a good stepping stone to this longer-term solution. |
Machine file change for Livermore Computing compatibility with new CIME [BFB]
Add retry capability to create_test Plus new regression test to exercise this capability. Test suite: scripts_regression_tests --fast Test baseline: Test namelist changes: Test status: bit for bit Fixes #1865 User interface changes?: Yes, new --retry option to create_test Update gh-pages html (Y/N)?: N Code review: @jedwards4b @billsacks
Machine file change for Livermore Computing compatibility with new CIME [BFB]
Machine file change for Livermore Computing compatibility with new CIME [BFB]
This is a low-priority enhancement request; I just want to record some thoughts on it for the future.
When running a test suite, we often run into problems where a small number of tests failed due to system issues. It slows down the testing/tagging process to have to go back and manually rerun these tests. It would be nice if cime's testing facilities had an option to automatically retry tests N times before giving up.
@ekluzek raised this at a CSEG meeting June 12, 2017. @jedwards4b felt this could cause too much confusion. But I was just thinking: A way around the confusion problem would be for the test system to create a totally new version of the test, with a new test id (say, by appending "try2" to the testid) (I realize that may not be trivial to implement, though). @mnlevy1981 suggests having a flag to --create-test (like --num-tries – which is my name, not his). This would default to 1, but you could set it to something like 3 if you want
The text was updated successfully, but these errors were encountered: