instance create saga: create new DAG for each test iteration #3309

gjcolombo · 2023-06-05T22:14:33Z

Modify the instance create saga's unwind tests so that they create a new saga DAG for every iteration of the test (i.e., for each distinct node in the DAG into which a failure can be injected). This ensures that every test iteration uses a saga that attempts to create an instance with a unique instance ID, which is needed to ensure that each iteration can actually reach the node into which it intends to inject an error.

Add a step to the end of the affected tests that verifies that a previously-created-but-unused DAG can still run to completion after all the error injection tests have been run.

davepacheco · 2023-06-06T21:47:27Z

nexus/src/app/sagas/instance_create.rs

+
+        // Run the saga to completion without injecting any errors to help
+        // ensure that an earlier injected failure didn't prevent the saga from
+        // failing deterministically.


What kind of failure would this be?

Yeahhh, this comment is word salad. I fixed it in 1c8340f, but I think we can do even better here.

At the risk of being overly pedantic: the problem this test had is that the instance create saga selects its instance ID in SagaInstanceCreate::make_saga_dag and not as part of the saga itself. As previously written, this meant that the test used the same instance ID for all saga executions. That works for a few iterations, but eventually, the test injects a failure after the "create instance record" node. When that node gets unwound for the first time, it leaves behind an instance record bearing the Destroyed state and the reused instance ID. This causes all subsequent saga executions to fail at the "create instance record" node with an "instance already exists" error, which is enough to pass the test (the saga failed!) but isn't what the test wants to do (it wanted the saga to fail at some later node).
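To make the failure mode above concrete, here is a minimal toy model of it. This is only a sketch: `ToyDag`, `run_saga`, `NODES`, and the node names other than "create instance record" are hypothetical stand-ins, not the Nexus or Steno APIs. It assumes an injected failure replaces a node's action and then unwinds the earlier nodes.

```rust
use std::collections::HashMap;

// Hypothetical stand-ins for the real saga machinery; none of these names
// come from Nexus or Steno.
#[derive(Debug, Clone, Copy, PartialEq)]
enum InstanceState {
    Creating,
    Destroyed,
}

/// A toy "DAG". The instance ID is fixed when the DAG is built, mirroring
/// how the ID is chosen in make_saga_dag rather than by a saga node.
struct ToyDag {
    instance_id: u32,
}

// Node 0 is the "create instance record" node; the other names are made up.
const NODES: [&str; 3] = ["create instance record", "later node A", "later node B"];

/// Runs the toy saga and injects a failure at node `fail_at`, if given.
/// Returns Ok(()) on success or Err(index of the node that actually failed).
fn run_saga(
    db: &mut HashMap<u32, InstanceState>,
    dag: &ToyDag,
    fail_at: Option<usize>,
) -> Result<(), usize> {
    for idx in 0..NODES.len() {
        if fail_at == Some(idx) {
            // Unwind earlier nodes. Undoing node 0 leaves a Destroyed
            // record behind instead of removing the record.
            if idx > 0 {
                db.insert(dag.instance_id, InstanceState::Destroyed);
            }
            return Err(idx);
        }
        if idx == 0 {
            // "instance already exists": any record with this ID blocks creation.
            if db.contains_key(&dag.instance_id) {
                return Err(0);
            }
            db.insert(dag.instance_id, InstanceState::Creating);
        }
    }
    Ok(())
}

/// Reuses one DAG (and so one instance ID) for every iteration, as the
/// original test did, and records where each run actually failed.
fn failing_nodes_with_reused_dag() -> Vec<usize> {
    let mut db = HashMap::new();
    let dag = ToyDag { instance_id: 1 };
    (0..NODES.len())
        .map(|target| run_saga(&mut db, &dag, Some(target)).unwrap_err())
        .collect()
}
```

In this model, the iteration that targets node 2 fails at node 0 instead: the saga still fails, so a test that only checks for failure passes without ever reaching the intended node.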

The idea here is to run the saga to completion using the "original" DAG just to make sure that the test body didn't reuse the DAG in this way: if it did, then the saga will fail even if no errors were injected.

This was a good quick-and-dirty way to make sure the fix was working as intended, but in retrospect, I think it's kind of squicky, because it assumes so much about the structure of the foregoing test. It would be much better if the test tried to verify directly that the saga failed at the node it expected. I'm trying this out now and think it can be done without too much surgery; will report back when I've run the revised test.

> It would be much better if the test tried to verify directly that the saga failed at the node it expected. I'm trying this out now and think it can be done without too much surgery; will report back when I've run the revised test.

Took a crack at this in 7fac7a1.
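For readers without the commit at hand, the shape of that check can be sketched with a toy model. This is an assumption-laden illustration, not the code in 7fac7a1: `ToyDag`, `make_dag`, `run_saga_raw`, and `NODE_COUNT` are hypothetical names, and the "raw result" is reduced to the index of the failing node.

```rust
use std::collections::HashSet;

// Hypothetical toy saga with three nodes; node 0 creates an instance record.
const NODE_COUNT: usize = 3;

struct ToyDag {
    instance_id: u32,
}

/// Builds a DAG with a fresh instance ID on every call (hypothetical helper).
fn make_dag(next_id: &mut u32) -> ToyDag {
    let dag = ToyDag { instance_id: *next_id };
    *next_id += 1;
    dag
}

/// Returns the raw outcome: Ok(()) or Err(index of the node that failed).
/// Unwinding never removes a record, modeling the leftover Destroyed record.
fn run_saga_raw(
    records: &mut HashSet<u32>,
    dag: &ToyDag,
    fail_at: Option<usize>,
) -> Result<(), usize> {
    for idx in 0..NODE_COUNT {
        if fail_at == Some(idx) {
            return Err(idx);
        }
        // HashSet::insert returns false if the ID was already present.
        if idx == 0 && !records.insert(dag.instance_id) {
            return Err(0);
        }
    }
    Ok(())
}

/// Injects a failure at each node in turn, using a new DAG per iteration,
/// and asserts that the saga failed at exactly the targeted node.
fn check_unwind_at_every_node() -> Vec<usize> {
    let mut records = HashSet::new();
    let mut next_id = 1;
    let mut failed_at = Vec::new();
    for target in 0..NODE_COUNT {
        let dag = make_dag(&mut next_id);
        let node = run_saga_raw(&mut records, &dag, Some(target))
            .expect_err("injected failure should fail the saga");
        assert_eq!(node, target, "saga failed at the wrong node");
        failed_at.push(node);
    }
    failed_at
}
```

Checking the failing node directly means the test no longer relies on a final no-error run to detect accidental DAG reuse.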

davepacheco

I like that the test is more precise now. For whatever reason it seems clearer to me to put the new interface on Nexus next to run_saga (e.g., Nexus::run_saga_raw_result() rather than RunnableSaga::run_yielding_raw_result()), but it's definitely good as-is!

gjcolombo requested a review from davepacheco June 6, 2023 15:47

davepacheco approved these changes Jun 6, 2023

fix nonsense comment

1c8340f

gjcolombo linked an issue Jun 6, 2023 that may be closed by this pull request

sagas::instance_create::test::test_action_failure_can_unwind doesn't test failure after all saga nodes #3265

Closed

directly verify that the correct nodes failed

7fac7a1

davepacheco approved these changes Jun 7, 2023

tidy up interface to run a saga and get the raw result

f27199c

gjcolombo enabled auto-merge (squash) June 7, 2023 17:50

gjcolombo merged commit c726e2f into main Jun 7, 2023

gjcolombo deleted the gjcolombo/instance-create-saga-test-fix branch June 7, 2023 18:26

gjcolombo mentioned this pull request Aug 17, 2023

Instance start saga's undo steps refer to nonexistent "instance_id" node #3894

Closed
