write durability: always commit a write to both kube and spicedb, or neither #16

ecordell · 2023-08-28T15:01:27Z

Closes #3

This adds a durable saga that writes to spicedb and kube, with the goal of ensuring that a write happens in both, or neither, but not just one or the other.

There are two methods of writing implemented: a pessimistic lock that prevents other requests from attempting to create the same object at the same time, and an optimistic lock that detects conflicts and rolls back or forward as needed.

Pessimistic outline:

A create namespace foo call comes in from user:evan
Compute a workflow hash i.e. xxhash(create, namespace, foo)
Write to spicedb:
- TOUCH workflow:xxhash(create,namespace,foo)#id@workflow_id:caca56e8-388b-46ca-bf2a-7fe325defe68
- TOUCH namespace:foo#creator@user:evan
- with a precondition:
  - operation: OPERATION_MUST_NOT_MATCH
  - filter: `workflow:xxhash(create,namespace,foo)#id@workflow_id:*`
If the SpiceDB write fails, fail the workflow (return error to user if not async)
In a loop:
- Attempt to write to kube
- If write succeeds, remove workflow lock tuple, return success to user
- If kube resp is IsAlreadyExists, remove workflow lock tuple, return success to user (lock tuple ensures no one else did this write - assuming all traffic is going through the proxy)
- If kube resp is any other error, remove both tuples and return the error to the user
- If there is some other error where the kube response can't be retrieved, continue the loop

Optimistic outline:

A create namespace foo call comes in from user:evan
Write record to SpiceDB
- If SpiceDB write fails, fail the request. The client can retry / fix the error, and no data has been written to either.
- If the SpiceDB write succeeds, but the proxy sees the step as failed (e.g. because the process failed), the write is rolled back, and an error is returned to the user to try again.
Write record to Kube
- If Kube write fails:
  - Check to see if object already exists in Kube:
    - If so, work is done.
    - If object does not exist, revert the SpiceDB write.

Each approach has pros and cons. For now both are supported, and we can configure them per request type or per instance of the proxy.

The durability of this function means that inputs, outputs, and progress state are stored in a sqlite database. The goal is to be robust to service failures (SpiceDB and Kube API) and process failures (network dies, process crashes and restarts).
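
To illustrate what storing progress state buys, here is a minimal sketch of step replay. It assumes an in-memory map in place of the sqlite database, and the `journal` type, `step` method, and `WriteToSpiceDB` step name are hypothetical. A step whose output is already recorded is not executed again when the workflow is re-run after a failure.

```go
package main

import (
	"errors"
	"fmt"
)

// journal maps a step name to its recorded output. It stands in for the
// sqlite-backed state described above.
type journal map[string]string

// step runs fn only if no output has been recorded for name yet.
func (j journal) step(name string, fn func() (string, error)) (string, error) {
	if out, ok := j[name]; ok {
		return out, nil // replay the recorded result instead of re-running
	}
	out, err := fn()
	if err != nil {
		return "", err
	}
	j[name] = out
	return out, nil
}

// workflow performs the two writes in order and appends the name of every
// step that actually executed to *executed.
func workflow(j journal, kubeUp bool, executed *[]string) error {
	_, err := j.step("WriteToSpiceDB", func() (string, error) {
		*executed = append(*executed, "WriteToSpiceDB")
		return "written", nil
	})
	if err != nil {
		return err
	}
	_, err = j.step("WriteToKube", func() (string, error) {
		*executed = append(*executed, "WriteToKube")
		if !kubeUp {
			return "", errors.New("kube unavailable")
		}
		return "created", nil
	})
	return err
}

func main() {
	j := journal{}
	var executed []string
	fmt.Println(workflow(j, false, &executed)) // first run fails at the kube step
	fmt.Println(workflow(j, true, &executed))  // second run resumes from the journal
	fmt.Println(executed)
}
```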

The tests make use of failpoints to inject faults at specific places, and then verify that either both writes effectively happened, or neither did.
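
As a rough idea of how fault injection like this can work, the sketch below uses a tiny in-process registry. `EnableFailPoint`, `DisableFailPoint`, `FailPoint`, and `didPanic` are hypothetical names and not necessarily the API used in this repository; only the `panicKubeWrite` name comes from the tests.

```go
package main

import (
	"fmt"
	"sync"
)

var (
	mu      sync.Mutex
	enabled = map[string]bool{}
)

// EnableFailPoint turns on the named failpoint.
func EnableFailPoint(name string) {
	mu.Lock()
	defer mu.Unlock()
	enabled[name] = true
}

// DisableFailPoint turns off the named failpoint.
func DisableFailPoint(name string) {
	mu.Lock()
	defer mu.Unlock()
	delete(enabled, name)
}

// FailPoint panics if the named failpoint is enabled, and is a no-op otherwise.
func FailPoint(name string) {
	mu.Lock()
	on := enabled[name]
	mu.Unlock()
	if on {
		panic("failpoint: " + name)
	}
}

// writeToKube is a placeholder for the real kube write. The failpoint call
// marks the place where a test can inject a crash.
func writeToKube() {
	FailPoint("panicKubeWrite")
}

// didPanic reports whether fn panicked.
func didPanic(fn func()) (panicked bool) {
	defer func() {
		if r := recover(); r != nil {
			panicked = true
		}
	}()
	fn()
	return false
}

func main() {
	EnableFailPoint("panicKubeWrite")
	fmt.Println(didPanic(writeToKube))
	DisableFailPoint("panicKubeWrite")
	fmt.Println(didPanic(writeToKube))
}
```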

This initial implementation just deals with namespace objects but should be fairly straightforward to make generic for other types. I'm assuming we'll spend time on that in #6.


vroldanbet · 2023-08-29T19:20:24Z

e2e/proxy_test.go

+
+			// paul creates chani's namespace
+			Expect(failpoint.Disable("github.com/authzed/spicedb-kubeapi-proxy/pkg/proxy/panicKubeWrite")).To(Succeed())
+			_, err = paulClient.CoreV1().Namespaces().Create(ctx, &corev1.Namespace{


If the task has been made durable, wouldn't it be restarted where it left off before crashing? If so, I wouldn't expect this call to succeed, because after disabling panicKubeWrite, chani's Create call should eventually have succeeded after the process restarted.

What happens in this test is that the task runner catches the panic and records it as a failure of the WriteToKube activity. So the workflow continues and, because the activity errored, runs CheckKube, finds that the record doesn't exist, and rolls back the write.

I think you're right that that's what would happen if we crashed the whole process, and maybe we should work on a test harness to allow us to actually test that. Or maybe we can contribute something to durabletask to disable panic recovery for tests?
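
The behaviour described in this reply can be sketched as follows, assuming the runner wraps each activity in a `recover`. `runActivity` and `createNamespace` are hypothetical names, and the maps stand in for SpiceDB and kube. The panic surfaces as an ordinary activity error, so the workflow carries on to the check-and-rollback path instead of crashing the process.

```go
package main

import "fmt"

// runActivity converts a panic inside fn into an ordinary error, so the
// workflow sees a failed activity rather than a crashed process.
func runActivity(name string, fn func() error) (err error) {
	defer func() {
		if r := recover(); r != nil {
			err = fmt.Errorf("activity %s failed: %v", name, r)
		}
	}()
	return fn()
}

// createNamespace writes the tuple, attempts the kube write, and on failure
// checks kube and rolls the tuple back if the object is missing.
func createNamespace(tuples, kube map[string]bool, name string, writeToKube func() error) error {
	tuples[name] = true // SpiceDB write
	err := runActivity("WriteToKube", writeToKube)
	if err == nil {
		return nil
	}
	// CheckKube step: if the object exists after all, the work is done.
	if kube[name] {
		return nil
	}
	delete(tuples, name) // roll back the SpiceDB write
	return err
}

func main() {
	tuples, kube := map[string]bool{}, map[string]bool{}
	err := createNamespace(tuples, kube, "chani", func() error { panic("panicKubeWrite") })
	fmt.Println(err, tuples)
}
```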

That would explain it, but it seems weird to have various ways to recover from an error:

- one that handles the scenario where the process does not crash
- one that handles the scenario where the process crashes

Plus there is a record of the task having failed, so why would the workflow ignore it? Or are we implicitly telling it that "we are ok" because we run the CheckKube task?


vroldanbet · 2023-09-04T12:39:46Z

e2e/proxy_test.go

+				err := CreateNamespace(ctx, chaniClient, chaniNamespace)
+				Expect(err).ToNot(BeNil())
+				// pessimistic locking reports a conflict, optimistic locking reports already exists
+				Expect(k8serrors.IsConflict(err) || k8serrors.IsAlreadyExists(err)).To(BeTrue())


Why report different errors? This would break the contract depending on which implementation is used. I think both implementations should yield the same result.

Different operations are happening in kube and spicedb in the two cases. I'm mostly just passing the errors back and not trying to obfuscate. I can look into normalizing them.


github-actions bot added the area/dependencies, area/tooling, and area/core labels Aug 28, 2023

ecordell force-pushed the durable-writes branch from c4cd734 to b3be9e8 Compare August 28, 2023 21:06

write to both kube and spicedb

3be6842

this adds a durable saga that writes to spicedb and kube. if the kube write fails, the spicedb write is reverted. if the program crashes, the process picks up where it left off when it restarts by reading state from a sqlite db.

ecordell force-pushed the durable-writes branch from b3be9e8 to 3be6842 Compare August 29, 2023 02:55

vroldanbet reviewed Aug 29, 2023



This was referenced Aug 30, 2023

highly available proxy #17

Closed

durable workflow data #18

Closed

ecordell changed the title ~~write to both kube and spicedb~~ write durability: always commit a write to both kube and spicedb, or neither Aug 31, 2023

ecordell force-pushed the durable-writes branch 4 times, most recently from e9c8ce0 to db94273 Compare September 1, 2023 21:52

vroldanbet reviewed Sep 4, 2023


vroldanbet mentioned this pull request Sep 4, 2023

Remote SpiceDB Support #24

Merged

implement a locking version of the dual write workflow

0f25b07

this also removes the failpoint library in favor of a quick local version that doesn't require transforming the codebase.

ecordell force-pushed the durable-writes branch from db94273 to 0f25b07 Compare September 5, 2023 20:07

vroldanbet approved these changes Sep 5, 2023


ecordell merged commit 323ad85 into main Sep 6, 2023

github-actions bot locked and limited conversation to collaborators Sep 6, 2023

vroldanbet deleted the durable-writes branch September 6, 2023 10:57
