
Handle rapid deletion and recreation of subgraphs more gracefully #4044

Merged 4 commits from lutter/stop into master on Oct 14, 2022

Conversation

@lutter (Collaborator) commented Oct 7, 2022

In development and similar settings, a subgraph often gets deleted, completely removed, and then immediately redeployed. This usually fails because of internal caches that graph-node keeps. This PR removes these failures on index nodes. If there are dedicated query nodes, they will still not pick up the deployment until their internal caches get updated (by default, within at most 5 minutes). Installations that use a single node, or where the main concern is the index node, will no longer be affected by this cache issue.

@lutter (Collaborator, Author) commented Oct 13, 2022

This also fixes #4005

async fn stop_subgraph(&self, loc: &DeploymentLocator) -> Result<(), StoreError> {
    // Remove the writable from the cache and stop it
    let deployment = loc.id.into();
    let writable = self.writables.lock().unwrap().remove(&deployment);
Contributor

If I read the docs right, this would only panic due to a programming error, so it is relatively safe to unwrap here.
Is that correct?

@lutter (Collaborator, Author)

It would panic if the lock is poisoned, i.e., if some other thread panicked while holding the lock. In that case, it's anybody's guess what's happening, and panicking here is really the only thing we can do.
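For context, a minimal sketch (plain std, not graph-node code) of the behavior under discussion: `Mutex::lock` returns an `Err` only when the mutex is poisoned, that is, when another thread panicked while holding the guard, so the `.unwrap()` in `stop_subgraph` panics exactly in that situation.

```rust
use std::sync::{Arc, Mutex};
use std::thread;

fn main() {
    let m = Arc::new(Mutex::new(0u32));

    // Poison the mutex by panicking while the guard is held.
    let m2 = Arc::clone(&m);
    let _ = thread::spawn(move || {
        let _guard = m2.lock().unwrap();
        panic!("thread panics while holding the lock");
    })
    .join();

    // From here on, lock() returns Err(PoisonError); calling unwrap() on it
    // would panic, which is the case the review comment is about.
    match m.lock() {
        Ok(guard) => println!("value: {}", *guard),
        Err(poisoned) => println!("poisoned, inner value: {}", *poisoned.into_inner()),
    }
}
```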

Comment on lines +445 to +447

enum ExecResult {
    Continue,
    Stop,
Contributor

Just a suggestion, so feel free to ignore this comment.

This enum is somewhat similar to ControlFlow from std.

While we don't need to return any data out of this, maybe we could redefine it as

type ExecResult = ControlFlow<()>

I know this won't change anything now, but it would make room if we want to return data from those contexts in the future.
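For illustration, a small sketch of what that alias could look like; the `step` function and its `done` flag are made up for the example, only `ControlFlow` itself is from std:

```rust
use std::ops::ControlFlow;

// The existing two-variant enum could become a type alias.
// ControlFlow<B, C = ()> has Continue(C) and Break(B) variants; using
// ControlFlow<()> today leaves room to carry data in Break(...) later
// without changing callers that only match on the variant.
type ExecResult = ControlFlow<()>;

fn step(done: bool) -> ExecResult {
    if done {
        ControlFlow::Break(())    // was ExecResult::Stop
    } else {
        ControlFlow::Continue(()) // was ExecResult::Continue
    }
}

fn main() {
    assert!(step(true).is_break());
    assert!(step(false).is_continue());
}
```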

@@ -499,6 +513,8 @@ impl SubgraphStoreInner {
        #[cfg(not(debug_assertions))]
        assert!(!replace);

        self.evict(&schema.id)?;
Contributor

Is the reason we call evict on deployment creation that it gives us more confidence the cache will be cleared?

@lutter (Collaborator, Author)

Yes, to make sure that we don't have a stale entry for the same hash in the cache (which could happen if a previous deployment of the same hash was stopped and then redeployed). This is really redundant with the eviction in stop_subgraph, but since it's a cheap operation, it seemed safer to do it in both places.
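To make the double eviction concrete, here is a hedged sketch; `SiteCache` and its methods below are simplified stand-ins for illustration, not the actual graph-node types:

```rust
use std::collections::HashMap;
use std::sync::Mutex;

// Simplified stand-in for a cache keyed by deployment hash.
struct SiteCache {
    by_hash: Mutex<HashMap<String, u64>>, // hash -> deployment id
}

impl SiteCache {
    // Eviction is cheap and idempotent, so calling it from both paths is safe.
    fn evict(&self, hash: &str) {
        self.by_hash.lock().unwrap().remove(hash);
    }

    fn create_deployment(&self, hash: &str, id: u64) {
        // Drop any stale entry left over from an earlier deployment of the
        // same hash that was stopped and is now being redeployed.
        self.evict(hash);
        self.by_hash.lock().unwrap().insert(hash.to_string(), id);
    }

    fn stop_subgraph(&self, hash: &str) {
        // The common stop/delete path also clears the entry.
        self.evict(hash);
    }
}
```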

@tilacog (Contributor) left a review

LGTM.
I left a few questions to help me understand a little more about how deployment handling works.

…oyment

Previously, the query would treat a non-existent deployment like one that
had the block pointer set already.

Evict deployments from internal caches when

* a new one is created, which covers the case where a deployment is deleted
  and then quickly recreated
* a subgraph is stopped

Note that these evictions will only affect index nodes; query nodes will
keep cache entries until the entry expires.

Fixes #4005
lutter merged commit 4c25b79 into master on Oct 14, 2022
lutter deleted the lutter/stop branch on October 14, 2022 at 17:54