storage: Decommissioning can get stuck by dormant replicas never getting GC'ed #17288
Comments
Processing such a Replica requires a consistent RangeLookup, and we wanted to avoid hammering the metadata ranges, but with quiescence it looks like we're not going to try to GC the replica until after 10 days, which clearly isn't going to cut it. I think it's fine (at least for now) to wake up dormant replicas in
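To make the timing concrete, here's a rough Go sketch of the kind of decision described above: a dormant replica isn't considered for GC until a long inactivity window has passed, unless something hints that it was removed. The constant names and values are illustrative only, not the actual replica GC queue code.

```go
package main

import (
	"fmt"
	"time"
)

// Illustrative thresholds only; the real names and values in the
// replica GC queue may differ.
const (
	// A replica suspected of having been removed is rechecked quickly.
	candidateTimeout = 1 * time.Second
	// A quiesced/dormant replica is only reconsidered after a long
	// inactivity window -- the ~10 day delay discussed above.
	inactivityThreshold = 10 * 24 * time.Hour
)

// shouldQueueForGC sketches the decision being described: a dormant
// replica is not a GC candidate until it has been idle for the full
// inactivity threshold, because checking it requires a consistent
// RangeLookup against the meta ranges.
func shouldQueueForGC(lastActivity time.Time, suspectedRemoval bool, now time.Time) bool {
	idle := now.Sub(lastActivity)
	if suspectedRemoval {
		// With a hint that the replica was removed, check soon.
		return idle > candidateTimeout
	}
	return idle > inactivityThreshold
}

func main() {
	now := time.Now()
	dormantSince := now.Add(-3 * 24 * time.Hour) // idle for 3 days
	fmt.Println(shouldQueueForGC(dormantSince, false, now)) // false: waits ~10 days
	fmt.Println(shouldQueueForGC(dormantSince, true, now))  // true: the hint makes it eager
}
```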
Yeah, although with how fast the scanner runs on nodes that don't have many ranges, we definitely shouldn't do a consistent lookup or wake dormant replicas every time. And to correct my initial post, even restarting the node doesn't wake them up, it turns out.
It's not that expensive to wake them up though, is it? The group will go dormant again after ~1 round. A perhaps better solution could be to signal to replicas that are about to be removed that this is happening. I'd have to page the details back in, but iirc the new configuration in a replica change goes into effect pretty early, and that's why a removed replica often doesn't learn about it until later. We could just commit a Raft command (we could do a direct RPC to the node too, but that doesn't seem less onerous) that triggers "eager GC" for a while on the replica that is supposedly getting removed. Then the scanner would only do eager work for replicas with that flag (as long as the flag is reasonably fresh, say 5min).
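A minimal sketch of that "eager GC" hint, assuming a hypothetical pendingRemoval tracker rather than any real CockroachDB API: the store records when a replica was flagged for removal, and the scanner only does the expensive consistent lookup while the hint is still fresh (~5 minutes here).

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// pendingRemoval sketches the hint described above: when a replica
// change is about to remove a replica, the affected store records a
// timestamp, and the scanner treats the replica as a GC candidate
// while the hint is fresh. Names are illustrative, not the actual API.
type pendingRemoval struct {
	mu      sync.Mutex
	flagged map[int64]time.Time // rangeID -> when the hint was set
}

const hintFreshness = 5 * time.Minute

func (p *pendingRemoval) markPending(rangeID int64, now time.Time) {
	p.mu.Lock()
	defer p.mu.Unlock()
	p.flagged[rangeID] = now
}

// eagerGC reports whether the scanner should do the (relatively
// expensive) consistent RangeLookup for this replica now, instead of
// waiting out the usual inactivity threshold.
func (p *pendingRemoval) eagerGC(rangeID int64, now time.Time) bool {
	p.mu.Lock()
	defer p.mu.Unlock()
	t, ok := p.flagged[rangeID]
	return ok && now.Sub(t) < hintFreshness
}

func main() {
	p := &pendingRemoval{flagged: map[int64]time.Time{}}
	now := time.Now()
	p.markPending(42, now)
	fmt.Println(p.eagerGC(42, now))                     // true: hint is fresh
	fmt.Println(p.eagerGC(42, now.Add(10*time.Minute))) // false: hint expired
	fmt.Println(p.eagerGC(7, now))                      // false: never flagged
}
```

Whether the hint travels as a Raft command or a direct RPC only changes how it reaches the store; the freshness window is what keeps the scanner's extra work bounded.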
Yeah, but they'll be getting woken up every
Do we need to worry about how fast GC happens in situations other than decommissioning? If not, we can just GC more eagerly when in a decommissioning state.
That's a good idea.
Not really, though it's one of those things that's often annoying. You're debugging something, and there are these old replicas lying around -- I'd say it'd be nice to smooth out this process, but it shouldn't be crucial.
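For illustration, a small sketch of the "GC more eagerly while decommissioning" idea from this exchange; anyNodeDecommissioning and the interval values are hypothetical placeholders, not existing functions or settings.

```go
package main

import (
	"fmt"
	"time"
)

// Hypothetical: something the GC queue could consult to learn whether
// any node in the cluster is currently decommissioning.
func anyNodeDecommissioning() bool { return true }

// gcCheckInterval keeps the usual long inactivity window in steady
// state, but drops to a short one while a decommission is in flight,
// so dormant replicas on the draining node are looked at promptly.
func gcCheckInterval() time.Duration {
	const (
		normalInterval       = 10 * 24 * time.Hour // illustrative, matches the ~10 day delay above
		decommissionInterval = 1 * time.Minute     // illustrative eager interval
	)
	if anyNodeDecommissioning() {
		return decommissionInterval
	}
	return normalInterval
}

func main() {
	fmt.Println(gcCheckInterval()) // 1m0s while a decommission is running
}
```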
Tests the fix in cockroachdb#17304 for issue cockroachdb#17288
While playing around with replica decommissioning, I was able to get the process stuck. It's stuck because even though all replicas have been officially replicated away from the node, it still has two dormant, non-GC'ed replicas on it, and thus it still shows up as not being empty:
To be honest, I'm not quite sure how the two replicas left on the node never received the Raft commands that removed them from the range, but now that they're in this state they're stuck forever (or until restarting the node, presumably) because their dormant state keeps them from trying to send traffic to the other replicas and learning that they were removed.
@tschottdorf @garvitjuniwal
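For context, the check a GC pass would ultimately perform is roughly this: fetch the authoritative range descriptor via a consistent RangeLookup and see whether the local store is still a member. The types and field names below are simplified stand-ins, not the real descriptors; the point is that a dormant replica never initiates this check on its own, which is how it gets stuck.

```go
package main

import "fmt"

// Simplified stand-ins for the descriptors a consistent RangeLookup
// would return; field names are illustrative.
type replicaDescriptor struct {
	NodeID  int
	StoreID int
}

type rangeDescriptor struct {
	RangeID  int64
	Replicas []replicaDescriptor
}

// isStillMember is the core of the GC decision: once the authoritative
// descriptor has been fetched from the meta ranges, a local replica
// whose store no longer appears in it can be garbage-collected.
func isStillMember(desc rangeDescriptor, storeID int) bool {
	for _, r := range desc.Replicas {
		if r.StoreID == storeID {
			return true
		}
	}
	return false
}

func main() {
	desc := rangeDescriptor{
		RangeID: 12,
		Replicas: []replicaDescriptor{
			{NodeID: 2, StoreID: 2},
			{NodeID: 3, StoreID: 3},
			{NodeID: 4, StoreID: 4},
		},
	}
	// Store 1 was removed from the range but still holds a dormant replica.
	fmt.Println(isStillMember(desc, 1)) // false: safe to GC the local replica
}
```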