Do not remove flood block from indices on nodes undergoing replacement #78942

dakrone · 2021-10-11T20:03:57Z

This commit enhances DiskThresholdMonitor so that indices that have a flood-stage block will not
have the block removed while they reside on a node that is part of a "REPLACE"-type node shutdown.

This prevents a situation where a node is blocked due to disk usage, then during the replacement the
block is removed while shards are relocating to the target node, indexing occurs, and then the
target runs out of space due to the additional documents.

Relates to #70338 and #76247

This commit enhances `DiskThresholdMonitor` so that indices that have a flood-stage block will not have the block removed while they reside on a node that is part of a "REPLACE"-type node shutdown. This prevents a situation where a node is blocked due to disk usage, then during the replacement the block is removed while shards are relocating to the target node, indexing occurs, and then the target runs out of space due to the additional documents. Relates to elastic#70338 and elastic#76247

elasticmachine · 2021-10-11T20:04:00Z

Pinging @elastic/es-core-infra (Team:Core/Infra)

henningandersen

LGTM.

henningandersen · 2021-10-12T10:17:19Z

server/src/main/java/org/elasticsearch/cluster/routing/allocation/DiskThresholdMonitor.java

+            .collect(Collectors.toSet());
+
+        // Generate a set of all the indices that exist on either the target or source of a node replacement
+        final Set<String> indicesOnReplaceSourceOrTarget = nodesIdsPartOfReplacement.stream()


I wonder if we should special handle the case where the source is empty (no assigned shards) and allow releasing the block then? That should be safe and could shorten the time until releasing the block in case there are multiple orchestration steps involved.

I think we should leave it as-is for now, since we don't expect to be in the COMPLETE phase for a replacement for more than ~10ish seconds anyway, and if we determine that it is too long in the future, we can revisit this in subsequent work.

...er/src/test/java/org/elasticsearch/cluster/routing/allocation/DiskThresholdMonitorTests.java

…e-replace

elasticsearchmachine · 2021-10-12T18:07:55Z

💔 Backport failed

Status	Branch	Result
❌	7.x	Commit could not be cherrypicked due to conflicts

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 78942

elastic#78942) This commit enhances `DiskThresholdMonitor` so that indices that have a flood-stage block will not have the block removed while they reside on a node that is part of a "REPLACE"-type node shutdown. This prevents a situation where a node is blocked due to disk usage, then during the replacement the block is removed while shards are relocating to the target node, indexing occurs, and then the target runs out of space due to the additional documents. Relates to elastic#70338 and elastic#76247 # Conflicts: # server/src/test/java/org/elasticsearch/cluster/routing/allocation/DiskThresholdMonitorTests.java

#78942) (#79008) This commit enhances `DiskThresholdMonitor` so that indices that have a flood-stage block will not have the block removed while they reside on a node that is part of a "REPLACE"-type node shutdown. This prevents a situation where a node is blocked due to disk usage, then during the replacement the block is removed while shards are relocating to the target node, indexing occurs, and then the target runs out of space due to the additional documents. Relates to #70338 and #76247 # Conflicts: # server/src/test/java/org/elasticsearch/cluster/routing/allocation/DiskThresholdMonitorTests.java

dakrone added v8.0.0 :Core/Infra/Node Lifecycle Node startup, bootstrapping, and shutdown v7.16.0 labels Oct 11, 2021

dakrone requested a review from henningandersen October 11, 2021 20:03

elasticmachine added the Team:Core/Infra Meta label for core/infra team label Oct 11, 2021

henningandersen approved these changes Oct 12, 2021

View reviewed changes

dakrone added 3 commits October 12, 2021 10:40

Don't override node name in all tests, use differing id and name

43163e2

Use AtomicReference for cluster state in test and deduplicate code

2380bac

Merge remote-tracking branch 'origin/master' into dont-unblock-on-nod…

d81d5a4

…e-replace

dakrone added the auto-backport-and-merge label Oct 12, 2021

Fix test compilation

9540f42

dakrone merged commit 31e7cf9 into elastic:master Oct 12, 2021

dakrone deleted the dont-unblock-on-node-replace branch October 12, 2021 18:07

dakrone mentioned this pull request Oct 12, 2021

[7.x] Do not remove flood block from indices on nodes undergoing replacement (#78942) #79008

Merged

jakelandis added v8.0.0-beta1 and removed v8.0.0 labels Oct 27, 2021

danhermann added the >non-issue label Dec 3, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Do not remove flood block from indices on nodes undergoing replacement #78942

Do not remove flood block from indices on nodes undergoing replacement #78942

dakrone commented Oct 11, 2021

elasticmachine commented Oct 11, 2021

henningandersen left a comment

henningandersen Oct 12, 2021

dakrone Oct 12, 2021

elasticsearchmachine commented Oct 12, 2021

Do not remove flood block from indices on nodes undergoing replacement #78942

Do not remove flood block from indices on nodes undergoing replacement #78942

Conversation

dakrone commented Oct 11, 2021

elasticmachine commented Oct 11, 2021

henningandersen left a comment

Choose a reason for hiding this comment

henningandersen Oct 12, 2021

Choose a reason for hiding this comment

dakrone Oct 12, 2021

Choose a reason for hiding this comment

elasticsearchmachine commented Oct 12, 2021

💔 Backport failed