Prevent Duplicate ILM Cluster State Updates from Being Created #78390

original-brownbear · 2021-09-28T13:43:13Z

Prevent duplicate ILM tasks from being enqueued to fix the most immediate issues around #78246. The ILM logic should be further improved though. I did not include MoveToErrorStepUpdateTask in this change yet as I wasn't entirely sure how valid/safe hashing/comparing arbitrary Exceptions would be. That could be looked into in a follow-up as well.

Relates #77466

Closes #78246

WIP, deduplicating these tasks.

elasticmachine · 2021-09-28T14:37:09Z

Pinging @elastic/es-data-management (Team:Data Management)

probakowski

Thanks @original-brownbear, this change looks good. I left just one question and small suggestion.
I tested this locally and with creating 1000 indices at the same time and tasks count (_cat/pending_tasks) never went above 5000 comparing to 1000000 without this change.

probakowski · 2021-09-28T20:41:19Z

x-pack/plugin/ilm/src/main/java/org/elasticsearch/xpack/ilm/IndexLifecycleRunner.java

+    }
+
+    private boolean registerTask(IndexLifecycleClusterStateUpdateTask task) {
+        synchronized (executingTasks) {


any reason for explicit synchronized block and not using Collections.synchronizedSet(new HashSet<>()) or ConcurrentHashMap?

Right ... not sure why I did it this used the sync set now and just inlined everything since it's all one liners now :)

Thanks!

probakowski · 2021-09-28T20:42:17Z

x-pack/plugin/ilm/src/main/java/org/elasticsearch/xpack/ilm/IndexLifecycleRunner.java

+        final boolean removed;
+        synchronized (executingTasks) {
+            removed = executingTasks.remove(task);
+        }
+        assert removed : "tried to unregister unknown task [" + task + "]";


Suggested change

final boolean removed;

synchronized (executingTasks) {

removed = executingTasks.remove(task);

}

assert removed : "tried to unregister unknown task [" + task + "]";

synchronized (executingTasks) {

final boolean removed = executingTasks.remove(task);

assert removed : "tried to unregister unknown task [" + task + "]";

}

probakowski

LGTM, thanks @original-brownbear!

original-brownbear · 2021-09-29T05:49:14Z

Thanks @probakowski !

…ic#78390) Prevent duplicate ILM tasks from being enqueued to fix the most immediate issues around elastic#78246. The ILM logic should be further improved though. I did not include `MoveToErrorStepUpdateTask` in this change yet as I wasn't entirely sure how valid/safe hashing/comparing arbitrary `Exception`s would be. That could be looked into in a follow-up as well. Relates elastic#77466 Closes elastic#78246

… (#78427) Prevent duplicate ILM tasks from being enqueued to fix the most immediate issues around #78246. The ILM logic should be further improved though. I did not include `MoveToErrorStepUpdateTask` in this change yet as I wasn't entirely sure how valid/safe hashing/comparing arbitrary `Exception`s would be. That could be looked into in a follow-up as well. Relates #77466 Closes #78246

Follow up to elastic#78390. The `EmptyInfo` would not compare correctly because it doesn't implement equals or hashcode, breaking deduplication for `SetStepInfoUpdateTask`. => just making it a singleton to fix this and have a fast comp via instance equality.

Follow up to #78390. The `EmptyInfo` would not compare correctly because it doesn't implement equals or hashcode, breaking deduplication for `SetStepInfoUpdateTask`. => just making it a singleton to fix this and have a fast comp via instance equality.

Follow up to elastic#78390. The `EmptyInfo` would not compare correctly because it doesn't implement equals or hashcode, breaking deduplication for `SetStepInfoUpdateTask`. => just making it a singleton to fix this and have a fast comp via instance equality.

Follow up to #78390. The `EmptyInfo` would not compare correctly because it doesn't implement equals or hashcode, breaking deduplication for `SetStepInfoUpdateTask`. => just making it a singleton to fix this and have a fast comp via instance equality.

If the current combination of current-step and index has a running CS update task enqueued there is no point in adding yet another task for this combination on the applier and we can skip the expensive inspection for the index. follow up to elastic#78390

If the current combination of current-step and index has a running CS update task enqueued there is no point in adding yet another task for this combination on the applier and we can skip the expensive inspection for the index. follow up to #78390

If the current combination of current-step and index has a running CS update task enqueued there is no point in adding yet another task for this combination on the applier and we can skip the expensive inspection for the index. follow up to elastic#78390

If the current combination of current-step and index has a running CS update task enqueued there is no point in adding yet another task for this combination on the applier and we can skip the expensive inspection for the index. follow up to #78390

Prevent Duplicate ILM Cluster State Updates from Being Created

5985b35

WIP, deduplicating these tasks.

original-brownbear added WIP :Data Management/ILM+SLM Index and Snapshot lifecycle management labels Sep 28, 2021

elasticsearchmachine added the v8.0.0 label Sep 28, 2021

nice

c8d0f89

original-brownbear added v7.16.0 >bug and removed WIP labels Sep 28, 2021

original-brownbear requested review from martijnvg, probakowski and joegallo September 28, 2021 14:36

original-brownbear marked this pull request as ready for review September 28, 2021 14:37

elasticmachine added the Team:Data Management Meta label for data/management team label Sep 28, 2021

original-brownbear mentioned this pull request Sep 28, 2021

[WIP] ILM Refactoring to Fix Task Explosion #78300

Closed

probakowski reviewed Sep 28, 2021

View reviewed changes

original-brownbear added 2 commits September 28, 2021 23:00

Merge remote-tracking branch 'elastic/master' into dedup-ilm

e9736be

CR: comments + simpler

bdde571

original-brownbear requested a review from probakowski September 28, 2021 21:05

probakowski approved these changes Sep 28, 2021

View reviewed changes

original-brownbear merged commit 990aa34 into elastic:master Sep 29, 2021

original-brownbear deleted the dedup-ilm branch September 29, 2021 05:49

original-brownbear mentioned this pull request Sep 29, 2021

Prevent Duplicate ILM Cluster State Updates from Being Created (#78390) #78427

Merged

original-brownbear mentioned this pull request Sep 29, 2021

Fix Empty Step Info not Comparing Correctly #78442

Merged

original-brownbear mentioned this pull request Sep 29, 2021

Fix Empty Step Info not Comparing Correctly (#78442) #78447

Merged

original-brownbear mentioned this pull request Sep 29, 2021

Skip Inspecting Busy Indices on ILM CS Application #78471

Merged

probakowski mentioned this pull request Sep 29, 2021

Use batching for ILM cluster state updates #78488

Closed

original-brownbear mentioned this pull request Sep 30, 2021

Skip Inspecting Busy Indices on ILM CS Application (#78471) #78496

Merged

jakelandis added v8.0.0-beta1 and removed v8.0.0 labels Oct 27, 2021

gmarouli mentioned this pull request Aug 4, 2022

[CI] ShrinkActionIT testAutomaticRetryFailedShrinkAction failing #78460

Closed

original-brownbear restored the dedup-ilm branch April 18, 2023 20:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prevent Duplicate ILM Cluster State Updates from Being Created #78390

Prevent Duplicate ILM Cluster State Updates from Being Created #78390

original-brownbear commented Sep 28, 2021 •

edited

Loading

elasticmachine commented Sep 28, 2021

probakowski left a comment

probakowski Sep 28, 2021

original-brownbear Sep 28, 2021

probakowski Sep 28, 2021

probakowski left a comment

original-brownbear commented Sep 29, 2021

Prevent Duplicate ILM Cluster State Updates from Being Created #78390

Prevent Duplicate ILM Cluster State Updates from Being Created #78390

Conversation

original-brownbear commented Sep 28, 2021 • edited Loading

elasticmachine commented Sep 28, 2021

probakowski left a comment

Choose a reason for hiding this comment

probakowski Sep 28, 2021

Choose a reason for hiding this comment

original-brownbear Sep 28, 2021

Choose a reason for hiding this comment

probakowski Sep 28, 2021

Choose a reason for hiding this comment

probakowski left a comment

Choose a reason for hiding this comment

original-brownbear commented Sep 29, 2021

original-brownbear commented Sep 28, 2021 •

edited

Loading