Speed up DiscoveryNodeFilters.trimTier #78170

original-brownbear · 2021-09-22T10:52:22Z

This shows up as expensive in profiling in at least the data tier allocation decider in 7.x.
Made it more efficient and also made copyMapWithRemovedEntry that it uses more
efficient and added tests for it.

This may be unused in `8` but it shows up as expensive in profiling in at least the data tier allocation decider in 7.x. Made it more efficient and also made `copyMapWithRemovedEntry` that it uses more efficient and added tests for it.

elasticmachine · 2021-09-22T10:52:24Z

Pinging @elastic/es-data-management (Team:Data Management)

dakrone

in at least the data tier allocation decider in 7.x.

I don't understand this, DataTierAllocationDecider uses neither DiscoveryNodeFilters nor calls the trimTier method? Do you mean the FilterAllocationDecider?

I am a little worried that this is premature optimization, other than JMH benchmarks, how big of an impact does this have in a real-world system?

dakrone · 2021-09-29T19:19:06Z

server/src/main/java/org/elasticsearch/cluster/node/DiscoveryNodeFilters.java

@@ -82,6 +85,8 @@ private boolean matchByIP(String[] values, @Nullable String hostIp, @Nullable St
        return false;
    }

+    private static final String TIER_PREFERENCE = "_tier_preference";


Can you move this up to the top where the other class variables are please?

dakrone · 2021-09-29T19:24:57Z

server/src/main/java/org/elasticsearch/common/util/Maps.java

+        if (map.containsKey(key) == false) {
+            return map;
+        }


This subtly changes the behavior because someone would assume that the returned value is always a copied map (since it's in the name), and not the original map

dakrone · 2021-09-29T19:26:12Z

server/src/main/java/org/elasticsearch/common/util/Maps.java

+        @SuppressWarnings("rawtypes")
+        final Map.Entry<K, V>[] entries = new Map.Entry[size - 1];
+        int i = 0;
+        for (Map.Entry<K, V> entry : map.entrySet()) {
+            if (key.equals(entry.getKey()) == false) {
+                entries[i++] = entry;
+            }
+        }
+        return Map.ofEntries(entries);


This feels a lot like premature optimization, is this really a bottleneck here?

This whole thing showed up hot during cluster restart + reroute tests on large shard count benchmarks. The problem with those really is that all these stream things look nice and their overhead might be irrelevant most of the time, now show up all over the place when they get nested inside other loops and we simply only have a single master update thread :)
I think the 7.x code is different for the data tier allocation decider that's why it shows up there, for 8.x this is less of a relevant change probably but as you point out will help with the filter decider.

original-brownbear · 2021-11-02T11:07:17Z

Closing this in favor of #80179 which is a simpler and much faster solution.

Speed up DiscoveryNodeFilters.trimTier

fefdffc

This may be unused in `8` but it shows up as expensive in profiling in at least the data tier allocation decider in 7.x. Made it more efficient and also made `copyMapWithRemovedEntry` that it uses more efficient and added tests for it.

original-brownbear added >non-issue :Data Management/Other v8.0.0 v7.16.0 labels Sep 22, 2021

elasticmachine added the Team:Data Management Meta label for data/management team label Sep 22, 2021

original-brownbear added 2 commits September 22, 2021 12:52

fix

8e061db

fix

daed949

original-brownbear requested a review from dakrone September 22, 2021 13:11

dakrone reviewed Sep 29, 2021

View reviewed changes

danhermann added v8.1.0 and removed v7.16.0 labels Oct 27, 2021

original-brownbear closed this Nov 2, 2021

original-brownbear removed >non-issue :Data Management/Other v8.0.0 Team:Data Management Meta label for data/management team v8.1.0 labels Nov 2, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up DiscoveryNodeFilters.trimTier #78170

Speed up DiscoveryNodeFilters.trimTier #78170

original-brownbear commented Sep 22, 2021 •

edited

Loading

elasticmachine commented Sep 22, 2021

dakrone left a comment

dakrone Sep 29, 2021

dakrone Sep 29, 2021

dakrone Sep 29, 2021

original-brownbear Sep 29, 2021

original-brownbear commented Nov 2, 2021

Speed up DiscoveryNodeFilters.trimTier #78170

Speed up DiscoveryNodeFilters.trimTier #78170

Conversation

original-brownbear commented Sep 22, 2021 • edited Loading

elasticmachine commented Sep 22, 2021

dakrone left a comment

Choose a reason for hiding this comment

dakrone Sep 29, 2021

Choose a reason for hiding this comment

dakrone Sep 29, 2021

Choose a reason for hiding this comment

dakrone Sep 29, 2021

Choose a reason for hiding this comment

original-brownbear Sep 29, 2021

Choose a reason for hiding this comment

original-brownbear commented Nov 2, 2021

original-brownbear commented Sep 22, 2021 •

edited

Loading