
Performance Regression for every CS update from ILM's org.elasticsearch.cluster.metadata.Metadata#isIndexManagedByILM #98992

Open
original-brownbear opened this issue Aug 29, 2023 · 4 comments

original-brownbear (Member) commented Aug 29, 2023

Going over the many-shards benchmark bootstrapping, I noticed it has slowed down quite a bit recently.

It turns out that a big contributor to this is org.elasticsearch.cluster.metadata.Metadata#isIndexManagedByILM, called from org.elasticsearch.xpack.ilm.IndexLifecycleService#triggerPolicies on every cluster state update and costing O(N) in the number of indices.
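
For context, the hot path as discussed in this thread looks roughly like the following (a reconstructed sketch based on the snippets quoted below, not the exact source):

    // Reconstructed sketch: resolve the index's parent data stream via the
    // (expensive) indices lookup, then, if that data stream has an enabled
    // lifecycle, read the per-index prefer_ilm setting.
    public static boolean isIndexManagedByILM(IndexMetadata indexMetadata, Metadata metadata) {
        IndexAbstraction indexAbstraction = metadata.getIndicesLookup() // tree-map lookup, O(log N) per index
            .get(indexMetadata.getIndex().getName());
        DataStream parentDataStream = indexAbstraction.getParentDataStream();
        if (parentDataStream != null && parentDataStream.getLifecycle() != null && parentDataStream.getLifecycle().isEnabled()) {
            // prefer_ilm decides whether ILM or the data stream lifecycle manages the index
            return IndexSettings.PREFER_ILM_SETTING.get(indexMetadata.getSettings());
        }
        return true; // no DSL-managed parent: ILM manages the index
    }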


This could be made more efficient in various ways.

At least we should:

  • remove the setting read from the hot loop
  • stop using Metadata.getIndicesLookup; it is extremely expensive on the applier thread

A first quick fix would be to check whether any data streams even use DLM; if none do, the whole logic can be skipped (a sketch follows below). This currently adds roughly 5% overhead to every CS update (relative to operations like index creation and shard allocation in the many-shards benchmark) at 25k indices in a cluster, and the overhead grows in O(number_of_indices).
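
A minimal sketch of that short-circuit, assuming we can cheaply iterate the data streams on the Metadata object (the helper name anyDataStreamUsesLifecycle is illustrative, not an existing method):

    // Illustrative short-circuit before the per-index loop in triggerPolicies:
    // if no data stream has an enabled lifecycle, skip the O(N) scan entirely.
    private static boolean anyDataStreamUsesLifecycle(Metadata metadata) {
        for (DataStream dataStream : metadata.dataStreams().values()) {
            if (dataStream.getLifecycle() != null && dataStream.getLifecycle().isEnabled()) {
                return true;
            }
        }
        return false;
    }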

@original-brownbear original-brownbear added >bug :Data Management/ILM+SLM Index and Snapshot lifecycle management labels Aug 29, 2023
@elasticsearchmachine elasticsearchmachine added the Team:Data Management Meta label for data/management team label Aug 29, 2023
elasticsearchmachine (Collaborator) commented

Pinging @elastic/es-data-management (Team:Data Management)

nielsbauman (Contributor) commented

I had a look at the options that @original-brownbear suggested and also tried to come up with some other options myself.

  1. remove setting read in the hot loop

    The PREFER_ILM_SETTING is configured in the settings of individual indices, so I don't think it can be hoisted out of the loop.

  2. stop using Metadata.getIndicesLookup

    A way to avoid using getIndicesLookup would be to first loop over all the data streams that have a DSL and collect all the indices they cover. Then, when looping over all indices in the cluster, we can use that precomputed collection to determine whether an index is covered by a DSL (and we would still have to check the index's settings to know which lifecycle wins). See the sketch after this list.

  3. a first quick fix would be to first check if any datastreams even use DLM and if the answer is no, the whole logic can be skipped.

    While definitely a good idea, we ship some data streams with DSL enabled by default, I believe, so that would generally not have any effect, unfortunately.

  4. I see that the validation part of reading a setting involves a number of method calls and object creations. Would it be worthwhile to read the setting without validation? This would assume that the settings are validated when stored.
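
A rough sketch of option 2, assuming iterating the data streams and their backing indices is cheap relative to getIndicesLookup (the variable names here are illustrative):

    // Sketch: precompute the backing indices of all data streams that have an
    // enabled lifecycle, so the hot loop never touches Metadata.getIndicesLookup().
    Set<Index> dslCoveredIndices = new HashSet<>();
    for (DataStream dataStream : metadata.dataStreams().values()) {
        if (dataStream.getLifecycle() != null && dataStream.getLifecycle().isEnabled()) {
            dslCoveredIndices.addAll(dataStream.getIndices());
        }
    }
    for (IndexMetadata indexMetadata : metadata.indices().values()) {
        boolean coveredByDsl = dslCoveredIndices.contains(indexMetadata.getIndex());
        // Still need the per-index setting to decide which lifecycle wins.
        boolean managedByIlm = coveredByDsl == false
            || IndexSettings.PREFER_ILM_SETTING.get(indexMetadata.getSettings());
        // ... trigger the policy only if managedByIlm ...
    }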

original-brownbear (Member, Author) commented

Maybe this helps:

The many-shards project effectively concluded that ILM is somewhat fundamentally flawed in how it executes policies. Policies are optionally triggered on each cluster state update by inspecting each index individually, so the logic scales O(N) in the number of indices in the cluster, which makes it by far the most expensive CS listener in larger clusters.

The new code that caused this makes the situation considerably worse by introducing another log(N) cost factor per index (for the tree-map lookup), so this thing now scales O(N*log(N)). This cannot end well.

The real fix here should be to stop having that O(N) cost in ILM by fixing #80407. If we want to fix this in isolation because #80407 is too hard right now, I'd strongly suggest trying to cache the information on the IndexMetadata in some form; any other fix that involves allocations or more indirection is probably not going to get us much. The cost here is the index-lookup resolution plus the setting read. Both just need to go away; optimizing them is doomed to fail, in my experience from the many-shards project.
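
Purely as an illustration of the "cache it on IndexMetadata" suggestion (no such field exists today; the field and setter names below are assumptions about one possible shape):

    // Hypothetical caching at metadata-build time: derive the flag once per
    // index, so the per-cluster-state-update check becomes an O(1) field read
    // with no indices-lookup resolution and no validated setting read.
    boolean managedByILM = parentDataStream == null
        || parentDataStream.getLifecycle() == null
        || parentDataStream.getLifecycle().isEnabled() == false
        || IndexSettings.PREFER_ILM_SETTING.get(indexMetadata.getSettings());
    indexMetadataBuilder.managedByILM(managedByILM); // hypothetical builder setter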

andreidan (Contributor) commented

Thanks, Armin for reporting this and Niels for working on it.

++ on making this more efficient. TIL about the cost of getIndicesLookup when called from the cluster applier thread (on the surface it appears to take advantage of memoization).

It's very surprising to me that reading the PREFER_ILM setting costs so much here. That code should only execute if the data stream has a lifecycle, so the setting should only be read 3 times (we ship 3 data streams managed by DSL right now).

This condition guarding the setting read should seldom be true in stateful:

 if (parentDataStream != null && parentDataStream.getLifecycle() != null && parentDataStream.getLifecycle().isEnabled()) {
