Large ILM Task Batches are Executed too Slowly #82708
Labels
>bug
:Data Management/ILM+SLM
Index and Snapshot lifecycle management
Team:Data Management
Meta label for data/management team
In many shards benchmarking we see a number of warnings about large ILM task batches getting executed too slowly and it in-fact breaks other cluster operations (namely index-auto-create) during benchmarking temporarily if and when a large ILM batch hits us with something like this:
The slowness in executing these batches is almost exclusively a result of needlessly rebuilding the full cluster state over and over in each task.
Each of these tasks tends to only changes the per-index metadata but otherwise leaves the cluster state as is. We should fix the batching here to not have each task output a full cluster state and instead just apply tasks to a single builder in a loop to only build one cluster state instead of potentially thousands.
relates #77466
The text was updated successfully, but these errors were encountered: