Fail put-mapping requests sooner if they will exceed the field number limit #35564

ppf2 · 2018-11-14T20:38:52Z

6.4.1

We had a scenario when an index was already hitting the field number limit (index.mapping.total_fields.limit) and subsequent (high volume) indexing requests attempted to add new fields to this index. As a result, a lot of put_mapping tasks got generated. This caused the cluster
state to be held on in memory and became non-GC-able until these mapping updates eventually got rejected (and the coordinating node ran out of memory).

This is an enhancement request to handle this situation better. Is this something the real memory circuit breaker in 7.0 will help with?

elasticmachine · 2018-11-14T20:38:53Z

Pinging @elastic/es-core-infra

dakrone · 2018-11-14T20:54:14Z

This is an enhancement request to handle this situation better. Is this something the real memory circuit breaker in 7.0 will help with?

I think we should try to address the root cause, if possible, it'd be nice if we could check the limit for mappings prior to a put-mapping request being sent to the master. For instance, if the local node's cluster state contains over 1000 fields in the mapping (with the default limit being 1000), we know that even if the cluster state is behind the number of fields cannot decrease, so no need to send an update mapping request to the master node. The request can be rejected without overloading any other node.

Bukhtawar · 2019-04-18T04:07:18Z

Hi @dakrone
Based on my understanding theTransportPutMappingAction is handled by the master which only checks for block and then goes ahead submitting a cluster state update task. Do you think it makes sense to reject it at master but before submitting the cluster state update tasks just as we check for blocks. I believe since the update task is serialized on the master and put-mapping has a priority HIGH, processing gets significantly delayed by the PutMappingExecutor(espl in cases when there are pending tasks with priority URGENT) allowing the heap build up. This would help even in cases where the local cluster state was lagging unaware of field limit breach.

Bukhtawar · 2019-04-22T06:21:22Z

Hey @dakrone, I'll be more than happy to work on this PR. Please share your thoughts on the same

Bukhtawar · 2019-05-07T08:51:10Z

@ppf2 @dakrone any thoughts on this?

DaveCTurner · 2019-05-12T10:19:38Z

I think that the coordinating node no longer runs out of memory due to failed put-mappings calls in versions ≥7.0, so I have updated the title of this issue to reflect the remaining work mentioned in this comment.

elasticmachine · 2019-05-12T10:44:10Z

Pinging @elastic/es-distributed

Today if the primary discovers that an indexing request needs a mapping update then it will send it to the master for validation and processing. If, however, the put-mapping request is invalid then the master still processes it as a (no-op) cluster state update. When there are a large number of indexing operations that result in invalid mapping updates this can overwhelm the master. However, the primary already has a reasonably up-to-date mapping against which it can check the (approximate) validity of the put-mapping request before sending it to the master. For instance it is not possible to remove fields in a mapping update, so if the primary detects that a mapping update will exceed the fields limit then it can reject it itself and avoid bothering the master. This commit adds a pre-flight check to the mapping update path so that the primary can discard obviously-invalid put-mapping requests itself. Fixes elastic#35564

Today if the primary discovers that an indexing request needs a mapping update then it will send it to the master for validation and processing. If, however, the put-mapping request is invalid then the master still processes it as a (no-op) cluster state update. When there are a large number of indexing operations that result in invalid mapping updates this can overwhelm the master. However, the primary already has a reasonably up-to-date mapping against which it can check the (approximate) validity of the put-mapping request before sending it to the master. For instance it is not possible to remove fields in a mapping update, so if the primary detects that a mapping update will exceed the fields limit then it can reject it itself and avoid bothering the master. This commit adds a pre-flight check to the mapping update path so that the primary can discard obviously-invalid put-mapping requests itself. Fixes #35564

Today if the primary discovers that an indexing request needs a mapping update then it will send it to the master for validation and processing. If, however, the put-mapping request is invalid then the master still processes it as a (no-op) cluster state update. When there are a large number of indexing operations that result in invalid mapping updates this can overwhelm the master. However, the primary already has a reasonably up-to-date mapping against which it can check the (approximate) validity of the put-mapping request before sending it to the master. For instance it is not possible to remove fields in a mapping update, so if the primary detects that a mapping update will exceed the fields limit then it can reject it itself and avoid bothering the master. This commit adds a pre-flight check to the mapping update path so that the primary can discard obviously-invalid put-mapping requests itself. Fixes elastic#35564

Today if the primary discovers that an indexing request needs a mapping update then it will send it to the master for validation and processing. If, however, the put-mapping request is invalid then the master still processes it as a (no-op) cluster state update. When there are a large number of indexing operations that result in invalid mapping updates this can overwhelm the master. However, the primary already has a reasonably up-to-date mapping against which it can check the (approximate) validity of the put-mapping request before sending it to the master. For instance it is not possible to remove fields in a mapping update, so if the primary detects that a mapping update will exceed the fields limit then it can reject it itself and avoid bothering the master. This commit adds a pre-flight check to the mapping update path so that the primary can discard obviously-invalid put-mapping requests itself. Fixes #35564 Backport of #48817

ppf2 added the :Core/Infra/Circuit Breakers Track estimates of memory consumption to prevent overload label Nov 14, 2018

Bukhtawar mentioned this issue Mar 15, 2019

Factor GC overhead in circuit breakers #40115

Closed

Bukhtawar mentioned this issue May 12, 2019

Change put mapping priority to URGENT #42105

Closed

DaveCTurner changed the title ~~Coordinating node can run out of memory due to failed put mappings call~~ Fail put-mapping requests sooner if they will exceed the field number limit May 12, 2019

DaveCTurner added the :Distributed Indexing/CRUD A catch all label for issues around indexing, updating and getting a doc by id. Not search. label May 12, 2019

danielmitterdorfer removed the :Core/Infra/Circuit Breakers Track estimates of memory consumption to prevent overload label Sep 6, 2019

DaveCTurner mentioned this issue Nov 1, 2019

Add preflight check to dynamic mapping updates #48817

Merged

DaveCTurner closed this as completed in #48817 Nov 5, 2019

DaveCTurner mentioned this issue Nov 5, 2019

Add preflight check to dynamic mapping updates #48867

Merged

This was referenced Feb 3, 2020

[meta] 7.6 release elastic/elasticsearch-net#4340

Closed

[meta] 7.6 release elastic/elasticsearch-net#4341

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fail put-mapping requests sooner if they will exceed the field number limit #35564

Fail put-mapping requests sooner if they will exceed the field number limit #35564

ppf2 commented Nov 14, 2018

elasticmachine commented Nov 14, 2018

dakrone commented Nov 14, 2018 •

edited

Loading

Bukhtawar commented Apr 18, 2019

Bukhtawar commented Apr 22, 2019

Bukhtawar commented May 7, 2019

DaveCTurner commented May 12, 2019

elasticmachine commented May 12, 2019

Fail put-mapping requests sooner if they will exceed the field number limit #35564

Fail put-mapping requests sooner if they will exceed the field number limit #35564

Comments

ppf2 commented Nov 14, 2018

elasticmachine commented Nov 14, 2018

dakrone commented Nov 14, 2018 • edited Loading

Bukhtawar commented Apr 18, 2019

Bukhtawar commented Apr 22, 2019

Bukhtawar commented May 7, 2019

DaveCTurner commented May 12, 2019

elasticmachine commented May 12, 2019

dakrone commented Nov 14, 2018 •

edited

Loading