Speed up toXContent Collection Serialization in some Spots #78742

original-brownbear · 2021-10-06T09:08:36Z

Found this when benchmarking large cluster states. When serializing collections we'd mostly
not take any advantage of what we know about the collection contents (like we do in StreamOutput).
This PR adds a couple of helpers to the x-content-builder similar to what we have on StreamOutput
to allow for faster serializing by avoiding the writer lookup and some self-reference checks and uses them
in obvious spots I could quickly identify in profiling or via the IDE.

relates #77466
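To make the description above concrete, here is a minimal sketch of the difference between a generic collection path and a typed helper. `ToyBuilder`, its `WRITERS` table, and its output format are hypothetical stand-ins written for illustration; this is not the real `XContentBuilder` code, and it skips string escaping and null handling entirely. Only the method name `stringListField` and the `field(String, Iterable<?>)` shape are taken from this PR.

```java
import java.util.Collection;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.BiConsumer;

/** Toy builder (hypothetical, not XContentBuilder): contrasts the two code paths. */
final class ToyBuilder {
    // Generic path: one writer per runtime class, looked up for every element.
    private static final Map<Class<?>, BiConsumer<StringBuilder, Object>> WRITERS = new HashMap<>();
    static {
        WRITERS.put(String.class, (sb, v) -> sb.append('"').append(v).append('"'));
        WRITERS.put(Integer.class, (sb, v) -> sb.append(v));
    }

    private final StringBuilder out = new StringBuilder();

    /** Generic: element types are unknown, so each value costs a map lookup. */
    ToyBuilder field(String name, Iterable<?> values) {
        out.append('"').append(name).append("\":[");
        boolean first = true;
        for (Object value : values) {
            if (!first) {
                out.append(',');
            }
            first = false;
            BiConsumer<StringBuilder, Object> writer = WRITERS.get(value.getClass());
            if (writer == null) {
                throw new IllegalArgumentException("no writer for " + value.getClass());
            }
            writer.accept(out, value);
        }
        out.append(']');
        return this;
    }

    /** Typed helper: the element type is known statically, so no lookup is needed. */
    ToyBuilder stringListField(String name, Collection<String> values) {
        out.append('"').append(name).append("\":[");
        boolean first = true;
        for (String value : values) {
            if (!first) {
                out.append(',');
            }
            first = false;
            out.append('"').append(value).append('"');
        }
        out.append(']');
        return this;
    }

    @Override
    public String toString() {
        return out.toString();
    }

    public static void main(String[] args) {
        List<String> nodes = List.of("node-1", "node-2");
        String generic = new ToyBuilder().field("nodes", nodes).toString();
        String typed = new ToyBuilder().stringListField("nodes", nodes).toString();
        System.out.println(generic);               // "nodes":["node-1","node-2"]
        System.out.println(typed.equals(generic)); // true
    }
}
```

Both paths produce the same output; the typed one simply does less work per element, which is the kind of saving the description refers to.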


elasticmachine · 2021-10-06T09:08:40Z

Pinging @elastic/es-distributed (Team:Distributed)

elasticmachine · 2021-10-06T09:08:40Z

Pinging @elastic/es-core-infra (Team:Core/Infra)

DaveCTurner

LGTM

How much does this save us out of interest?

tlrx

LGTM

tlrx · 2021-10-06T10:46:39Z

libs/x-content/src/main/java/org/elasticsearch/common/xcontent/XContentBuilder.java

+
+    public XContentBuilder xContentList(String name, ToXContent... values) throws IOException {
+        startArray(name);
+        for (ToXContent value : values) {


Maybe check for non null array here

Huh, right, TIL. I always figured I'd get [null] here if passed a null, but it turns out you actually get the null :)
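The varargs behaviour discussed in this thread can be reproduced in isolation. The sketch below uses a hypothetical `describe(String...)` method rather than the `xContentList` helper from the diff, but the language rule is the same for any varargs parameter.

```java
final class VarargsNullDemo {
    /** Reports whether the varargs array itself is null, or how many elements it has. */
    static String describe(String... values) {
        if (values == null) {
            return "null array";
        }
        return values.length + " element(s)";
    }

    public static void main(String[] args) {
        // A bare `null` argument is taken as the array itself (javac warns about the
        // inexact argument type); the cast below spells that resolution out explicitly.
        System.out.println(describe((String[]) null)); // null array
        // Casting to the element type forces a one-element array containing null.
        System.out.println(describe((String) null));   // 1 element(s)
        // No arguments at all yields an empty, non-null array.
        System.out.println(describe());                // 0 element(s)
    }
}
```

This is why a loop over `values` without a null check can throw a `NullPointerException` when a caller passes a plain `null`.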

tlrx · 2021-10-06T10:49:36Z

libs/x-content/src/main/java/org/elasticsearch/common/xcontent/XContentBuilder.java

@@ -921,6 +923,42 @@ private XContentBuilder value(ToXContent value, ToXContent.Params params) throws
    // Maps & Iterable
    //////////////////////////////////

+    public XContentBuilder stringListField(String name, Collection<String> values) throws IOException {


Nit: maybe just `array(String name, Collection<String> values)`? Same remark for the other new methods.

Can't do that unfortunately because we have:

public XContentBuilder field(String name, Iterable<?> values) throws IOException { return field(name).value(values); }

which collides with it. Same with the other cases, these super generic ? or Object type methods collide with everything.

Oh right, I did not see this one. Let's keep it as it is then.


original-brownbear · 2021-10-06T12:15:45Z

Thanks David & Tanguy!

@DaveCTurner

> How much does this save us out of interest?

It's not entirely trivial to isolate the effect because this changes the profile quite a bit and we seemingly also see inlining in more spots now, but I'd say we're at roughly O(5%) savings. I also have a follow-up based on this in the pipeline that should have a bigger impact :)


dliappis · 2021-10-07T10:04:10Z

@original-brownbear is this related to #26907 as well?

original-brownbear · 2021-10-07T10:06:16Z

@dliappis yea to some degree for sure. A good chunk of the speedup here is from simply not having to do the self-reference check now in the changed scenarios.
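For readers unfamiliar with the self-reference check mentioned here, the following is a rough sketch of what such a check involves. It is an assumption-laden illustration, not the Elasticsearch implementation: the class name, method names, and exception message are made up, and it only handles `Map` and `Iterable` containers.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.IdentityHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

final class SelfReferenceCheck {
    /** Throws if a Map or Iterable contains itself, directly or transitively. */
    static void ensureNoSelfReferences(Object value) {
        // Identity-based set: compares by reference and never calls hashCode()
        // on the (possibly cyclic) containers.
        Set<Object> ancestors = Collections.newSetFromMap(new IdentityHashMap<>());
        check(value, ancestors);
    }

    private static void check(Object value, Set<Object> ancestors) {
        Iterable<?> children;
        if (value instanceof Map) {
            children = ((Map<?, ?>) value).values();
        } else if (value instanceof Iterable) {
            children = (Iterable<?>) value;
        } else {
            return; // leaf value, nothing to descend into
        }
        if (!ancestors.add(value)) {
            throw new IllegalArgumentException("container references itself");
        }
        for (Object child : children) {
            check(child, ancestors); // every element is visited, even plain strings
        }
        ancestors.remove(value);
    }

    public static void main(String[] args) {
        List<Object> cyclic = new ArrayList<>();
        cyclic.add(cyclic);
        try {
            ensureNoSelfReferences(cyclic);
        } catch (IllegalArgumentException e) {
            System.out.println("cycle detected");
        }
        // A flat list of strings can never be cyclic, yet the check still walks it.
        ensureNoSelfReferences(List.of("a", "b", "c"));
    }
}
```

A helper that takes `Collection<String>` knows up front that no element can be a container, so a walk like this can be skipped, which matches the saving described in the comment above.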

original-brownbear added 2 commits October 6, 2021 10:39

nicer

ef34e75

original-brownbear added the :Core/Infra/Core, :Distributed Coordination/Cluster Coordination, v8.0.0, and v7.16.0 labels Oct 6, 2021

elasticmachine added the Team:Core/Infra and Team:Distributed (Obsolete) labels Oct 6, 2021

fix

219f9f7

original-brownbear requested review from tlrx and DaveCTurner October 6, 2021 10:15

DaveCTurner approved these changes Oct 6, 2021


tlrx approved these changes Oct 6, 2021


original-brownbear added 2 commits October 6, 2021 12:54

Merge remote-tracking branch 'elastic/master' into faster-serialize-collections

73e94bc

null checks

0552604

original-brownbear merged commit 3164885 into elastic:master Oct 6, 2021

original-brownbear deleted the faster-serialize-collections branch October 6, 2021 12:15

original-brownbear mentioned this pull request Oct 6, 2021

Speed up toXContent Collection Serialization in some Spots (#78742) #78755

Merged

original-brownbear mentioned this pull request Oct 7, 2021

Fix Large Shard Count Scalability Issues #77466

Open

97 tasks

jakelandis added v8.0.0-beta1 and removed v8.0.0 labels Oct 27, 2021

danhermann added the >non-issue label Dec 3, 2021

original-brownbear restored the faster-serialize-collections branch April 18, 2023 21:05
