Chunked Encoding for NodeStatsResponse #90097

original-brownbear · 2022-09-15T12:12:13Z

Turn this into a chunked response to some degree.
Only chunks per node for now, since deeper chunking needs larger changes downstream that don't fit in well with the current API.
The problem is that the "level" parameter that controls whether or not we return the very large indices or shard level responses is an x-content param so we don't have it when creating the iterator. I'd address this in a follow-up that changes the API a little.

As a result, I did not add a test here that validates the chunk count since I'd like to do more work on this anyway. I think it's a valuable change in its current form already and introduces a parent class that allows for turning other APIs into chunked encoding also.
For example, the indices level response for node stats in a 25k indices cluster across 6 data nodes is currently ~120M (and that is without pretty or human!). Without this change, each hit of the indices stats API will cause the coordinating node to allocate 120M for the response. With this change, we will only allocate ~20M for sending the same response. Serializing those 20M on the transport thread should be a non-issue from some quick benchmarking as even serializing the full 120M seems to well under one second.

relates #89838

Turn this into a chunked response to some degree. Only chunks per node for now, since deeper chunking needs larger changes downstream that don't fit in well with the current API.

elasticsearchmachine · 2022-09-15T12:12:37Z

Pinging @elastic/es-distributed (Team:Distributed)

DaveCTurner

LGTM

original-brownbear · 2022-09-20T07:52:56Z

Thanks David!

Dongzhenpu · 2023-02-21T13:54:44Z

Hi there, I found that after 8.5(the version that this commit was merged into), the nodes/stats api with level=abc query params will raise exception instead of return the error message in versions before 8.5

# version after 8.5
Error: socket hang up

# version before 8.5
{
  "error" : {
    "root_cause" : [
      {
        "type" : "illegal_argument_exception",
        "reason" : "level parameter must be one of [cluster] or [indices] or [shards] but was [abc]"
      }
    ],
    "type" : "illegal_argument_exception",
    "reason" : "level parameter must be one of [cluster] or [indices] or [shards] but was [abc]",
    "suppressed" : [
      {
        "type" : "illegal_state_exception",
        "reason" : "Failed to close the XContentBuilder",
        "caused_by" : {
          "type" : "i_o_exception",
          "reason" : "Unclosed object or array found"
        }
      }
    ]
  },
  "status" : 400
}

I see Armin Braun said that

The problem is that the "level" parameter that controls whether or not we return the very large indices or shard level responses is an x-content param so we don't have it when creating the iterator. I'd address this in a follow-up that changes the API a little.

Is this an unexpected exception to this commit or is it a bug?

DaveCTurner · 2023-02-21T14:40:16Z

I think this is a bug, although it's just one instance of a much more general problem. I opened #93981.

Chunked Encoding for NodeStatsResponse

1c426a8

Turn this into a chunked response to some degree. Only chunks per node for now, since deeper chunking needs larger changes downstream that don't fit in well with the current API.

original-brownbear added >non-issue :Distributed Coordination/Network Http and internode communication implementations v8.5.0 labels Sep 15, 2022

elasticsearchmachine added the Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. label Sep 15, 2022

original-brownbear requested review from DaveCTurner and henningandersen September 19, 2022 13:17

DaveCTurner approved these changes Sep 20, 2022

View reviewed changes

original-brownbear merged commit 5d784d6 into elastic:main Sep 20, 2022

original-brownbear deleted the chunked-nodes-stats branch September 20, 2022 07:53

original-brownbear mentioned this pull request Sep 20, 2022

Make use of chunked REST response infrastructure in more APIs #89838

Open

19 tasks

DaveCTurner mentioned this pull request Feb 21, 2023

Deeper chunking of node stats response #93985

Closed

original-brownbear restored the chunked-nodes-stats branch April 18, 2023 21:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chunked Encoding for NodeStatsResponse #90097

Chunked Encoding for NodeStatsResponse #90097

original-brownbear commented Sep 15, 2022

elasticsearchmachine commented Sep 15, 2022

DaveCTurner left a comment

original-brownbear commented Sep 20, 2022

Dongzhenpu commented Feb 21, 2023

DaveCTurner commented Feb 21, 2023

Chunked Encoding for NodeStatsResponse #90097

Chunked Encoding for NodeStatsResponse #90097

Conversation

original-brownbear commented Sep 15, 2022

elasticsearchmachine commented Sep 15, 2022

DaveCTurner left a comment

Choose a reason for hiding this comment

original-brownbear commented Sep 20, 2022

Dongzhenpu commented Feb 21, 2023

DaveCTurner commented Feb 21, 2023