shouldDrainReadBuffer followed up by readDisable(true) and readDisable(false) strange behaviour #12304
I think I see the bug in ConnectionImpl::readDisable(false): the " && read_buffer_.length() > 0" should be removed from the resumption condition.
But this should be an older bug. The issue is that with SSL we can't assume that resetting the fd mask by calling file_event_->setEnabled(Event::FileReadyType::Read | Event::FileReadyType::Write); will result in correct resumption in all cases, since there may still be bytes to read in SSL's internal buffers. A rough sketch of this path is below.
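For context, here is a rough paraphrase of the resumption path under discussion, assuming the v1.14-era shape of ConnectionImpl::readDisable; member names only approximate Envoy's code and this is not literal source:

```cpp
// Paraphrased read-resumption path (not literal v1.14.4 source).
void ConnectionImpl::readDisable(bool disable) {
  if (disable) {
    ++read_disable_count_;
    file_event_->setEnabled(Event::FileReadyType::Write);
    return;
  }
  --read_disable_count_;
  // Re-arm the fd for read and write events.
  file_event_->setEnabled(Event::FileReadyType::Read | Event::FileReadyType::Write);
  // Resumption condition in question: a read event is only forced when
  // Envoy's own buffer still holds data. Bytes buffered inside the SSL
  // library are invisible here, so if read_buffer_ is empty and the fd is
  // already drained, read never resumes.
  if (read_buffer_.length() > 0) {
    file_event_->activate(Event::FileReadyType::Read);
  }
}
```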
Commit 77cca6b is not in v1.14.4, so this should be the older bug.
/assign @antoniovicente
/backport
Thinking about this one some more, I think that we could consider this a bug in SslSocket::doRead instead of ConnectionImpl::readDisable. SslSocket::doRead does not check for additional bytes in SSL_read's internal buffers by calling SSL_pending and doing additional SSL_read calls after callbacks_->shouldDrainReadBuffer() returns true. The consequence is that the SSL connection is left in a state where it is able to generate additional bytes during a future SslSocket::doRead call even if the underlying fd's read buffer is fully drained. The other two publicly available implementations of TransportSocket (i.e. raw sockets and ALTS) have the property that doRead can't make progress without additional read bytes from the socket. Fixing SslSocket::doRead should be straightforward; a minimal sketch of that direction follows. It is less clear whether readDisable(false) should activate(Read) in cases where the read buffer is empty as a way to simplify the transport socket contract. I don't know what non-OSS, private transport socket implementations exist.
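A minimal sketch of that direction, assuming a simplified SslSocket::doRead; the member names (ssl_, callbacks_), the chunked SSL_read loop, and the elided error handling are approximations rather than the actual Envoy patch:

```cpp
// Sketch: keep draining decrypted bytes that the SSL object already holds
// (SSL_pending) before returning, so read resumption never depends on
// another fd-level read event. Not the actual Envoy change.
Network::IoResult SslSocket::doRead(Buffer::Instance& read_buffer) {
  uint64_t bytes_read = 0;
  bool keep_reading = true;
  while (keep_reading) {
    uint8_t chunk[16384];
    const int rc = SSL_read(ssl_.get(), chunk, sizeof(chunk));
    if (rc > 0) {
      read_buffer.add(chunk, static_cast<uint64_t>(rc));
      bytes_read += static_cast<uint64_t>(rc);
      if (callbacks_->shouldDrainReadBuffer()) {
        // The connection buffer hit its limit: request a synthetic read event
        // for later, but only stop once nothing decrypted is left inside the
        // SSL object; otherwise those bytes would be stranded there.
        callbacks_->setReadBufferReady();
        if (SSL_pending(ssl_.get()) == 0) {
          keep_reading = false;
        }
      }
    } else {
      // SSL_ERROR_WANT_READ / shutdown handling elided from this sketch.
      keep_reading = false;
    }
  }
  return {Network::PostIoAction::KeepOpen, bytes_read, false};
}
```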
I cherry-picked commit 77cca6b into our 1.14.4-based repo and it failed to fix the issue. But after removing the " && read_buffer_.length() > 0" condition,
as suggested by @antoniovicente, it seems to be working for now. I'll test this a bit more. Just wanted to add some details from our testing so far.
Thanks for confirming that more eagerly calling setReadBufferReady() helps. I'm going to try to continue working on a fix, with focus on making the SslSocket drain internal buffers on read so that resumption based on fd re-registration works correctly.
Sent out a PR with tests that repro the timeout while processing the last SSL record, and changes to SslSocket to ensure that bytes in internal buffers are drained out before returning from doRead. Sorry for the delays, too many other things to pay attention to. Hopefully this fix will be merged in time for 1.16, and backported to earlier releases if appropriate.
…on requests and replay them when re-enabling read. (#13772) (#14017)
Fixes SslSocket read resumption after readDisable when processing the SSL record that contains the last bytes of the HTTP message.
Risk Level: low
Testing: new unit and integration tests
Docs Changes: n/a
Release Notes: added
Platform Specific Features: n/a
Fixes #12304
Signed-off-by: Antonio Vicente <[email protected]>
Signed-off-by: Christoph Pakulski <[email protected]>
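A rough sketch of what "replay them when re-enabling read" could look like; the flag name transport_wants_read_, the method chosen, and the call sites are assumptions for illustration, not the literal patch:

```cpp
// Assumed shape of replaying a pending read request when read is re-enabled
// (names and call sites are approximations of ConnectionImpl).
void ConnectionImpl::setReadBufferReady() {
  // The transport socket still has readable data buffered internally;
  // remember that fact instead of relying on a one-shot activation.
  transport_wants_read_ = true;
  file_event_->activate(Event::FileReadyType::Read);
}

// ...and in readDisable(false), after re-enabling the fd events:
if (read_buffer_.length() > 0 || transport_wants_read_) {
  // Replay the resumption request even though Envoy's own buffer is empty.
  file_event_->activate(Event::FileReadyType::Read);
}
```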
Backported #13772 to releases 1.16, 1.15, 1.14 and 1.13. Removing backport/review label.
The behaviour I'm about to describe requires quite a lot of setup in order to reproduce.
To begin with I'm using v1.14.4.
My listeners use a reduced buffer limit.
The default value of 1MB just seemed way too big and opened up lots of opportunities for OOM, so I ended up using 16k.
Then I have a local SSL client in which data is sent in chunks, e.g. 400 bytes followed by 16500 bytes (e.g. HTTP headers and body); a hypothetical sketch of such a client follows.
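The following OpenSSL client only illustrates that write pattern; the address/port, the payload contents, and the omitted error handling are placeholders rather than the reporter's actual client, and a real reproduction would need a request matching the listener's configuration:

```cpp
// Hypothetical repro client: writes a ~400 byte "headers" chunk, pauses
// briefly, then writes a ~16500 byte "body" chunk over TLS.
#include <openssl/ssl.h>

#include <netdb.h>
#include <sys/socket.h>
#include <unistd.h>

#include <chrono>
#include <string>
#include <thread>

int main() {
  SSL_library_init();
  SSL_CTX* ctx = SSL_CTX_new(TLS_client_method());

  // Plain TCP connect to a local Envoy listener (address/port are made up).
  addrinfo hints{}, *res = nullptr;
  hints.ai_socktype = SOCK_STREAM;
  getaddrinfo("127.0.0.1", "10000", &hints, &res);
  const int fd = socket(res->ai_family, res->ai_socktype, res->ai_protocol);
  connect(fd, res->ai_addr, res->ai_addrlen);

  SSL* ssl = SSL_new(ctx);
  SSL_set_fd(ssl, fd);
  SSL_connect(ssl);

  // Chunk 1: ~400 bytes standing in for the HTTP headers.
  const std::string headers(400, 'H');
  SSL_write(ssl, headers.data(), static_cast<int>(headers.size()));
  std::this_thread::sleep_for(std::chrono::milliseconds(10));

  // Chunk 2: ~16500 bytes standing in for the body, just over the 16k limit,
  // so the last SSL record straddles the shouldDrainReadBuffer() threshold.
  const std::string body(16500, 'B');
  SSL_write(ssl, body.data(), static_cast<int>(body.size()));

  SSL_shutdown(ssl);
  SSL_free(ssl);
  close(fd);
  SSL_CTX_free(ctx);
  freeaddrinfo(res);
  return 0;
}
```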
What can be seen in the trace logs is that at some point Envoy stops reading from the downstream.
At that point either the client or the server times out: according to the client the entire request has been sent, while according to the server the entire request was not received.
To me it seems like the problem is the following sequence. When shouldDrainReadBuffer() returns true in SslSocket::doRead (in my case it returns true because the read buffer has hit the 16k limit), doRead calls setReadBufferReady(), which activates the read file event. That activation seems to immediately get eaten by the first readDisable(true), which re-registers the fd with read disabled. When the second readDisable(false) happens, the read buffer has already been drained, so the resumption condition is not met and the read event is not activated. A paraphrased sketch of the pieces involved follows.
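Here is a paraphrased sketch of setReadBufferReady() and the readDisable(true) half of that sequence, assuming the v1.14-era shape of ConnectionImpl; member names are approximations, not literal source, and the readDisable(false) side is in the sketch earlier in the thread:

```cpp
// setReadBufferReady() injects a synthetic read event; readDisable(true)
// re-registers the fd with only Write enabled, which discards that injected
// event. Member names approximate ConnectionImpl, not literal v1.14.4 code.
void ConnectionImpl::setReadBufferReady() {
  file_event_->activate(Event::FileReadyType::Read);
}

void ConnectionImpl::readDisable(bool disable) {
  if (disable) {
    ++read_disable_count_;
    // Only Write stays enabled from here on; the pending activate(Read)
    // queued by setReadBufferReady() never gets delivered.
    file_event_->setEnabled(Event::FileReadyType::Write);
    return;
  }
  // readDisable(false): see the earlier resumption-condition sketch. With
  // read_buffer_ already drained and the remaining bytes sitting inside the
  // SSL object, no read event is ever activated again.
}
```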
If readDisable(true/false) isn't triggered or the setReadBufferReady() code path is not executed, all is well.
I tried plain TCP and wasn't able to reproduce the issue (the raw buffer transport socket has a similar shouldDrainReadBuffer check in its doRead).
I wasn't able to reproduce the issue with an external connection either (maybe my network is slow).
The code snippets that I shared may be a few lines off from v1.14.4, as I have a few ERR_clear_error() calls added because I'm using OpenSSL, but looking at the behaviour I would not blame them or OpenSSL.
Unfortunately, I can't share either the client or the server code.
I can reliably reproduce the issue once every 5 minutes or so, so if anyone has a patch in mind I'll be willing to try it out. In the meantime, I'll see if I can come up with something.