DTLS error when using 4096 key/cert #252

saghul · 2015-06-04T11:50:15Z

I'm getting this:

[1675940553] DTLS timeout on component 1 of stream 1, retransmitting
[WARN] [1675940553] The DTLS stack is trying to send a packet of 2236 bytes, this may be larger than the MTU and get dropped!

This started after switching to a 4096-bit key to avoid #251.

Eventually all components time out and fail.

saghul · 2015-06-04T12:16:14Z

I checked and 2048 has the same problem. 1024 certs work well.

lminiero · 2015-06-04T13:16:35Z

That's a well known issue, and has been discussed several times on the google group. The reason for this is that the DTLS stack in OpenSSL does not fragment packets that exceed the MTU when needed, when a BIO is used instead of UDP directly.

Normally, the DTLS stack should try sending the "huge" packet and, when the timeout fires (because the message never reached the other side), fragment the original packet into smaller ones and send those instead. This apparently only works when you use the DTLS stack over UDP directly, that is, when OpenSSL has more control over the transport. When a BIO is used, as in Janus, where we need to handle the transport ourselves (libnice), this never happens. The message containing the too-large certificate is dropped somewhere in the network and never reaches the destination, which leads to a handshake failure.

I tried investigating ways to fix this and force the right behaviour somehow, but never managed to get it to work as expected. As such, the only solution as of now is to rely on "smaller" certificates. Hopefully someone will find the proper solution in the near future. I'm not sure, for instance, whether BoringSSL handles this properly, as I never managed to use it as a stack in place of OpenSSL.

I was documenting this guideline in an additional .md file in the certs folder, also to account for the feedback in #251. It will basically say that yes, as of now, certificates must be 1024 bits. Any additional text or clarification (or even example if you have any) is more than welcome!

saghul · 2015-06-04T13:19:22Z

> That's a well known issue, and has been discussed several times on the google group. The reason for this is that the DTLS stack in OpenSSL does not fragment packets that exceed the MTU when needed, when a BIO is used instead of UDP directly.

Oh, sorry about that! I looked through the open issues and didn't find anything related. Will search the group next time!

> I was documenting this guideline in an additional .md file in the certs folder, also to account for the feedback in #251. It will basically say that yes, as of now, certificates must be 1024 bits. Any additional text or clarification (or even example if you have any) is more than welcome!

Cool, I'll have a look once it lands!

lminiero · 2015-06-04T15:27:26Z

To add some detail to what I explained in my previous posts: the DTLS stack in OpenSSL does in fact take care of fragmenting the packets according to what it assumes the MTU to be (1472 by default). The problem is that the mem BIO ignores that fragmentation info completely, so when you do a BIO_read, it hands the whole message to the application anyway. The whole buffer is then passed to nice_agent_send, which is the same as not fragmenting at all. You can verify this by using, e.g., a 4096-bit certificate and capturing the DTLS traffic with Wireshark: you'll see that the packet is recognized as containing not only multiple messages, but also fragments.

My guess is that the mem BIO simply isn't smart enough to inspect the actual messages being transported: it probably doesn't care whether it's DTLS, TLS or anything else, and just acts as an opaque transport. So when the internal stack writes the fragmented packets in a bunch, a bunch is what you get when you read the pending data to send. Not sure if this means we'll have to inspect the payload ourselves, e.g., do a BIO_read, process the packet to see if there are fragments (length+offset), and if so send each of them separately through libnice. That would probably do it, although it sounds a bit silly that the application is required to do so, especially considering that the application is not supposed to be aware of the protocol specifics in the first place (that's usually why you rely on a library).
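To make the "inspect the payload ourselves" idea concrete, here is a rough sketch in Python (not Janus code, and not what was eventually merged). It assumes the buffer read from the BIO is a plain concatenation of DTLS records, each starting with the standard 13-byte record header, and it splits on record boundaries only, without looking at the handshake fragment length+offset fields. `split_dtls_records` and `make_record` are hypothetical names used only for this illustration, and the payload sizes are made up.

```python
import struct
from typing import List

# DTLS record header: type(1) + version(2) + epoch(2) + sequence(6) + length(2)
RECORD_HEADER_LEN = 13
LENGTH_FIELD_OFFSET = 11


def split_dtls_records(buf: bytes) -> List[bytes]:
    """Split a buffer of back-to-back DTLS records into individual records."""
    records = []
    offset = 0
    while offset < len(buf):
        if len(buf) - offset < RECORD_HEADER_LEN:
            raise ValueError("truncated DTLS record header")
        # The last two header bytes hold the payload length (big endian).
        (length,) = struct.unpack_from("!H", buf, offset + LENGTH_FIELD_OFFSET)
        end = offset + RECORD_HEADER_LEN + length
        if end > len(buf):
            raise ValueError("truncated DTLS record body")
        records.append(buf[offset:end])
        offset = end
    return records


def make_record(payload: bytes, seq: int) -> bytes:
    """Hypothetical helper: wrap a payload in a handshake record header
    (content type 22, DTLS 1.0 version bytes 0xFEFF, epoch 0)."""
    header = struct.pack("!BHH", 22, 0xFEFF, 0)
    header += seq.to_bytes(6, "big")
    header += struct.pack("!H", len(payload))
    return header + payload


# Two records written in a bunch, as a byte-stream buffer would return them.
coalesced = make_record(b"a" * 1200, 0) + make_record(b"b" * 1000, 1)
datagrams = split_dtls_records(coalesced)
```

Each element of `datagrams` would then be sent with its own nice_agent_send call instead of passing the whole buffer at once.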

lminiero · 2015-06-05T09:22:47Z

I asked about this on the OpenSSL mailing list, and I already received useful feedback:

https://mta.openssl.org/pipermail/openssl-users/2015-June/001503.html

They basically confirm that the mem BIO does not have enough knowledge to handle this, specifically about datagram semantics. One suggestion they made is to write a BIO filter that wraps the mem BIO in order to handle fragmentation automatically. I'll try to do that ASAP.
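For anyone wondering what "datagram semantics" means in practice, here is a toy model in Python. It is only a sketch of the general idea of wrapping a byte-stream buffer so that write boundaries survive, and it makes no claim about how an actual OpenSSL BIO filter is implemented. `ByteStreamBuffer` and `DatagramBuffer` are hypothetical names, and the write sizes are invented.

```python
from collections import deque


class ByteStreamBuffer:
    """Behaves like a plain memory buffer: writes are concatenated
    and the boundaries between them are lost."""

    def __init__(self):
        self._data = bytearray()

    def write(self, chunk: bytes) -> None:
        self._data += chunk

    def read(self, max_len: int) -> bytes:
        out = bytes(self._data[:max_len])
        del self._data[:max_len]
        return out


class DatagramBuffer:
    """Wraps the byte-stream buffer and remembers the size of each
    write, so every read returns exactly one original write."""

    def __init__(self):
        self._stream = ByteStreamBuffer()
        self._sizes = deque()

    def write(self, chunk: bytes) -> None:
        self._stream.write(chunk)
        self._sizes.append(len(chunk))

    def pending(self) -> int:
        # Size of the next datagram only, not of everything buffered.
        return self._sizes[0] if self._sizes else 0

    def read(self) -> bytes:
        if not self._sizes:
            return b""
        return self._stream.read(self._sizes.popleft())


# Two fragments written by the stack (sizes are illustrative).
plain = ByteStreamBuffer()
wrapped = DatagramBuffer()
for fragment in (b"x" * 1200, b"y" * 1036):
    plain.write(fragment)
    wrapped.write(fragment)

whole = plain.read(4096)   # one oversized blob
first = wrapped.read()     # first fragment only
second = wrapped.read()    # second fragment only
```

With the plain buffer the application gets a single blob larger than the MTU, while the wrapper hands back the fragments one at a time, each small enough to send as its own packet.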

lminiero · 2015-06-05T16:08:43Z

@saghul can you check if #254 works for you?

saghul · 2015-06-05T17:29:22Z

Great, will check it!

lminiero pushed a commit that referenced this issue Jun 5, 2015

Implemented new OpenSSL BIO filter to fix fragmentation issue in DTLS…

58409e8

… on large certificates (see #252)

lminiero closed this as completed Jun 8, 2015

amnonbb mentioned this issue Sep 17, 2015

DTLS1_READ_BYTES:tlsv1 alert decrypt error #136

Closed

zucher mentioned this issue Feb 10, 2023

DTLS Bio read buffer is to small and not compatible with ecdsa-with-SHA256 key/cert with CA aiortc/aiortc#828

Closed
