Use 16-bit unsigned integers in ssl_srv.c #3238

irwir · 2020-04-24T11:45:45Z

Description

This continues what was done in already merged PR #2856 and PR #3150 - use shorter integers if higher bits were never used.

Additional tiny improvements/optimizations: two cases of moving around common expressions (at lines 1440 and 1509 in the new code) and corrected type cast in line 2228.

Status

READY

Requires Backporting

NO
Which branch?
development

Migrations

NO

danh-arm · 2020-04-27T11:15:48Z

Hi @irwir. Thanks for this but the PR builds with warnings which are treated as errors in some configs:

ssl_srv.c: In function ‘ssl_parse_servername_ext’:
ssl_srv.c:103:34: warning: comparison between signed and unsigned integer expressions [-Wsign-compare]
     if( servername_list_size + 2 != len )
                                  ^
ssl_srv.c: In function ‘ssl_parse_signature_algorithms_ext’:
ssl_srv.c:265:31: warning: comparison between signed and unsigned integer expressions [-Wsign-compare]
     if( sig_alg_list_size + 2 != len ||
                               ^
ssl_srv.c: In function ‘ssl_parse_supported_elliptic_curves’:
ssl_srv.c:339:23: warning: comparison between signed and unsigned integer expressions [-Wsign-compare]
     if( list_size + 2 != len ||

Also, this will need a changelog entry.

These changes continue what was done in PR Mbed-TLS#2856 and Mbed-TLS#3150 - use shorter integers if higher bits were never used. Additionally here are two cases of moving around common expressions and a better type cast (line 2228). Signed-off-by: irwir <[email protected]>

irwir · 2020-04-27T12:03:00Z

Thanks, @danh-arm.

the PR builds with warnings which are treated as errors in some configs:

Now the issues should be fixed.
Parenthses in the line 1704 were inconsistent with the rest of the file.

Also, this will need a changelog entry.

Done, please see if it needs improvements.

danh-arm

CI passed (failures not relevant) but don't think this is ready yet.

ChangeLog.d/changes.txt

danh-arm · 2020-04-27T15:14:44Z

library/ssl_srv.c

@@ -100,7 +100,7 @@ static int ssl_parse_servername_ext( mbedtls_ssl_context *ssl,
        return( MBEDTLS_ERR_SSL_BAD_HS_CLIENT_HELLO );
    }
    servername_list_size = ( ( buf[0] << 8 ) | ( buf[1] ) );
-    if( servername_list_size + 2 != len )
+    if( (size_t) servername_list_size + 2 != len )


Changing the type of servername_list_size doesn't really make sense if you have to then do a cast. You can't change the type of len since it's in the function spec so it's not worth changing this instance.

The key to this change is the previous line - two bytes make a 16-bit unsigned integer only; no additional space is required while size_t could be 32 or 64 bits.
As for type casts - you pointed out this issue previously, and its was necessary to suppress the compiler warning.
This cast should not generate additional code, because internally compiler would do about the same - expand 16 bits to integer size, and to size_t to compare.

It wasn't me who pointed that out previously but anyway, in that earlier PR the cast was from a larger type to a smaller one whereas here you're casting to a larger type, which presumably will undo any code size reduction by declaring the smaller type? Have you actually demonstrated this improves code size in some cases?

here you're casting to a larger type

The existing code has a cast to a larger type too, an implicit one:
servername_list_size = (size_t)( ( buf[0] << 8 ) | ( buf[1] ) );

Right. I'm not so much concerned about whether the cast is implicit or explicit. I'm more concerned the change won't improve the code, whether it's readability or code size or something else.

(I haven't followed all the discussions in this PR) Code size at this micro level is very dependent on the target architecture as well as the compiler version and the surrounding code. It can happen that using a 16-bit variable to store a 16-bit value is a win if the variable has its own register, but a loss if the variable is spilled to memory, which is highly sensitive to how many registers are available at this point in the code and how the compiler uses them.

x86_64 code size is not that great an indicator because it's a 64-bit platform and code size is usually not an issue there. When I'm concerned about code size, I do a Cortex-M0+ build: that's the second-smallest Arm platform, and the smallest (Cortex-M0).

make CC=arm-none-eabi-gcc AR=arm-none-eabi-ar LD=arm-none-eabi-ld size library/*.o

About this specific line, a way to avoid a cast is

if( servername_list_size + 2u != len )

I'd prefer to avoid a cast because a cast tells the reader “watch out, something fancy is happening here”, but nothing remarkable is happening on this line: there's no risk of overflow or truncation. The reason for the warning in 919af9b is that an unsigned 16-bit type got widened to int (because it fit) which then got wrapped around to a size_t, and the compiler analyzed the wrapped-around part independently from the widening part and is telling us that negative values could suffer, but thanks to the first part we know that negative values are impossible. All of which the reader shouldn't have to care about, because the only reason negative numbers started popping up was the use of a type (uint16_t) that's smaller than int and C's promotion rules kicking in.

@gilles-peskine-arm
Thanks for the idea; I made these changes and eliminated type casts.
On the subject of code size, it might be interesting if you could find time and check what it gets for these Cortex processor. Just for this single type change at line 90 to uint16_t.
For x86 and x64 code optimizer in MSVC is not doing a great job, and code with size_t was shorter.

Compiler:

arm-none-eabi-gcc (GNU Tools for Arm Embedded Processors 7-2018-q3-update) 7.3.1 20180622 (release) [ARM/embedded-7-branch revision 261907]

Flags: -Os -mthumb -mcpu=cortex-m0plus
Library configuration: scripts/config.py baremetal

Code size of ssl_srv.o (text column of size library/ssl_srv.o):

Originally (5cac337): 20102

Current version of this PR (a7025a2): 20102

Current version, but with size_t instead of uint16_t at line 90: 20106

I haven't analyzed the generated assembly.

Thanks for the test.
Your numbers together with my data confirm that the difference is small and compiler-specific. Hence, the code size should be of a minor importance to prefer size_t to uint16_t or vice versa.

By the way, there was no All checks have passed messages for a long time. Good to see it fixed.

danh-arm · 2020-04-27T15:15:49Z

library/ssl_srv.c

@@ -262,7 +262,7 @@ static int ssl_parse_signature_algorithms_ext( mbedtls_ssl_context *ssl,
        return( MBEDTLS_ERR_SSL_BAD_HS_CLIENT_HELLO );
    }
    sig_alg_list_size = ( ( buf[0] << 8 ) | ( buf[1] ) );
-    if( sig_alg_list_size + 2 != len ||
+    if( (size_t) sig_alg_list_size + 2 != len ||


Ditto - this instance not worth fixing

The reasons are the same as above.

danh-arm · 2020-04-27T15:16:14Z

library/ssl_srv.c

@@ -336,7 +336,7 @@ static int ssl_parse_supported_elliptic_curves( mbedtls_ssl_context *ssl,
        return( MBEDTLS_ERR_SSL_BAD_HS_CLIENT_HELLO );
    }
    list_size = ( ( buf[0] << 8 ) | ( buf[1] ) );
-    if( list_size + 2 != len ||
+    if( (size_t) list_size + 2 != len ||


Ditto - this instance not worth fixing

Once again, the same as above.

danh-arm · 2020-04-27T15:23:46Z

library/ssl_srv.c

@@ -3630,7 +3630,7 @@ static int ssl_decrypt_encrypted_pms( mbedtls_ssl_context *ssl,
    int ret = MBEDTLS_ERR_ERROR_CORRUPTION_DETECTED;
    mbedtls_pk_context *private_key = mbedtls_ssl_own_key( ssl );
    mbedtls_pk_context *public_key = &mbedtls_ssl_own_cert( ssl )->pk;
-    size_t len = mbedtls_pk_get_len( public_key );
+    uint16_t len = (uint16_t) mbedtls_pk_get_len( public_key );


Ditto - this instance not worth fixing

Just a little below, at line 3657 the len variable is used as 16-bit unsigned integer and therefore we would never need the whole size_t.
Yet again, type cast is only necessary to suppress a compiler warning.

Here, I accept the cast to a smaller type is OK, assuming the length is always represented in less than 16-bits. Even so, it could do with a comment like the earlier PR referring to a spec that validates this assumption.

a comment like the earlier PR

This is interesting. A value could be silently truncated to two bytes, but explicit type cast always raised questions.
According to comments, the returned value is the size of public_key in bytes. 16 bits would allow 2^19-1 bits for a key.
Would you mind suggesting a good comment line?

Hmm, I'm not the best person to suggest something here. @mpg - can you suggest a comment to use? Presumably we can assume something about the TLS buffer structure format?

At this point, it isn't about TLS, it's about what mbedtls_pk_get_len can return. The maximum is MBEDTLS_MPI_MAX_SIZE, reached for an RSA key, and that value is 1024 by default and we say “Maximum tested: 2048 bytes” in include/mbedtls/config.h. That's RSA-16384 which is already slower than most people's patience.

Until post-quantum crypto changes the deal, I'm perfectly happy with assuming that key sizes in bits (let alone in bytes) fit in 16 bits. There's other code in the library that makes this assumption. I'm ok with not checking this assumption in TLS code.

irwir · 2020-04-27T16:42:52Z

Thanks for reviewing.
The general idea is to treat most of the cases where 2 bytes are read or written in the same style.
Additional type casts were necessary for suppressing compiler warnings.
That was partially implemented in mentioned PRs that were already merged, and used type casts too for the same reasons.
Depending on compiler and processor architecture, this might produce slightly shorter code and decrease the amount of used memory.

danh-arm · 2020-04-29T16:42:35Z

library/ssl_srv.c

@@ -1700,8 +1700,8 @@ static int ssl_parse_client_hello( mbedtls_ssl_context *ssl )
             | ( buf[ciph_offset + 1]      );

    if( ciph_len < 2 ||
-        ciph_len + 2 + ciph_offset + 1 > msg_len || /* 1 for comp. alg. len */
-        ( ciph_len % 2 ) != 0 )
+        (size_t) ciph_len + 2 + ciph_offset + 1 > msg_len || /* 1 for comp. alg. len */


Won't the cast to a larger type here undo the saving by declaring a smaller type?

gilles-peskine-arm · 2020-04-30T20:10:58Z

What is the goal of this change? If it's about code size optimization, the results are inconclusive, but probably below the threshold of being worth the effort.

hanno-becker · 2020-05-01T07:36:52Z

I agree with @gilles-peskine-arm. While there can be a benefit in fine tuning the choice of integer type, I believe that if this is to be attempted, one should consider a general approach that applies throughout the library, and which also allows simple change of types for experimentation.

I played with this in the prototype MPS implementation, e.g. here by introducing typedefs for various kinds of integers and instantiating it with (a) natural machine width, (b) minimal width, e.g. uint8_t, (c) optimal type with required width, e.g. uint8fast_t, and I think the differences were quite minor, but I'd need to check again to be sure.

Signed-off-by: irwir <[email protected]>

irwir · 2020-05-01T12:30:35Z

What is the goal of this change? If it's about code size optimization

Uniform approach was the goal, not code size.
I tried to reinforce this in the additional commit.

Signed-off-by: irwir <[email protected]>

irwir · 2020-05-08T11:52:55Z

An additional commit that modifies ssl_cli.c in the same way.

gilles-peskine-arm

As a general matter, using 16-bit types for 16-bit values is potentially risky because as soon as you add them, you need to check for overflow. In much of the code around the TLS protocols, we're lucky that sizes are limited by the protocol to 24 bits or 16 bits, so if we use 32-bit types to manipulate variables, we don't risk overflows. A buffer size should be stored in a 16-bit type only if there's a good reason for it.

Furthermore using types that are less wide than int is generally risky in C due to promotions. It's easy to miss that x + 0 might not actually be equivalent to x because a promotion snuck in. So generally smaller types should be avoided for anything that will be involved in arithmetic. There are a lot of places where uint8_t is the right type because it's the type of a byte. Uses for uint16_t are rarer. It can be the right type to indicate that a value is known to fit in 16 bits, for example when preparing to write it out as two bytes. But I don't think it's a good idea in parsing code.

Using types that are smaller than the word size is typically a small performance and code size hit in terms of register manipulations: it tends to require additional fetches, extensions and bitwise and. They can save memory bandwidth, but this only matters if there's a non-negligible amount of variables involved. So generally speaking uint16_t arrays or uint16_t in data structures which are likely to be instantiated many times are performance wins, but uint16_t local variables are bad for performance and code size.

For these reasons, I am very wary of this pull request, and would need a much stronger reason than “uniform approach” to justify it. Furthermore, the work and the review have already taken a fairly long time that I think would be better spent elsewhere.

I've recorded my opinion as “request changes” and not closed the PR because I'm open to having my mind changed. But as it stands I'm opposed to the objective of the PR, not just to specific issues in the code.

irwir · 2020-05-14T17:26:08Z

@gilles-peskine-arm
Thanks for the review.

As a general matter, using 16-bit types for 16-bit values is potentially risky because as soon as you add them, you need to check for overflow.

Generally any integer type with fixed number of bits can overflow or wrap around.

It was already pointed out, the existing code assumes that it was safe to manipulate size_t numbers and then put the result into 16 bits.
Does it mean there was presumably no potential risks of truncation even for malformed data in 64-bit architecture?
Absense of checks implies that the result always had to be correct.
But in that case, no issues should be possible with short integers.

Otherwise, the issue was always here, and shorter data type would replace a truncation error with an unsigned wraparound.
The 16-bit resulting value would be the same, by the way.

A buffer size should be stored in a 16-bit type only if there's a good reason for it.

The existing library code implements the pertinent RFCs so that the code reads and writes 16-bit lengths.
That is, 16 bits is the natural size here.
It creates no risks in reading 16 bits of data into 16-bit variable or in writing such variable into 16 bits.

There are a lot of places where uint8_t is the right type because it's the type of a byte.

These types could be the right ones only when there are no more than 8 meaningful bits of data.
In this PR uint16_t is used for 16-bit values, so it could be considered as the right.

danh-arm · 2020-05-27T16:26:02Z

Sorry but I'm going to close this PR. It's taken too much review bandwidth for little or no gain. Before accepting changes like this, we would need a clear cost/benefit analysis, considering changes across the whole codebase. If needed, the mailing list is the best place to continue this discussion.

As a general matter, using 16-bit types for 16-bit values is potentially risky because as soon as you add them, you need to check for overflow.

Generally any integer type with fixed number of bits can overflow or wrap around.

True but I think the point Gilles was making is that if code generally fails to check for overflow, moving to a shorter type is more likely to trigger problems, though not in this specific case.

danh-arm added the enhancement label Apr 27, 2020

danh-arm added the needs-work label Apr 27, 2020

irwir force-pushed the fix_ssl_srv2 branch from 6075f38 to 919af9b Compare April 27, 2020 11:46

danh-arm suggested changes Apr 27, 2020

View reviewed changes

irwir force-pushed the fix_ssl_srv2 branch from 22cce0b to a7025a2 Compare April 27, 2020 16:28

danh-arm reviewed Apr 29, 2020

View reviewed changes

danh-arm self-assigned this Apr 29, 2020

danh-arm requested a review from mpg April 30, 2020 14:14

irwir force-pushed the fix_ssl_srv2 branch from a7025a2 to 5fb814a Compare April 30, 2020 17:08

gilles-peskine-arm mentioned this pull request Apr 30, 2020

Rationalize Travis builds #3218

Merged

irwir added 2 commits May 1, 2020 15:22

Add changelog entry

f12a607

Signed-off-by: irwir <[email protected]>

Change more size_t values to uint8_t and uint16_t for uniform approach.

913ed95

Signed-off-by: irwir <[email protected]>

irwir force-pushed the fix_ssl_srv2 branch from 5fb814a to 913ed95 Compare May 1, 2020 12:26

Similar changes in ssl_cli.c

292e8c4

Signed-off-by: irwir <[email protected]>

gilles-peskine-arm requested changes May 14, 2020

View reviewed changes

mpg removed their request for review May 15, 2020 10:52

danh-arm closed this May 27, 2020

hanno-becker mentioned this pull request May 7, 2021

Review fix for ssl_new_session_ticket_parse hannestschofenig/mbedtls#257

Merged

4 tasks

irwir deleted the fix_ssl_srv2 branch October 6, 2023 14:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use 16-bit unsigned integers in ssl_srv.c #3238

Use 16-bit unsigned integers in ssl_srv.c #3238

irwir commented Apr 24, 2020

danh-arm commented Apr 27, 2020

irwir commented Apr 27, 2020

danh-arm left a comment

danh-arm Apr 27, 2020

irwir Apr 27, 2020

danh-arm Apr 29, 2020

irwir Apr 29, 2020

danh-arm Apr 30, 2020

gilles-peskine-arm Apr 30, 2020

irwir Apr 30, 2020

gilles-peskine-arm Apr 30, 2020

irwir May 1, 2020

irwir May 1, 2020

danh-arm Apr 27, 2020

irwir Apr 27, 2020

danh-arm Apr 27, 2020

irwir Apr 27, 2020

danh-arm Apr 27, 2020

irwir Apr 27, 2020

danh-arm Apr 29, 2020

irwir Apr 29, 2020

danh-arm Apr 30, 2020

gilles-peskine-arm Apr 30, 2020

irwir commented Apr 27, 2020

danh-arm Apr 29, 2020

gilles-peskine-arm commented Apr 30, 2020

hanno-becker commented May 1, 2020 •

edited

Loading

irwir commented May 1, 2020

irwir commented May 8, 2020

gilles-peskine-arm left a comment

irwir commented May 14, 2020

danh-arm commented May 27, 2020

Use 16-bit unsigned integers in ssl_srv.c #3238

Use 16-bit unsigned integers in ssl_srv.c #3238

Conversation

irwir commented Apr 24, 2020

Description

Status

Requires Backporting

Migrations

danh-arm commented Apr 27, 2020

irwir commented Apr 27, 2020

danh-arm left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

irwir commented Apr 27, 2020

Choose a reason for hiding this comment

gilles-peskine-arm commented Apr 30, 2020

hanno-becker commented May 1, 2020 • edited Loading

irwir commented May 1, 2020

irwir commented May 8, 2020

gilles-peskine-arm left a comment

Choose a reason for hiding this comment

irwir commented May 14, 2020

danh-arm commented May 27, 2020

hanno-becker commented May 1, 2020 •

edited

Loading