Reorganize http_receive() #640

DimitriPapadopoulos · 2020-04-14T13:23:46Z

Prepare for adding an external HTTP library later on.

The pull request will help use external HTTP / XML libraries later on. At this point I am not certain we will benefit from using PicoHTTPParser in terms of maintainability and less code. Yet I'm opening the pull request to perform some code clean up, add a few assertions here and there, and raise a few issues.

For example we do not catch SSL errors if the header has already been read. Here is the relevant part in the new code:
https://github.com/DimitriPapadopoulos/openfortivpn/blob/230b158/src/http.c#L177-L181
and in the old code:
https://github.com/adrienverge/openfortivpn/blob/8b0c235/src/http.c#L217-L221
@adrienverge Is this on purpose? Are we lenient on purpose? On my machine I have modified the code to be strict on SSL errors, whether the HTTP header has been read or not. I haven't seen any occurrence of SSL errors.

adrienverge · 2020-04-14T13:51:41Z

@DimitriPapadopoulos I don't know about SSL errors, I can't remember.

All I can tell is that I tried to avoid having external dependencies, hence the HTTP decoding inside our source.

DimitriPapadopoulos · 2020-04-14T14:14:55Z

Oh, agreed about the dependencies. I'm after small libraries that can be included in the source code.

PicoHTTPParser is just two files picohttpparser.c and picohttpparser.h.

adrienverge · 2020-04-14T14:20:17Z

I'm after small libraries that can be included in the source code.

👍

DimitriPapadopoulos · 2020-04-14T14:21:06Z

I have pushed the code that is stricter on SSL errors. I will keep testing it for a few days.

DimitriPapadopoulos · 2020-04-15T09:41:01Z

OK, it appears we are lenient on SSL errors after the header has been read because of change 70dadda. Specifically the change from:

	if (n < 0) {

to:

	if (!header_size) {

The comment reads:

So, let's just obey the Content-Length header. If it's missing (chunked
encoding or standards violation) then read until the EOF, but still
succeed if the header was complete. This way we're hopefully on the safe
side.

Yet I don't see fundamental differences in the way we process chunked encoding vs. Content-Length-based responses. If Content-Length is known, we stop as soon as we read the expected length:

openfortivpn/src/http.c

Lines 204 to 205 in 8b0c235

    
           if (bytes_read >= header_size + content_size) 
        
           	break;

If on the other hand the response is chunked-encoded, we stop as soon as we detect the last chunk terminator:

openfortivpn/src/http.c

Lines 198 to 202 in 8b0c235

    
           /* Last chunk terminator. Done naively. */ 
        
           if (bytes_read >= 7 && 
        
               !memcmp(&buffer[bytes_read - 7], 
        
                       "\r\n0\r\n\r\n", 7)) 
        
           	break;

So what is this "'safe side" about? I have verified that checking strictly for SSL errors still works when parsing chunked-encoded responses.

The code that handles the chunked encoding has been introduced in f1bfe16, a few days after 70dadda. My guess is that the "safe side" part had probably been left over because the author wasn't yet certain about the robustness of the chunked encoding code:

FortiOS 5 talks chunked. We don't really follow; but let's assume that
the if the packet ends with a zero-sized chunk we're done.

Kind of a change that works in testing and breaks in production.

Yet I believe the chunked encoding code has proven quite astute and robust over the years. It believe it is time we parse HTTP following the rules - so that we can move to 3rd party HTTP parsing code if needed.

mrbaseman

I have a few small comments just to be sure I have understood the changes.

src/http.c

mrbaseman

With the explanations now, I think the changes are good.

DimitriPapadopoulos · 2020-04-15T13:01:31Z

I'll try to provide more comprehensive explanations in the commit.

mrbaseman · 2020-04-15T13:09:32Z

maybe just a few buzzwords like "code cleanup and improve readability" would have lead my thoughts in the right direction :-)

Improve readabilty by moving conditionals that operate on 'n' right after assignment of 'n'. The conditional at the end of the loop: while (n > = 0) has been replaced by this conditional: if (n < = 0) and this one that triggers a tight 2nd loop: if (n == ERR_SSL_AGAIN) ⇔ if (n == 0) The new codes *returns from the function* with ERR_HTTP_SSL upon: if (n < 0) The previous code would only *leave the loop* upon: if (n < 0) and then return from the function with ERR_HTTP_SSL only upon: if (!header) Therefore SSL violations were silently ignored after reading the header and while reading the body of the HTTP response. Increase the HTTP buffer capacity when needed. You never know. The previous size of 32 KB used to work well from 2015 to 2020. In 2020 a case was reported where 32 KB were not enough anymore and we increased the buffer size to 64 KB. Someday 64 KB might not be enough either. Yet in most cases 32 KB are more than enough. So start with 32 KB and increase well beyond 64 KB if needed, instead of bailing out with ERR_HTTP_SSL. These changes will help introduce 3rd party HTTP parsing code if we decide to go that way.

DimitriPapadopoulos requested a review from adrienverge April 14, 2020 13:23

DimitriPapadopoulos force-pushed the http branch from 230b158 to 53c2d74 Compare April 14, 2020 13:26

DimitriPapadopoulos force-pushed the http branch from 53c2d74 to c2c467e Compare April 14, 2020 14:19

DimitriPapadopoulos assigned mrbaseman and unassigned mrbaseman Apr 14, 2020

DimitriPapadopoulos requested a review from mrbaseman April 14, 2020 14:38

DimitriPapadopoulos mentioned this pull request Apr 15, 2020

Use 3rd party HTTP and XML parsing library? #347

Open

DimitriPapadopoulos self-assigned this Apr 15, 2020

mrbaseman reviewed Apr 15, 2020

View reviewed changes

src/http.c Show resolved Hide resolved

src/http.c Show resolved Hide resolved

src/http.c Show resolved Hide resolved

mrbaseman approved these changes Apr 15, 2020

View reviewed changes

DimitriPapadopoulos force-pushed the http branch from c2c467e to f608fce Compare April 15, 2020 13:41

DimitriPapadopoulos force-pushed the http branch from f608fce to 867bbe9 Compare April 15, 2020 13:46

DimitriPapadopoulos merged commit 5574b45 into adrienverge:master Apr 15, 2020

DimitriPapadopoulos deleted the http branch April 15, 2020 14:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reorganize http_receive() #640

Reorganize http_receive() #640

DimitriPapadopoulos commented Apr 14, 2020

adrienverge commented Apr 14, 2020

DimitriPapadopoulos commented Apr 14, 2020

adrienverge commented Apr 14, 2020

DimitriPapadopoulos commented Apr 14, 2020

DimitriPapadopoulos commented Apr 15, 2020 •

edited

Loading

mrbaseman left a comment

mrbaseman left a comment

DimitriPapadopoulos commented Apr 15, 2020

mrbaseman commented Apr 15, 2020

Reorganize http_receive() #640

Reorganize http_receive() #640

Conversation

DimitriPapadopoulos commented Apr 14, 2020

adrienverge commented Apr 14, 2020

DimitriPapadopoulos commented Apr 14, 2020

adrienverge commented Apr 14, 2020

DimitriPapadopoulos commented Apr 14, 2020

DimitriPapadopoulos commented Apr 15, 2020 • edited Loading

mrbaseman left a comment

Choose a reason for hiding this comment

mrbaseman left a comment

Choose a reason for hiding this comment

DimitriPapadopoulos commented Apr 15, 2020

mrbaseman commented Apr 15, 2020

DimitriPapadopoulos commented Apr 15, 2020 •

edited

Loading