
Ability for AsgiHandler and AsgiRequest to process the request body in chunks. #1269

Closed

Conversation

hozblok
Contributor

@hozblok hozblok commented Mar 26, 2019

#1001

To prevent the request body from being loaded completely into RAM, I suggest adding io.IOBase stream support to the AsgiRequest class. django.http.HttpRequest already supports working with streams, so we only have to construct the stream correctly and implement a read() method for Django.

If loading of the request body terminates early, because of a disconnect or an error (for example, exceeding the allowed file size), we must close the stream correctly and still allow a response to be sent if necessary.

The error-handling mechanisms and response generation are left unchanged. The ability to work with chunks is essential for file uploads, which are usually implemented as a POST request with multipart/form-data.
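
For context, this is the ASGI pattern the stream wraps (an illustrative sketch with assumed names, not the code in this PR): the protocol server delivers the body as a series of http.request events, and each chunk can be handled as it arrives instead of being joined into a single bytes object in memory.

    # Illustrative sketch only (assumed names, not this PR's code).
    async def iter_body_chunks(receive):
        """Yield request body chunks as the protocol server delivers them."""
        more_body = True
        while more_body:
            message = await receive()  # one ASGI ``http.request`` event
            yield message.get("body", b"")
            more_body = message.get("more_body", False)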

Thanks in advance for your feedback. I'm interested in making it better.

@hozblok hozblok requested a review from carltongibson as a code owner March 26, 2019 21:01
@carltongibson
Member

Hi @hozblok. Super thanks. I'm already working on this in #1251 so I shall compare.

@hozblok
Contributor Author

hozblok commented Mar 27, 2019

Hi @carltongibson. Great. Please, feel free to ask anything about implementation.

@carltongibson
Member

Hi @hozblok. Good stuff. Help me understand it... 🙂

So can you tell me why we need the more complex task-based approach to consuming the receive callable, rather than (essentially) just await-ing it in read(), as I have going in #1251? (What sort of test case shows the simpler approach to be inadequate?)

Make sense?

Thanks.

@hozblok
Contributor Author

hozblok commented Mar 28, 2019

Hi @carltongibson

  1. I'm looking at your implementation now. I'm ready to admit that your way (passing receive to the stream) looks simpler and more suitable in this case, although I have not tested your version yet.

  2. Isn't a truncate() call needed for self.buffer? Otherwise, will the chunks stay in RAM until the buffer is truncated?

  3. Since RequestBodyWrapper is a file-like object, I would inherit it from io.IOBase, io.RawIOBase, or io.BufferedIOBase. That way we automatically get the ability to close the stream, the inherited readline() method, and the context-manager interface (see the sketch after this list).

  4. In def __init__(self, scope, body): I propose renaming body -> stream.

  5. Why did you decide to remove the check?

        # Limit the maximum request data size that will be handled in-memory.
        if (
            settings.DATA_UPLOAD_MAX_MEMORY_SIZE is not None
            and self._content_length > settings.DATA_UPLOAD_MAX_MEMORY_SIZE
        ):
            raise RequestDataTooBig(
                "Request body exceeded settings.DATA_UPLOAD_MAX_MEMORY_SIZE."
            )
  6. Perhaps an interesting test case: self._content_length does not match the size of the body actually transferred. This should raise an exception, and we should abort the upload correctly.
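
A minimal sketch for point 3 (class and method names are assumptions, not the PR's actual code): extending io.RawIOBase and implementing readinto() is enough to inherit read(), readline(), close(), and the context-manager protocol from the standard library.

    import io

    class RequestBodyWrapper(io.RawIOBase):
        """Sketch only: a fixed in-memory body stands in for the real chunked source."""

        def __init__(self, data: bytes):
            self._data = memoryview(data)
            self._pos = 0

        def readable(self):
            return True

        def readinto(self, b):
            # Copy the next slice of the body into the caller's buffer.
            chunk = self._data[self._pos:self._pos + len(b)]
            b[:len(chunk)] = chunk
            self._pos += len(chunk)
            return len(chunk)

    # read(), readline(), close(), and ``with RequestBodyWrapper(...) as f:``
    # now come for free from io.RawIOBase / io.IOBase.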

@carltongibson
Member

So the idea was that if we can just make receive into a (sync) file-like object, then we can basically leverage django.http.request.HttpRequest rather than duplicating all of that functionality. That's also why we remove the settings.DATA_UPLOAD_MAX_MEMORY_SIZE check: Django already does it (and its implementation is much more battle-tested, so we won't hit issues like #1240).
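
A minimal sketch of that idea, assuming the Django request is handled in a worker thread (the names here are invented for illustration, not the actual #1251 code): bridge the async receive callable into a synchronous read() that HttpRequest can consume, so Django's own body handling, including the DATA_UPLOAD_MAX_MEMORY_SIZE check, applies unchanged.

    from asgiref.sync import async_to_sync

    class ReceiveStream:
        """Sketch only: a sync, file-like facade over the ASGI receive callable."""

        def __init__(self, receive):
            self._receive = receive
            self._buffer = b""
            self._more_body = True

        def read(self, size=-1):
            # Pull chunks until we have enough data or the body is complete.
            while self._more_body and (size < 0 or len(self._buffer) < size):
                message = async_to_sync(self._receive)()
                self._buffer += message.get("body", b"")
                self._more_body = message.get("more_body", False)
            if size < 0:
                data, self._buffer = self._buffer, b""
            else:
                data, self._buffer = self._buffer[:size], self._buffer[size:]
            return data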

I agree with your other points. RequestBodyWrapper should probably be named something like RequestIO and extend one of the io base classes, as you say, and so on.

Fancy jumping onto #1251 and co-authoring it with me? I have time scheduled for the sprints at DjangoCon Europe but if you have capacity we could get it finished and a release out before then?

The code side is nearly there (it just needs the _read_started checks removed, as per the comments), and the changes you've suggested here make sense. Then it's just a matter of tidying the tests and so on (the pattern of creating the wrapper in the tests could be factored better, etc.). Let me know; I'm happy to provide input if you can assist! Thanks.

@hozblok
Contributor Author

hozblok commented Mar 29, 2019

Of course, I'm ready to participate. Let's finish it together.

The scope of work is clear. First I will reread the entire history of the ticket more carefully. 🙂

@hozblok
Contributor Author

hozblok commented Apr 2, 2019

Hi @carltongibson,

Just in case, I will duplicate the new PR here:
carltongibson#1

@carltongibson
Member

Super. Thanks. I’ll have a look tomorrow. Good work!

@carltongibson
Member

Closing in favour of #1251 — Thanks again @hozblok!
