Fix high memory usage when using BaseHTTPMiddleware middleware classes and streaming responses #1018

Merged — erewok merged 2 commits into master from limit_base_qsize on Aug 4, 2020

Conversation

@erewok (Contributor) commented Aug 3, 2020

Limit queue size to 1 to prevent loading entire streaming response into memory.

This small PR has been split out from #1017 so that the changes in that PR can continue to be evaluated separately.

Closes #1012
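For context on the change itself (the PR's original title was "BaseHTTPMiddleware: add maxsize arg to Queue constructor"): BaseHTTPMiddleware passes response messages from the wrapped app to the client through an asyncio queue, and bounding that queue applies backpressure. A minimal sketch of the idea, with illustrative names rather than the exact diff:

import asyncio

# Unbounded: a fast producer can enqueue the entire streaming body
# before the consumer has drained any of it.
queue: asyncio.Queue = asyncio.Queue()

# Bounded to one element: `await queue.put(message)` now suspends the
# producer until the consumer has taken the previous message, so at most
# one response chunk is buffered in memory at a time.
queue = asyncio.Queue(maxsize=1)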

Commit: Limit queue size to 1 to prevent loading entire streaming response into memory
@JayH5 (Member) left a comment:

LGTM, is there any way we can test this?

(Review thread on starlette/middleware/base.py — outdated, resolved)
@florimondmanca (Member) commented Aug 3, 2020

@JayH5 Assuming we already have tests for streaming responses, the way I'd properly test this is with a memory test. I had done that for streaming in HTTPX, but we eventually dropped it since it requires an extra test dependency (memory_profiler, I think?). I'd be open to having it in Starlette, though, since I don't think there's any other way to prevent high memory usage from regressing in the future.

Edit: the dropped memory test in HTTPX can be seen here: encode/httpx@f092ecd
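A sketch of what such a memory regression test might look like, assuming memory_profiler were added as a test dependency; stream_large_response is a hypothetical helper that would drive the app through BaseHTTPMiddleware and consume a multi-megabyte streaming body:

import asyncio

from memory_profiler import memory_usage


def test_streaming_memory_stays_bounded():
    def run():
        # Hypothetical helper: issues a request through the middleware
        # and consumes a large StreamingResponse end to end.
        asyncio.run(stream_large_response())

    # Sample current-process memory (in MiB) before and during the run.
    baseline = max(memory_usage(-1, interval=0.1, timeout=1))
    peak = max(memory_usage((run, (), {}), interval=0.1))

    # With the queue bounded, peak memory should not scale with the size
    # of the streamed body. The 10 MiB margin is illustrative.
    assert peak - baseline < 10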

@erewok (Contributor, author) commented Aug 3, 2020

I haven't been able to get memory_profiler to work by decorating async functions in the (admittedly, only two) times I've tried. Did you have that working in HTTPX, @florimondmanca? I can take a look at the memory test you had before to see if there are tips for me.

When I was working on this, I did a naive sys.getsizeof of all the items in the queue (asyncio.Queue wraps a deque, so you can iterate over it) and printed out the queue size to validate.

Here's an example of the chunk of code inside body_stream that I used to validate my assumptions:

    async def body_stream() -> typing.AsyncGenerator[bytes, None]:
        while True:
            print("Queue element count is now: ", queue.qsize())
            all_obj_getsizeof = sum(sys.getsizeof(elem) for elem in queue._queue)  # type: ignore
            print("Queue memory use should be: ", all_obj_getsizeof)

On master, this produces output like this:

...
Yield chunk:  92
Yield chunk:  93
Yield chunk:  94
Yield chunk:  95
Yield chunk:  96
Yield chunk:  97
Yield chunk:  98
Yield chunk:  99
INFO:     127.0.0.1:54974 - "GET /streaming-memory-test/100 HTTP/1.1" 200 OK
Queue element count is now:  102
Queue memory use should be:  23664
Queue element count is now:  101
Queue memory use should be:  23432
Queue element count is now:  100
Queue memory use should be:  23200
Queue element count is now:  99
Queue memory use should be:  22968
Queue element count is now:  98
...

Whereas on this branch, it produces output like this:

Yield chunk:  0
Yield chunk:  1
INFO:     127.0.0.1:54931 - "GET /streaming-memory-test/100 HTTP/1.1" 200 OK
Queue element count is now:  1
Queue memory use should be:  232
Queue element count is now:  0
Queue memory use should be:  0
Yield chunk:  2
Queue element count is now:  0
Queue memory use should be:  0
Yield chunk:  3
Queue element count is now:  0
Queue memory use should be:  0
Yield chunk:  4
Queue element count is now:  0
Queue memory use should be:  0
Yield chunk:  5

This is for an endpoint that looks like this:

import os

from starlette.applications import Starlette
from starlette.responses import StreamingResponse

# `app` is assumed to have the BaseHTTPMiddleware subclass under test installed.
app = Starlette()


@app.route("/streaming-memory-test/{total_size:int}")
async def streaming_memory_test(request):
    total_size = request.path_params["total_size"]
    chunk_size = 1024
    # Scale the requested size up to at least one chunk (e.g. 100 -> 100 KB).
    while total_size < chunk_size:
        total_size *= chunk_size

    async def byte_stream():
        chunk_count = total_size // chunk_size
        remainder = total_size % chunk_size
        for n in range(chunk_count):
            print("Yield chunk: ", n)
            yield os.urandom(chunk_size)
        yield os.urandom(remainder)

    return StreamingResponse(byte_stream())

(which I wrote to simulate streaming a large file), and a request like this for a 100 KB file:

~/open_source
❯ curl http://localhost:8000/streaming-memory-test/100 --output sample_fl
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100  100k    0  100k    0     0  19.5M      0 --:--:-- --:--:-- --:--:-- 19.5M

~/open_source
❯ ll -h sample_fl
-rw-r--r--  1 erewok  staff   100K Aug  3 07:55 sample_fl

Also, there's no test of BaseHTTPMiddleware with a StreamingResponse in the codebase, so I added one in the other PR. I think it makes more sense there, because that PR changes the way streaming happens more than this one does. In truth, the tests I added could go in either PR; they should pass on master, on this branch, and on the other.

@florimondmanca (Member) commented

@erewok Yes, I remember that the memory test there was passing pretty reliably, both locally and on CI. It was tricky to get right, but I was fairly confident about it.

@erewok (Contributor, author) commented Aug 4, 2020

I'd like to look at the "pending task" issue before merging this: the case where the coroutine task doesn't get cancelled in the event of a client disconnect.
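
To illustrate the failure mode being referred to (a standalone sketch, not Starlette's actual middleware code): with a bounded queue, a producer blocked on put() can hang forever once its consumer goes away, so the task has to be cancelled explicitly on disconnect.

import asyncio


async def producer(queue: asyncio.Queue) -> None:
    while True:
        # With maxsize=1 this suspends until the consumer takes a chunk.
        # If the consumer is gone (client disconnected), it suspends forever.
        await queue.put(b"x" * 1024)


async def main() -> None:
    queue: asyncio.Queue = asyncio.Queue(maxsize=1)
    task = asyncio.ensure_future(producer(queue))
    await queue.get()  # consume one chunk, then simulate a disconnect
    task.cancel()      # without this, the producer stays pending forever
    try:
        await task
    except asyncio.CancelledError:
        pass


asyncio.run(main())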

@erewok (Contributor, author) commented Aug 4, 2020

I am satisfied now: there was a bug in the handler I wrote to test this code. With that handler fixed, the pending task issue is no longer present. This PR is good to merge, in my opinion.

@florimondmanca (Member) left a comment:

Sounds good!

@florimondmanca changed the title from "BaseHTTPMiddleware: add maxsize arg to Queue constructor" to "Fix high memory usage when using BaseHTTPMiddleware middleware classes and streaming responses" on Aug 4, 2020
@JayH5 (Member) left a comment:

LGTM too 🚀

@erewok merged commit 3d77a1c into master on Aug 4, 2020
@erewok deleted the limit_base_qsize branch on August 4, 2020, 23:55
Linked issue (closed by this PR): Memory usage streaming large responses (#1012)