
WSGI middleware should stream request body, rather than loading it all at once. #371

Closed
peterlandry opened this issue Jun 11, 2019 · 8 comments · Fixed by #1825

Comments

@peterlandry

Hi! I'm deploying a Django app with uvicorn, running on k8s. Our containers were being killed, and I've found that when users upload large files, uvicorn's memory usage climbs and uploads slow to a crawl, eventually causing an OOM kill.

I'm not sure where this is happening yet. I suspected it could be related to django/channels#1251, but after a bit more digging I'm not so sure: I've tried running uvicorn in WSGI mode with channels completely removed from the Django install, and I get the same behavior. File uploads are loaded into memory (rather than streamed to disk, as they should be), and upload speeds slow to a crawl. The same app running on gunicorn works fine.

The file I'm testing with is about 470 MB.

@tomchristie
Member

So both channels and uvicorn's WSGI middleware consume the entire request body into memory rather than streaming it (due to some complexities of bridging from async on the frontend to WSGI's threaded concurrency).

See https://github.com/encode/uvicorn/blob/master/uvicorn/middleware/wsgi.py#L76-L83

Ideally we'd rework the WSGI middleware to provide proper streaming of request bodies.
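
For context, the linked code follows roughly this pattern (a simplified sketch, not the verbatim uvicorn source): the whole body is drained from the ASGI receive channel before the WSGI app is called, so a large upload is held entirely in memory for the duration of the request.

# Simplified sketch of the current behaviour (not the verbatim uvicorn source):
# drain the whole ASGI request body into memory before calling the WSGI app.
async def read_entire_body(receive) -> bytes:
    body = b""
    more_body = True
    while more_body:
        message = await receive()
        body += message.get("body", b"")  # naive bytes concatenation
        more_body = message.get("more_body", False)
    return body  # the full upload is now resident in memory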

@tomchristie tomchristie changed the title Large file upload memory leak? WSGI middleware should stream request body, rather than loading it all at once. Jun 12, 2019
@abersheeran
Member

Maybe using tempfile.SpooledTemporaryFile instead would be a simple and straightforward fix.
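
A minimal sketch of that idea (hypothetical, not an actual uvicorn patch): buffer the incoming body chunks into a tempfile.SpooledTemporaryFile, which stays in memory below a size threshold and spills to disk above it, then hand the file object to the WSGI app as wsgi.input.

import tempfile

# Hypothetical sketch: spool the request body to disk past a size threshold
# instead of holding it all in memory.
async def spool_body(receive, max_memory: int = 1024 * 1024):
    spool = tempfile.SpooledTemporaryFile(max_size=max_memory)
    more_body = True
    while more_body:
        message = await receive()
        spool.write(message.get("body", b""))
        more_body = message.get("more_body", False)
    spool.seek(0)
    return spool  # suitable for environ["wsgi.input"]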

@abersheeran
Member

@peterlandry @tomchristie What do you think of my newly submitted PR? It uses a bytearray to solve this.
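
For illustration, the idea is roughly the following (a sketch, not the actual diff): appending chunks to a bytearray is an amortized O(1) operation per chunk, whereas repeated bytes concatenation can copy the whole accumulated buffer on every chunk.

# Sketch of the idea behind the PR (not the actual diff): accumulate chunks
# in a bytearray instead of repeatedly concatenating immutable bytes objects.
async def read_body(receive) -> bytes:
    body = bytearray()
    more_body = True
    while more_body:
        message = await receive()
        body.extend(message.get("body", b""))
        more_body = message.get("more_body", False)
    return bytes(body)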

@Bluehorn

Bluehorn commented Feb 3, 2022

I was looking into this for @Flauschbaellchen and wrote a demo that illustrates how hard this problem hits. While working on this I ran into #1345 all the time, so I thought this would also aid in fixing that issue. Of course, today it wasn't reproducing anymore, as the newest release fixes that bug. 👍

Still, here is an example that can be used to compare the different ways to upload, in particular a2wsgi versus the middleware included with uvicorn. Instead of storing the uploads, I just compute a sha256 hash (a sketch of what such an app looks like follows the build commands below):

issue371_demo.zip

After unpacking, you can run this like this:

$ cd issue371_demo/
$ sudo docker build .
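
For reference, the WSGI side of such a demo could look roughly like this (a hypothetical sketch, not the code from the zip): read wsgi.input in fixed-size chunks and hash them, so the application itself never needs the whole upload in memory.

import hashlib

# Hypothetical sketch of a hashing upload endpoint (not the demo code from the zip):
# the body is read in chunks, so the app never holds the whole upload at once.
def application(environ, start_response):
    digest = hashlib.sha256()
    stream = environ["wsgi.input"]
    remaining = int(environ.get("CONTENT_LENGTH") or 0)
    while remaining > 0:
        chunk = stream.read(min(64 * 1024, remaining))
        if not chunk:
            break
        digest.update(chunk)
        remaining -= len(chunk)
    body = (digest.hexdigest() + "\n").encode()
    start_response("200 OK", [("Content-Type", "text/plain"),
                              ("Content-Length", str(len(body)))])
    return [body]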

When using the a2wsgi middleware for WSGI, 10 concurrent uploads using curl complete like this (uvicorn is run from gunicorn with 2 workers, and the WSGI middleware may use 2 threads):

Upload timings for wsgi/a2wsgi: [
1.3421823978424072, 1.4076666831970215, 1.4279963970184326, 1.448310375213623,
2.0521488189697266, 2.1042470932006836, 2.168593168258667, 2.179323434829712,
2.662322998046875, 2.7450664043426514].
Highest latency: 0.025380373001098633.

So essentially 4 uploads (2 processes × 2 threads) are processed concurrently. While the uploads are running, I issue ASGI requests concurrently to check the latency for processing ASGI requests; here they complete within 0.025 s in the worst case.

Compare this to using the wsgi middleware in uvicorn:

Upload timings for wsgi/uvicorn: [
33.341026306152344, 33.38411903381348, 33.76080870628357,
73.60798692703247, 73.64170670509338, 73.97475504875183,
74.0065565109253, 74.37995886802673, 74.40127992630005,
74.73946642875671].
Highest latency: 0.619253396987915.

No idea why the uploads complete at such irregular intervals.

And, of course, using only asgi, the uploads are processed concurrently and the latency is much better:

Upload timings for asgi: [
1.972560167312622, 2.0667288303375244, 2.0667288303375244, 2.0784835815429688,
2.5489165782928467, 2.6735644340515137, 2.724545955657959, 2.724545955657959,
2.735142707824707, 2.735142707824707].
Highest latency: 0.020055770874023438.

BTW: If you are adventurous, you can raise the size of the uploads. Note that the pull request #1329 actually made matters worse in my tests, because with many big uploads it took down my machine.

@tomchristie
Member

Simply replacing our previous naive byte concatenation with #1329 makes a massive difference here.

It's still plausible that an implementation that streams the request body through the WSGI adapter would be preferable, but that's not necessarily the case. Sometimes simple wins, just by virtue of being simple.

@Bluehorn

@tomchristie I agree that simple is preferable. But it seems you did not read my comment:

Note that the pull request #1329 actually made matters worse in my tests because it took down my machine for many big uploads.

I'd consider this a security issue, given that simply running a few big uploads will take down an application (or maybe just a single pod) by collecting all incoming data in memory before even handing it over to the application code.

@Kludex
Sponsor Member

Kludex commented Oct 27, 2022

The goal here is to replace the WSGIMiddleware we have with a2wsgi.WSGIMiddleware.

Implementation is available in #1303. If someone wants to take that over, it would be really helpful.
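
For anyone following along, the switch would look roughly like this on the user side (a sketch assuming a2wsgi's WSGIMiddleware entry point; the Django import path is just an example):

# Sketch: wrap an existing WSGI app with a2wsgi so uvicorn serves it as ASGI,
# which, per the comparison above, handles request bodies without buffering
# the whole upload up front.
from a2wsgi import WSGIMiddleware

from myproject.wsgi import application as django_wsgi_app  # hypothetical project path

app = WSGIMiddleware(django_wsgi_app)
# run with e.g.: uvicorn myproject.asgi_wrapper:app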

@Kludex Kludex added this to the Version 0.21.0 milestone Oct 29, 2022
@Kludex
Sponsor Member

Kludex commented Dec 24, 2022

Or... Option 1 from #1303 (comment).
