Add RequestSizeLimitMiddleware and RequestTimeoutMiddleware #2328

adriangb · 2023-11-06T23:27:55Z

Closes #2155

adriangb · 2023-11-06T23:28:07Z

TODOs:

Docs
What do we want to do for websockets? Lifespans?

adriangb · 2023-11-13T16:55:48Z

A couple of things to consider:

Dealing with SSE requests
Overriding instead of adding. As this PR currently stands you could set a 10s timeout for your entire app and a stricter timeout for individual routes, but not the other way around (a 1s timeout for all routes except some specific routes which get not timeout, e.g. because they stream data). To get that working I think we'd need to do the enforcing at the route level and have a middleware that simply sets some markers on the scope object to customize the limits. That would be more invasive of a change than this.

Kludex · 2023-11-14T10:45:10Z

Both middlewares can be on a third party packages, and we can close the issue by just pointing at them.

Can we separate the two middlewares in different PRs? The initial discussion was about the request size one.

adriangb · 2023-11-14T14:01:21Z

Both middlewares can be on a third party packages, and we can close the issue by just pointing at them.

I disagree. It's an important feature to have readily available to all Starlette users, it doesn't require 3rd party packages, the complexity isn't massive and I believe we should make some limits the default in 1.0 like Django and other more mature frameworks do.

Can we separate the two middlewares in different PRs? The initial discussion was about the request size one.

Yes I'm happy to do that if it'd make it easier to discuss.

alex-oleshkevich · 2023-11-14T15:34:53Z

for reference: my approach which works on per-route basis
#2175

alex-oleshkevich · 2023-11-14T15:40:54Z

My main concern about middleware based approach is that it is very limited. Say, your global middleware will always have a large value. Because if just one route needs to handle 1gb uploads, the global limit will be 1gb.
Per route limits does not have this flaw, however it is still possible to read unlimited data in custom middlewares. But per-route approach solves the majority of cases.

I see it as app-level limit (the global one that covers all middlewares also) + option to override per a route. In result, we can get this example setup: 8mb global and 24mb for /user/photo-upload.

adriangb · 2023-11-14T16:49:54Z

Yeah, I brought that up above.

I'll rework this to satisfy that use case, although it will be a good chunk more invasive as I said above.

adriangb · 2023-11-14T17:32:08Z

Also sorry I did not consider your PR, I had lost track / forgotten about it.

abersheeran · 2023-11-15T01:52:49Z

I have a small question, why not just read the request content-length to limit the size.

adriangb · 2023-11-15T02:53:53Z

Clients can easily lie about that

abersheeran · 2023-11-15T04:05:46Z

Clients can easily lie about that

But proxy server or asgi
server should not receive data more than content-length?

adriangb · 2023-11-15T04:44:01Z

I'm not saying we shouldn't also limit the length to the reported content length, I'm just saying that it doesn't serve a security purpose.

abersheeran · 2023-11-15T05:50:27Z

I'm not saying we shouldn't also limit the length to the reported content length, I'm just saying that it doesn't serve a security purpose.

Maybe my previous words caused some misunderstanding. What I meant was: the forward reverse proxy server or ASGI server will not pass data to the ASGI application that exceeds the content-length length. Therefore, forging a content-length length on the client side will only result in truncation when the server receives data.
For example, h11 used by Uvicorn will check whether the Content-Length in the Reader matches the actual size of the received data.

adriangb · 2023-11-15T07:09:09Z

Then I’m confused as to what you are suggesting: are you saying that we should also enforce that limit, that that limit is already enforced elsewhere, and this PR (or #2174) is not needed, or something else?

abersheeran · 2023-11-15T07:25:20Z

starlette/middleware/limits.py

+        async def rcv() -> Message:
+            nonlocal total_size
+            message = await receive()
+            chunk_size = len(message.get("body", b""))
+            if self.max_chunk_size is not None and chunk_size > self.max_chunk_size:
+                raise ChunkTooLarge(
+                    self.max_chunk_size
+                    if self.include_limits_in_error_responses
+                    else None
+                )
+            total_size += chunk_size
+            if self.max_request_size is not None and total_size > self.max_request_size:
+                raise RequestTooLarge(
+                    self.max_request_size
+                    if self.include_limits_in_error_responses
+                    else None
+                )
+            return message


Then I’m confused as to what you are suggesting: are you saying that we should also enforce that limit, that that limit is already enforced elsewhere, and this PR (or #2174) is not needed, or something else?

Would it be more concise and efficient to change here to judge the request content-length?

Instead of the user-defined limits? Or in addition to?

What @abersheeran meant is to check the Content-Length instead of having the logic of adding up chunk_sizes - which is actually what Django does: https://github.com/django/django/blob/6daf86058bb6fb922eb3fe3abae6f5c0e645020c/django/http/request.py#L323-L347.

Django also has this logic on the multipart parser: https://github.com/django/django/blob/594873befbbec13a2d9a048a361757dd3cf178da/django/http/multipartparser.py#L241-L248.

Kludex · 2023-11-17T15:46:52Z

What do we want to do for websockets?

Tornado WebSockets have a limitation by default.
1. https://www.tornadoweb.org/en/stable/releases/v4.5.0.html
2. https://www.tornadoweb.org/en/stable/websocket.html#tornado.websocket.WebSocketHandler
Uvicorn already limits the size of each WebSocket message via —ws-max-size configuration - it only applies to websockets.
1. https://www.uvicorn.org/settings/#implementation.

I don't think WebSockets should be a point of concern here.

Lifespans?

Nothing to do for lifespans.

Dealing with SSE requests

No special logic is required for SSE.

Notes/Questions

Is the argumentation of having resource limits on the ASGI application, and not ASGI server because you can apply on individual endpoints/have more granularity?
We need to check the Content-Length (single body event) on the requests that are not Transfer-Encoding: chunked (streaming).

adriangb · 2023-11-17T15:55:27Z

I don't think WebSockets should be a point of concern here.

The configurations offered by Uvicorn don't cover all of the use cases here. You can't configure it on a per route basis for starters.

No special logic is required for SSE.

What I mean is that if you install a blanket timeout or request size limit it might make sense to say "SSE requests don't have a limit". That would require special logic.

Is the argumentation of having resource limits on the ASGI application, and not ASGI server because you can apply on individual endpoints/have more granularity?

Yes.

We need to check the Content-Length (single body event) on the requests that are not Transfer-Encoding: chunked (streaming).

Agreed, we can check it and if it is larger than the limit error immediately without streaming anything. But we can't "trust" it if it's smaller.

Kludex · 2023-11-17T16:01:22Z

We can trust the content length header. The server errors otherwise.

adriangb · 2023-11-17T16:02:36Z

But Starlette doesn't depend on the server. We shouldn't make assumptions like this about the server's behavior unless it's part of the ASGI spec, which to my knowledge this is not. Besides, the only advantage of trusting the content-length header when it says it's smaller than the user defined limit is that we do one less ASGI middleware wrapping on receive. It's not expensive to keep track of the actual streamed size, I don't think it justifies the potential footgun.

adriangb · 2024-01-21T00:13:12Z

@Kludex I’m not sure if it was on purpose or not but I noticed you closed #2175 and not this one. I don’t think there was any concensus on how to implement this feature, nor do I think this PR is intrinsically better in significant ways, so I just wanted to check what your intention was.

Add RequestSizeLimitMiddleware and RequestTimeoutMiddleware

e029e6f

adriangb self-assigned this Nov 6, 2023

adriangb requested a review from a team November 6, 2023 23:28

adriangb added the feature New feature or request label Nov 6, 2023

adriangb added this to the Version 1.0 milestone Nov 6, 2023

adriangb mentioned this pull request Nov 8, 2023

Limit max request size #2155

Open

Merge branch 'master' into limits-middleware

eaac22c

abersheeran reviewed Nov 15, 2023

View reviewed changes

alex-oleshkevich mentioned this pull request Nov 16, 2023

Add request_max_size option #2175

Closed

Kludex mentioned this pull request Nov 27, 2023

Add LimitBodySizeMiddleware #2350

Closed

Kludex closed this Jan 20, 2024

Kludex reopened this Jan 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add RequestSizeLimitMiddleware and RequestTimeoutMiddleware #2328

Add RequestSizeLimitMiddleware and RequestTimeoutMiddleware #2328

adriangb commented Nov 6, 2023

adriangb commented Nov 6, 2023 •

edited

adriangb commented Nov 13, 2023

Kludex commented Nov 14, 2023

adriangb commented Nov 14, 2023

alex-oleshkevich commented Nov 14, 2023

alex-oleshkevich commented Nov 14, 2023

adriangb commented Nov 14, 2023

adriangb commented Nov 14, 2023

abersheeran commented Nov 15, 2023

adriangb commented Nov 15, 2023

abersheeran commented Nov 15, 2023

adriangb commented Nov 15, 2023

abersheeran commented Nov 15, 2023

adriangb commented Nov 15, 2023

abersheeran Nov 15, 2023

adriangb Nov 16, 2023

Kludex Nov 17, 2023

Kludex commented Nov 17, 2023

adriangb commented Nov 17, 2023

Kludex commented Nov 17, 2023

adriangb commented Nov 17, 2023 •

edited

adriangb commented Jan 21, 2024

Add RequestSizeLimitMiddleware and RequestTimeoutMiddleware #2328

Are you sure you want to change the base?

Add RequestSizeLimitMiddleware and RequestTimeoutMiddleware #2328

Conversation

adriangb commented Nov 6, 2023

adriangb commented Nov 6, 2023 • edited

adriangb commented Nov 13, 2023

Kludex commented Nov 14, 2023

adriangb commented Nov 14, 2023

alex-oleshkevich commented Nov 14, 2023

alex-oleshkevich commented Nov 14, 2023

adriangb commented Nov 14, 2023

adriangb commented Nov 14, 2023

abersheeran commented Nov 15, 2023

adriangb commented Nov 15, 2023

abersheeran commented Nov 15, 2023

adriangb commented Nov 15, 2023

abersheeran commented Nov 15, 2023

adriangb commented Nov 15, 2023

abersheeran Nov 15, 2023

Choose a reason for hiding this comment

adriangb Nov 16, 2023

Choose a reason for hiding this comment

Kludex Nov 17, 2023

Choose a reason for hiding this comment

Kludex commented Nov 17, 2023

Notes/Questions

adriangb commented Nov 17, 2023

Kludex commented Nov 17, 2023

adriangb commented Nov 17, 2023 • edited

adriangb commented Jan 21, 2024

adriangb commented Nov 6, 2023 •

edited

adriangb commented Nov 17, 2023 •

edited