
Server would occupy a lot of memory when method is not async #980

Closed
ZackJiang21 opened this issue Jun 22, 2020 · 10 comments
@ZackJiang21

I just read the source code of starlette, and I think I found the reason why it occupies so much memory. The problem is in the `request_response()` method in `starlette/routing.py`:


```python
def request_response(func: typing.Callable) -> ASGIApp:
    """
    Takes a function or coroutine `func(request) -> response`,
    and returns an ASGI application.
    """
    is_coroutine = asyncio.iscoroutinefunction(func)

    async def app(scope: Scope, receive: Receive, send: Send) -> None:
        request = Request(scope, receive=receive, send=send)
        if is_coroutine:
            response = await func(request)
        else:
            response = await run_in_threadpool(func, request)
        await response(scope, receive, send)

    return app
  
 
async def run_in_threadpool(
    func: typing.Callable[..., T], *args: typing.Any, **kwargs: typing.Any
) -> T:
    loop = asyncio.get_event_loop()
    if contextvars is not None:  # pragma: no cover
        # Ensure we run in the same context
        child = functools.partial(func, *args, **kwargs)
        context = contextvars.copy_context()
        func = context.run
        args = (child,)
    elif kwargs:  # pragma: no cover
        # loop.run_in_executor doesn't accept 'kwargs', so bind them in here
        func = functools.partial(func, **kwargs)
    return await loop.run_in_executor(None, func, *args)
```

My REST endpoint is not async, so it runs through `loop.run_in_executor`, but starlette does not specify an executor here, so the default thread pool size is `os.cpu_count() * 5`. My test machine has 40 CPUs, so I get 200 threads in the pool. After each request, the objects held by those threads are not released until the thread is reused by a later request, which occupies a lot of memory, especially when I wrap a large deep learning model in the server.
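As a workaround outside Starlette, the event loop's default executor can be replaced with a smaller one using only the stdlib. This is a sketch, not Starlette's API; `max_workers=4` and the `"app"` thread-name prefix are arbitrary illustrations:

```python
import asyncio
import concurrent.futures
import threading

def blocking_view() -> str:
    # Stand-in for a synchronous endpoint; returns the worker thread's name.
    return threading.current_thread().name

async def main() -> list:
    loop = asyncio.get_running_loop()
    # Cap the default executor at 4 threads (arbitrary for illustration)
    # instead of the Python 3.7 default of os.cpu_count() * 5.
    loop.set_default_executor(
        concurrent.futures.ThreadPoolExecutor(max_workers=4, thread_name_prefix="app")
    )
    # run_in_executor(None, ...) now uses the 4-thread pool above.
    return await asyncio.gather(
        *(loop.run_in_executor(None, blocking_view) for _ in range(8))
    )

names = asyncio.run(main())
print(sorted(set(names)))  # at most 4 distinct worker-thread names
```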

My question is could we make the thread pool size configurable?

@raphaelauv

It seems that there is a limit of 32 threads in the CPython source:

https://github.com/python/cpython/blob/7d9d25dbedfffce61fc76bc7ccbfa9ae901bf56f/Lib/concurrent/futures/thread.py#L136

@ZackJiang21
Author

> It seems that there is a limit of 32 threads in the CPython source:
>
> https://github.com/python/cpython/blob/7d9d25dbedfffce61fc76bc7ccbfa9ae901bf56f/Lib/concurrent/futures/thread.py#L136

The link you pasted is for Python 3.8, but Python 3.7 and lower versions do not have that limit, so I think it still makes sense for starlette to enforce one.
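For reference, on Python 3.8+ the default worker count for `ThreadPoolExecutor` is `min(32, os.cpu_count() + 4)`, while 3.7 used `os.cpu_count() * 5` with no cap. A quick check (peeking at the private `_max_workers` attribute, so for illustration only):

```python
import concurrent.futures

# On Python 3.8+, a ThreadPoolExecutor created with no max_workers
# argument defaults to min(32, os.cpu_count() + 4).
executor = concurrent.futures.ThreadPoolExecutor()
print(executor._max_workers)  # never exceeds 32 on 3.8+
executor.shutdown()
```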

@tomchristie
Member

> My question is could we make the thread pool size configurable?

No - we ought to have sensible configurations here on behalf of our users, rather than adding to the number of things they need to think about.

If anyone's motivated enough to dig into clear justifications that the system defaults are appropriate for us, and has an actionable change that we oughta make, then let's consider that. Otherwise, let's just leave this as it is.

@aminalaee
Member

aminalaee commented Feb 3, 2022

I think this dates from before the migration to anyio.
We now rely on anyio's default pool size. Last time I checked, the default pool size on the asyncio backend is 40.

I guess this is still configurable outside of Starlette, so we can keep this closed.

@tomchristie
Member

Thanks @aminalaee 👍🏼

@Kludex
Sponsor Member

Kludex commented Feb 3, 2022

It's worth mentioning that you can modify the default capacity limiter on anyio. 👍

@adriangb
Member

adriangb commented Jul 8, 2022

> It's worth mentioning that you can modify the default capacity limiter on anyio. 👍

I don't think this is true

@Kludex
Sponsor Member

Kludex commented Jul 8, 2022

@adriangb
Member

adriangb commented Jul 8, 2022

Yup you're right, it can be modified, I thought you meant replace 😅
