feat!: don't run Celery workers in dev mode #1041

arbrandes · 2024-04-16T19:29:11Z

Tutor's importing * from devstack.py[1] for the development settings, and that means that we aren't using Celery workers at all in dev mode (see [2]). This removes them from the dev compose file, thus saving everybody a significant chunk of RAM.

[1] https://github.com/overhangio/tutor/blob/master/tutor/templates/apps/openedx/settings/lms/development.py#L3
[2] https://github.com/openedx/edx-platform/blob/master/lms/envs/devstack.py#L35

To do so, we rely on Docker compose profiles[3].

[3] https://docs.docker.com/compose/profiles/
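
For readers who, like some reviewers here, haven't used profiles before, here is a minimal Python sketch of the selection rule as described in the Compose docs: a service with no `profiles` key always starts, while a service that declares profiles starts only if one of them is active. The `workers` profile name and the `services_to_start` helper are made up for illustration and are not taken from this PR's diff.

```python
from typing import Dict, Iterable, List


def services_to_start(services: Dict[str, dict], active_profiles: Iterable[str]) -> List[str]:
    """Toy model of Compose profile selection (not Compose's actual code)."""
    active = set(active_profiles)
    started = []
    for name, definition in services.items():
        profiles = set(definition.get("profiles", []))
        # No profiles declared: always enabled.
        # Profiles declared: enabled only if at least one is active.
        if not profiles or profiles & active:
            started.append(name)
    return started


# Hypothetical service layout; "workers" is an assumed profile name.
compose_services = {
    "lms": {},
    "cms": {},
    "lms-worker": {"profiles": ["workers"]},
    "cms-worker": {"profiles": ["workers"]},
}

default_start = services_to_start(compose_services, [])
with_workers = services_to_start(compose_services, ["workers"])
```

With no active profile, only `lms` and `cms` are selected. Compose also starts a profiled service when it is named explicitly on the command line, which this sketch leaves out.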

BREAKING CHANGE: the COMPOSE_PROJECT_STARTED hook signature had to be changed to accommodate profile selection.
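
As background on why the workers sit idle in dev: the description says the devstack settings mean Celery workers aren't used, which is what happens when tasks run in the calling process (Celery calls this eager mode). The toy model below illustrates the difference. `ToyTask` is a hypothetical stand-in, not Celery's implementation, and it assumes the linked devstack line switches eager mode on.

```python
from collections import deque
from typing import Any, Callable, Deque, Optional, Tuple


class ToyTask:
    """Hypothetical stand-in for a Celery task, for illustration only."""

    def __init__(self, func: Callable[..., Any], queue: Deque[Tuple[Callable[..., Any], tuple]], always_eager: bool) -> None:
        self.func = func
        self.queue = queue
        self.always_eager = always_eager

    def delay(self, *args: Any) -> Optional[Any]:
        if self.always_eager:
            # Eager mode: run right here in the caller's process; no worker is involved.
            return self.func(*args)
        # Otherwise, enqueue the call for a separate worker process to pick up.
        self.queue.append((self.func, args))
        return None


def add(x: int, y: int) -> int:
    return x + y


# Dev-like setup (assumed): eager mode on, so the queue stays empty.
dev_queue: Deque[Tuple[Callable[..., Any], tuple]] = deque()
dev_result = ToyTask(add, dev_queue, always_eager=True).delay(2, 3)

# Production-like setup: the call is only queued, and a worker must consume it.
prod_queue: Deque[Tuple[Callable[..., Any], tuple]] = deque()
prod_result = ToyTask(add, prod_queue, always_eager=False).delay(2, 3)
```

In the eager case nothing ever reaches the queue, so a running worker would have nothing to do.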

DawoudSheraz · 2024-04-17T09:39:38Z

changelog.d/20240416_162659_arbrandes_no_workers_in_dev.md

@@ -0,0 +1,2 @@
+- 💥[Feature] Use Docker compose profiles to control services. (by @arbrandes) -->
+- [Fix] Don't start Celery workers in dev mode, as they're never used. (by @arbrandes) -->


We would need some sort of guidelines to understand how the workers can be set up in dev mode, especially now that we are using Docker compose profiles.

arbrandes · 2024-04-19

Just a simple tutor dev dc start lms-worker would start the service, for instance, but the workers wouldn't actually be usable without modifying settings.py. Is this what you mean we should document?

Of course, the other question would be: why would anybody want to do this in a dev environment?

DawoudSheraz · 2024-04-22

Yeah, the command and setting changes, yes. I get the point that most devs might not be using this; we can have the doc in a follow-up. It is not a blocker.

regisb · 2024-04-17T10:22:24Z

tutor/templates/local/docker-compose.yml

@@ -171,6 +171,8 @@ services:
      {%- endfor %}
    depends_on:
      - lms
+    profiles:


I wasn't familiar with profiles, to be honest. I like the idea of not running workers in dev, but I'd like to avoid making a breaking change to the filter API, and also changes across many python functions.

Instead, could we simply move the lms-worker and cms-worker declarations from docker-compose.yml to docker-compose.prod.yml?

arbrandes · 2024-04-19

I don't think it's going to be that simple. There are other places in the code that assume those service definitions exist. For instance, https://github.com/overhangio/tutor/blob/master/tutor/commands/compose.py#L289. Plus, I suspect profiles might be useful down the road, wouldn't you say so?

If you're dead set against it, though, there is another way: we can just set the worker services in the dev compose override file to a profile like "donotstart". We get the same thing this PR provides, but without actually supporting different profiles.

DawoudSheraz

The changes look good to me, though I will keep this open until Régis is back.


kdmccormick · 2024-05-07T14:35:50Z

Haha, I have a competing proposal (although I let it languish, because I was never able to confirm or deny that pdb would work on these workers):

fix: run Celery tasks asynchronously in dev mode #928

My thinking was that it would be good to run these tasks asynchronously, because it would expose developers to the same race conditions that inevitably happen with Celery tasks in production. But I can see the other side of the argument: async tasks are harder to debug and the workers use resources.

Either way, I think we can all agree that we should either use these workers, or turn them off. So I see four paths forward:

1. Use this PR, which will disable workers in dev. Open edX contributors who work on Celery tasks should take note to manually test their code using tutor local.
2. Use my PR, which will enable Celery tasks by default for everybody.
3. Create a new Tutor ASYNC_CELERY_TASKS configuration flag, enabled by default. Disabling it would remove the Celery workers from docker-compose and configure Django to run the tasks in-process.
4. Same as (3), but implement it as a tutor-celery plugin. Like tutor-mfe, it would be installed and enabled by default.

My vote is (1), but I would support any of them.
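
To make option (3) a bit more concrete, here is a rough sketch of what the proposed flag would toggle. Everything in it is hypothetical: `dev_celery_plan` and its return keys are invented for illustration, and ASYNC_CELERY_TASKS is only a proposal in this thread, not an existing Tutor setting.

```python
from typing import Dict, List, Union

# Worker service names as discussed in this thread.
WORKER_SERVICES: List[str] = ["lms-worker", "cms-worker"]


def dev_celery_plan(async_celery_tasks: bool = True) -> Dict[str, Union[List[str], bool]]:
    """Hypothetical helper: what a proposed ASYNC_CELERY_TASKS flag would control."""
    return {
        # Flag on: keep the worker services in the compose project.
        # Flag off: drop them entirely.
        "worker_services": list(WORKER_SERVICES) if async_celery_tasks else [],
        # Flag off: Django runs tasks in-process instead of queueing them.
        "run_tasks_in_process": not async_celery_tasks,
    }


default_plan = dev_celery_plan()
lightweight_plan = dev_celery_plan(async_celery_tasks=False)
```

The two outputs always move together, which is the point made above: either the workers run and are used, or they are off and tasks run in-process.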

DawoudSheraz · 2024-05-07T15:01:04Z

My take on all points:

1. This seems like the most viable solution. However, I also second the point that devs should be able to run the tasks async if they want to. If they can do so using local, good enough. But it will confuse devs, as we recommend dev primarily for development purposes. Switching to local results in images being rebuilt and all, which can be infuriating if you are trying to debug.
2. Already discussed in (1).
3. Régis has argued for avoiding new configs and instead adding filters/plugins if needed.
4. How do you envision the plugin working, and what would it be responsible for within the ecosystem? To relate this to (1), if we can use the plugin for this purpose, it would be nice. I suspect we would be using the plugin to override the IDA's Celery settings and also to allow Celery configs to be added without modifying the core (a somewhat similar situation to the one Régis describes in a comment on feat: Celery worker concurrency setting #1010).

arbrandes requested a review from DawoudSheraz April 16, 2024 19:29

arbrandes force-pushed the no-workers-in-dev branch from 0122e69 to 4334c84 Compare April 16, 2024 19:30

arbrandes marked this pull request as draft April 16, 2024 19:32

arbrandes force-pushed the no-workers-in-dev branch from 4334c84 to bf0460b Compare April 16, 2024 21:12

arbrandes changed the title ~~feat: don't run Celery workers in dev mode~~ feat!: don't run Celery workers in dev mode Apr 16, 2024

arbrandes force-pushed the no-workers-in-dev branch from bf0460b to 5c3b229 Compare April 16, 2024 21:16

arbrandes marked this pull request as ready for review April 16, 2024 21:16

arbrandes mentioned this pull request Apr 16, 2024

feat: Celery worker concurrency setting #1010

Open

arbrandes requested a review from regisb April 16, 2024 21:19

DawoudSheraz reviewed Apr 17, 2024

regisb reviewed Apr 17, 2024

arbrandes force-pushed the no-workers-in-dev branch from 5c3b229 to 41c9dc5 Compare April 29, 2024 16:46

DawoudSheraz self-requested a review April 30, 2024 09:06

DawoudSheraz reviewed Apr 30, 2024

arbrandes force-pushed the no-workers-in-dev branch from 41c9dc5 to 35eb685 Compare May 7, 2024 14:08
