#11949 trial: Pass parallel trial worker index as env variable #11950

p12tic · 2023-09-03T09:30:36Z

Scope and purpose

Fixes #11949

Currently tests run under parallel trial do not know about each other. This makes it difficult to coordinate access to shared resources (e.g. database) that take a long time to construct.

For example, consider a case of 10000 tests that access a database. Currently tests can only be run in sequence, each using the same database and cleaning up after itself. Running tests in parallel would involve constructing 10000 separate databases which is infeasible. However, if each test knew the index of the parallel Twisted trial worker it runs under, then only small number of databases would need to be created and tests would use them like in serial case. E.g. when running tests using trial -j16, only 16 databases would need to be created.

This is done by exposing environment variable
TWISTED_TRIAL_PARALLEL_INDEX to the tests run under the parallel runner.

Currently tests run under parallel trial do not know about each other. This makes it difficult to coordinate access to shared resources (e.g. database) that take a long time to construct. For example, consider a case of 10000 tests that access a database. Currently tests can only be run in sequence, each using the same database and cleaning up after itself. Running tests in parallel would involve constructing 10000 separate databases which is infeasible. However, if each test knew the index of the parallel Twisted trial worker it runs under, then only small number of databases would need to be created and tests would use them like in serial case. E.g. when running tests using trial -j16, only 16 databases would need to be created. This is done by exposing environment variable TWISTED_TRIAL_PARALLEL_INDEX to the tests run under the parallel runner.

p12tic · 2023-09-03T09:41:44Z

please review

adiroiban

Many thanks for the changes.

They look good. Great PR.

I left only a few minor comments.

adiroiban · 2023-09-04T07:48:06Z

src/twisted/trial/_dist/test/test_disttrial.py

+        self.assertEqual(os.pathsep.join(sys.path), environments[0]["PYTHONPATH"])
+        parallel_indexes = set(e["TWISTED_TRIAL_PARALLEL_INDEX"] for e in environments)
+        self.assertEqual(set(["0", "1", "2", "3"]), parallel_indexes)


This is just a small comment

parrallel_indexes vs parallelIndexes for pedantic code standard

and also check that PYTHONPATH is present in all environments

Suggested change

self.assertEqual(os.pathsep.join(sys.path), environments[0]["PYTHONPATH"])

parallel_indexes = set(e["TWISTED_TRIAL_PARALLEL_INDEX"] for e in environments)

self.assertEqual(set(["0", "1", "2", "3"]), parallel_indexes)

pythonpathEnvs = [e["PYTHONPATH" for e in environments]

self.assertEqual([os.pathsep.join(sys.path)] * 4, pythonpathEnvs)

parallelIndexes = set(e["TWISTED_TRIAL_PARALLEL_INDEX"] for e in environments)

self.assertEqual(set(["0", "1", "2", "3"]), parallelIndexes)

adiroiban · 2023-09-04T07:53:31Z

docs/core/howto/testing.rst

@@ -246,3 +246,7 @@ then deletes that schema in the tearDown function, your tests will behave in an
 unpredictable fashion as they tromp upon each other if they have their own
 schema.  And this won't actually indicate a real error in your code, merely a
 testing-specific race-condition.
+
+Trial provides `TWISTED_TRIAL_PARALLEL_INDEX` environment variable to the tests when run in parallel.


I think that this documentation is good enough.

Just a curiosity and for future reference,

Do you know how other testing frameworks are dealing with this issue?

Do they also set an env variable?
Is there some convention for the variable name?

Are there any other things that can help with shared resources?

adiroiban · 2023-09-04T10:26:49Z

I have approved the changes and the PR looks good to me.

I see in #11949 that Glyph and Jean Paul are not happy with the way this is implemented.

I will leave this unmerged waiting for a consensus on how to get this done.

Regards

glyph

As discussed on #11949 , I don't think that this is a good API surface for us to support, and I think the documentation is insufficient. I'd like to see more explanation of whether this is actually necessary or not yet.

There was already some discussion on the ticket I don't see addressed in the review.

adiroiban · 2023-09-05T00:34:27Z

@glyph

There was already some discussion on the ticket I don't see addressed in the review.

Sorry about that.
The first notification was for the PR ... and I did the review without checking the ticket.
My bad. I will check the ticket discussion before doing a review.

thanks for the follow up here.

glyph · 2023-09-05T05:26:38Z

The first notification was for the PR ... and I did the review without checking the ticket. My bad. I will check the ticket discussion before doing a review.

No worries. We should discuss stuff on tickets more, but so many are just perfunctory CI / types / test housekeeping these days that it is an understandable habit to fall in to :)

chevah-robot added the needs-review label Sep 3, 2023

chevah-robot requested a review from a team September 3, 2023 09:41

adiroiban previously approved these changes Sep 4, 2023

View reviewed changes

chevah-robot added needs-merge and removed needs-review labels Sep 4, 2023

adiroiban requested a review from a team September 4, 2023 10:16

chevah-robot added needs-review and removed needs-merge labels Sep 4, 2023

glyph requested changes Sep 4, 2023

View reviewed changes

chevah-robot added needs-changes and removed needs-review labels Sep 4, 2023

Merge branch 'trunk' into twisted-trial-parallel-worker-index

377ea8e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

#11949 trial: Pass parallel trial worker index as env variable #11950

#11949 trial: Pass parallel trial worker index as env variable #11950

p12tic commented Sep 3, 2023

p12tic commented Sep 3, 2023

adiroiban left a comment

adiroiban Sep 4, 2023

adiroiban Sep 4, 2023

adiroiban commented Sep 4, 2023

glyph left a comment •

edited

adiroiban commented Sep 5, 2023

glyph commented Sep 5, 2023

-        self.assertEqual(os.pathsep.join(sys.path), environments[0]["PYTHONPATH"])
-        parallel_indexes = set(e["TWISTED_TRIAL_PARALLEL_INDEX"] for e in environments)
-        self.assertEqual(set(["0", "1", "2", "3"]), parallel_indexes)
+        pythonpathEnvs = [e["PYTHONPATH" for e in environments]
+        self.assertEqual([os.pathsep.join(sys.path)] * 4, pythonpathEnvs)
+        parallelIndexes = set(e["TWISTED_TRIAL_PARALLEL_INDEX"] for e in environments)
+        self.assertEqual(set(["0", "1", "2", "3"]), parallelIndexes)

#11949 trial: Pass parallel trial worker index as env variable #11950

Are you sure you want to change the base?

#11949 trial: Pass parallel trial worker index as env variable #11950

Conversation

p12tic commented Sep 3, 2023

Scope and purpose

p12tic commented Sep 3, 2023

adiroiban left a comment

Choose a reason for hiding this comment

adiroiban Sep 4, 2023

Choose a reason for hiding this comment

adiroiban Sep 4, 2023

Choose a reason for hiding this comment

adiroiban commented Sep 4, 2023

glyph left a comment • edited

Choose a reason for hiding this comment

adiroiban commented Sep 5, 2023

glyph commented Sep 5, 2023

glyph left a comment •

edited