Inconsistent results for a test when running in-suite #842

Closed · SHxKM opened this issue May 17, 2020 · 6 comments

Comments

@SHxKM commented May 17, 2020

Python 3.6.8
pytest==5.4.2
pytest-django==3.9.0
pytest-forked==1.0.2
pytest-socket==0.3.3
pytest-xdist==1.29.0

It seems that one particular test produces different results depending on whether or not I'm running the whole suite.

When running the test like so:

pytest -k "test_name_of_test_func"

The test passes. But the test fails if I run the whole suite:

pytest -vvv

I've been debugging this for the past few hours to no avail. I've removed `--reuse-db --no-migrations` and brought them back; no dice. The behaviour under test has been manually verified, and the test should definitely pass.

Some details:

The test uses indirect parametrization like so:

```python
@pytest.mark.parametrize(
    "profile, expected",
    [("test_some", False), ("test_some_other", True)],
    indirect=["profile"],
)
def test_name_of_test_func(profile, client, expected):
    ...
```

Both `test_some` and `test_some_other` are fixtures defined in `fixtures.py`:

```python
@pytest.fixture
def profile(request):
    return request.getfixturevalue(request.param)


@pytest.fixture(scope="function")
def test_some(db):
    ...


@pytest.fixture(scope="function")
def test_some_other(db):
    ...
```

Any suggestions?

@SHxKM (Author) commented May 17, 2020

Adding more information after some reading:
I'm especially suspicious of `getfixturevalue`, as it's mentioned in an ongoing issue here: pytest-dev/pytest#6492 (comment)

I have no session-scoped fixtures in my suite.

@SalmonMode commented May 17, 2020

Ah, I think I see the problem. It looks like you're inadvertently dependent on test execution order. I would recommend avoiding `getfixturevalue` at all costs, and, more of a nitpick, avoid using "test" in fixture names: they aren't tests, and it can be confusing to anyone reading your test code.

I would also avoid putting the expected result in your parametrization, because different expected results ultimately mean different behavior under test, and it tends to cause problems like this one because of the test structure it requires. I know the docs demonstrate parametrization this way, but it's likely to cause problems in tests with complexity beyond small unit tests.

Pytest also isn't built to handle the results of fixtures as parameter inputs. Parameter inputs are really more for the arguments you would pass into a function, rather than the result of that function.
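For example, a minimal sketch of what that could look like instead (the fixture names and the `check_output` helper are hypothetical stand-ins, not your actual code):

```python
import pytest

# Hypothetical fixtures standing in for test_some / test_some_other;
# the real setup from fixtures.py would go where the `...` bodies are.
@pytest.fixture
def plain_profile(db):
    ...

@pytest.fixture
def special_profile(db):
    ...

# One test per behaviour, each requesting its fixture directly, with the
# expected result stated inline rather than routed through parametrization.
def test_output_without_attribute(plain_profile, client):
    assert check_output(plain_profile, client) is False  # hypothetical check

def test_output_with_attribute(special_profile, client):
    assert check_output(special_profile, client) is True
```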

If you can provide some more detail about what test_some and test_some_other are doing, I can provide a more actionable solution.

@SHxKM (Author) commented May 17, 2020

> Ah, I think I see the problem. It looks like you're inadvertently dependent on test execution order.

Interesting. How so?

> and, more of a nitpick, avoid using "test" in fixture names: they aren't tests, and it can be confusing to anyone reading your test code.

Noted. Thank you.

> I would also avoid putting the expected result in your parametrization, because different expected results ultimately mean different behavior under test, and it tends to cause problems like this one because of the test structure it requires. I know the docs demonstrate parametrization this way, but it's likely to cause problems in tests with complexity beyond small unit tests.

This is the only way I found that lets me keep my tests DRY, as they contain a lot of setup. `test_some` and `test_some_other` are simply Django user objects with a certain attribute that, when present, should change a certain subset of the output given the same inputs (besides the User object, of course).

> I would recommend avoiding `getfixturevalue` at all costs

Is there an alternative to this one? It seemed like a lifesaver when I discovered it.

@SalmonMode commented May 17, 2020

> > Ah, I think I see the problem. It looks like you're inadvertently dependent on test execution order.
>
> Interesting. How so?

I could be wrong, but it seems another test/fixture in your suite is doing something to the cached value of your fixtures, or to the state of a common asset, that wasn't "undone" properly. When a test run in isolation consistently produces one result, but produces a different result when run with other tests, dependence on those other tests is almost always the reason why.
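As a small, self-contained illustration of that failure mode (the fixture and tests here are invented for the example):

```python
import pytest

@pytest.fixture(scope="module")
def settings_state():
    # Cached once per module: every test in this module gets the same dict.
    return {"feature_enabled": True}

def test_disables_feature(settings_state):
    settings_state["feature_enabled"] = False  # mutates the cached value
    assert settings_state["feature_enabled"] is False

def test_assumes_feature_enabled(settings_state):
    # Passes when run in isolation, but fails when test_disables_feature
    # runs first, because that mutation is never "undone".
    assert settings_state["feature_enabled"] is True
```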

> > I would also avoid putting the expected result in your parametrization, because different expected results ultimately mean different behavior under test, and it tends to cause problems like this one because of the test structure it requires. I know the docs demonstrate parametrization this way, but it's likely to cause problems in tests with complexity beyond small unit tests.
>
> This is the only way I found that lets me keep my tests DRY, as they contain a lot of setup. `test_some` and `test_some_other` are simply Django user objects with a certain attribute that, when present, should change a certain subset of the output given the same inputs (besides the User object, of course).

DRY is more of a warning that you may be missing an opportunity for abstraction (i.e. DAMP); it's not just about reducing duplicated code. There's also KISS to keep things straightforward. It's fine, and often a good thing, to embrace having many fixtures and many tests.

I'm not sure I follow the use case, though. It sounds like you have a sort of "if the object has this attribute, then do this; otherwise, do that" situation. Is that the case?

> > I would recommend avoiding `getfixturevalue` at all costs
>
> Is there an alternative to this one? It seemed like a lifesaver when I discovered it.

There's always an alternative. I find that when folks reach for pytest internals, or even some of the hooks, to do certain things, they may have missed another pytest feature, may be missing robust abstraction, or haven't lined up a more effective structure for their fixtures and tests.

I would say not to fear (for lack of a better word) expensive setups (which I'm assuming you have, based on what you said about the tests having a lot of setup), but to leverage them a little more carefully.

For example, I use larger scopes, like class, to represent a resulting state. I only use the class scope for my fixtures, and make sure that my test methods in a given test class only query, and never modify, the state of the SUT. This lets me run as many asserts as I want against a common state without having to worry about one test stepping on the toes of any other. As long as I make sure any state-changing fixtures for that test class run before any test executes (which I use autouse and fixture request chains for), then I'm golden.
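A minimal sketch of that pattern (the class, fixtures, and values are invented for illustration):

```python
import pytest

class TestAccountAfterDeposit:
    @pytest.fixture(scope="class")
    def account(self):
        # Stands in for an expensive setup; built once for the whole class.
        return {"name": "alice", "balance": 0}

    @pytest.fixture(scope="class", autouse=True)
    def deposit(self, account):
        # The state-changing step runs once, before any test in this class,
        # because it's autouse and requests the account fixture explicitly.
        account["balance"] += 100

    # The tests below only query the shared state; none of them modify it.
    def test_balance_updated(self, account):
        assert account["balance"] == 100

    def test_name_unchanged(self, account):
        assert account["name"] == "alice"
```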

I don't think any one of these suggestions answers your question, but I feel collectively they might. Let me know if not, and we can dive in a bit further.

@SHxKM (Author) commented May 17, 2020

@SalmonMode Thanks for taking the time to answer and educate.

> I could be wrong, but it seems another test/fixture in your suite is doing something to the cached value of your fixtures, or to the state of a common asset, that wasn't "undone" properly. When a test run in isolation consistently produces one result, but produces a different result when run with other tests, dependence on those other tests is almost always the reason why.

I guess the only question that remains is: is `getfixturevalue` supposed to behave like this? If it is, then I misunderstood it. If it’s not, then it’s a bug in pytest? Maybe the library shouldn’t advertise a feature that some say should be avoided at all costs?

@SalmonMode commented May 17, 2020

The cause of the issue I mentioned isn't directly related to `getfixturevalue`, since that's the same method used inside pytest's normal fixture logic. Something not being "undone" properly is usually an issue with the logic in user-defined fixtures, in that the teardown logic (usually placed after the `yield` statement) wasn't sufficient (e.g. deleting the cache and cookies in a Selenium driver session rather than quitting the session entirely). It could also be a matter of the scope of those fixtures (e.g. a session-scoped fixture that one test modified but another test assumes is untouched), or of how their dependencies are defined (i.e. the other fixtures they request, and the fixtures that those fixtures request, and so on) not creating an explicit and deterministic order of operations. It's tricky to identify, but these are usually the cause.
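To make the teardown point concrete, a sketch of that shape (assumes Selenium is installed; the fixture is purely illustrative):

```python
import pytest

@pytest.fixture
def browser():
    from selenium import webdriver

    driver = webdriver.Chrome()
    yield driver
    # Insufficient teardown would look like:
    #     driver.delete_all_cookies()
    # which clears cookies but leaves the browser session (and any other
    # accumulated state) alive for whichever test runs next.
    driver.quit()  # tear down the whole session so nothing leaks
```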

There are some issues in pytest that pytest-dev/pytest#6551 should fix, and some in pytest-django that #807 should fix in tandem with that one, and they can be the cause of this as well. But since you used both of the PRs I mentioned and still had the issue, I think it's likely the issues I mentioned above.

That said, I agree that pytest shouldn't advertise such features when there are simpler and safer ways to achieve the same results. But it's an enormous investment to come up with a sanctioned, standardized structure for pytest tests and fixtures, and good educational material to teach users. Also, it's not my call.

SHxKM closed this as completed May 18, 2020