Scrutineer: integrating opportunistic fault-localisation with PBT #2859

Zac-HD · 2021-02-16T09:47:18Z

I'm currently working on a paper about integrating fault localisation with property-based testing - TLDR, it's pretty easy for PBT libraries to suggest where to start debugging test failures.

It turns out that my simple baseline version is also reliable, useful, and easy to interpret. On that basis I thought it makes sense to ship it, albeit disabled-by-default. I've tried this on a variety of toy examples and some real bugs in black; reviews and/or feedback on how this works for your problems would be most welcome 🙂

Future plans for other PRs: adding fancier techniques, if they're reliable and fast and actually work better; and integration with generalised examples (#2192). As currently planned, all of these would go in a single "explain phase" - they share a purpose and workflow and therefore aren't individually configurable.

sobolevn

This feature looks amazing! I would love to try it, but I don't have any failing tests at hand 😆

sobolevn · 2021-02-16T13:23:43Z

hypothesis-python/docs/settings.rst

@@ -71,7 +74,7 @@ with each phase corresponding to a value on the :class:`~hypothesis.Phase` enum:
 3. ``Phase.generate`` controls whether new examples will be generated.
 4. ``Phase.target`` controls whether examples will be mutated for targeting.
 5. ``Phase.shrink`` controls whether examples will be shrunk.
-
+6. ``Phase.explain`` controls whether Hypothesis attempts to explain test failures.


I would love to see some examples of how explain results look like somewhere in the docs. Does it fit there?

I was planning on showing it off in a blog post - there's nowhere obvious to put it in the docs, and the exact output format is pretty simple:

from hypothesis import Phase, given, strategies as st @given(st.integers()) def test_reports_branch_in_test(x): if x > 10: raise AssertionError # BUG

(Obviously this is a toy example, but it's been useful on real projects too)

_________________________ test_reports_branch_in_test _________________________ Traceback (most recent call last): ... AssertionError --------------------------------- Hypothesis ---------------------------------- Falsifying example: test_reports_branch_in_test( x=11, ) Explanation: These lines were always and only run by failing examples: /path/to/test_file.py:6

One consideration in "what do we report" is that this format (usually) allows you to click on terminal output and have the relevant file open to that line in your preferred editor.

The alternative approach of reporting branches (source/destination pairs of lines) is only rarely more precise in practice, and much more difficult to explain to non-expert users. "report branches if there are no reportable lines" would be a nice trick to explore in future, though.

sobolevn · 2021-02-17T08:32:00Z

hypothesis-python/tests/cover/test_scrutineer.py

+    assert len(expected) == code.count(BUG_MARKER)
+    print(pytest_stdout)
+    for report in expected:
+        assert report in pytest_stdout


Maybe https://github.com/syrusakbary/snapshottest will be a good fit here?

It is great for testing the output! ⭐

The catch is that we only want to test parts of the output, i.e. the explanation but not anything about the actual file paths.

(The other catch is that at present explain mode skips the tracing if sys.gettrace() is not None... which makes it compatible with debuggers and also hides it from coverage. Hmmm.)

Zac-HD · 2021-02-17T12:54:15Z

Ugh. The actual implementation of C trace-functions is tricky enough that I think we actually can't reliably swap in the explain-tracer for e.g. the coverage tracer, not least due to decade-old CPython issues.

Being a basic-but-useful system for fault localisation.

Zac-HD · 2021-03-04T13:07:59Z

@HypothesisWorks/hypothesis-python-contributors - final call for review! I'd love another set of eyes on this and an approving review, but I'll eventually merge it anyway if there are no objections 🙂

Zac-HD added the new-feature entirely novel capabilities or strategies label Feb 16, 2021

Zac-HD requested a review from DRMacIver as a code owner February 16, 2021 09:47

Zac-HD force-pushed the scrutineer-poc branch 2 times, most recently from e853b0c to 64dbcc8 Compare February 16, 2021 11:36

sobolevn reviewed Feb 16, 2021

View reviewed changes

Zac-HD force-pushed the scrutineer-poc branch 5 times, most recently from e4868af to 136b1f3 Compare February 17, 2021 07:54

sobolevn reviewed Feb 17, 2021

View reviewed changes

Zac-HD force-pushed the scrutineer-poc branch 2 times, most recently from 56b4404 to a895b9e Compare February 22, 2021 00:57

Initial Scrutineer

c923932

Being a basic-but-useful system for fault localisation.

Zac-HD force-pushed the scrutineer-poc branch from a895b9e to c923932 Compare March 4, 2021 12:01

Stranger6667 approved these changes Mar 4, 2021

View reviewed changes

Zac-HD merged commit e2622c8 into HypothesisWorks:master Mar 7, 2021

Zac-HD deleted the scrutineer-poc branch March 7, 2021 04:41

Zac-HD mentioned this pull request Aug 12, 2022

Skip known-uninformative modules in Scrutineer explain mode #3439

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scrutineer: integrating opportunistic fault-localisation with PBT #2859

Scrutineer: integrating opportunistic fault-localisation with PBT #2859

Zac-HD commented Feb 16, 2021 •

edited

sobolevn left a comment

sobolevn Feb 16, 2021 •

edited

Zac-HD Feb 17, 2021 •

edited

sobolevn Feb 17, 2021

sobolevn Feb 17, 2021

Zac-HD Feb 17, 2021

Zac-HD commented Feb 17, 2021 •

edited

Zac-HD commented Mar 4, 2021

Scrutineer: integrating opportunistic fault-localisation with PBT #2859

Scrutineer: integrating opportunistic fault-localisation with PBT #2859

Conversation

Zac-HD commented Feb 16, 2021 • edited

sobolevn left a comment

Choose a reason for hiding this comment

sobolevn Feb 16, 2021 • edited

Choose a reason for hiding this comment

Zac-HD Feb 17, 2021 • edited

Choose a reason for hiding this comment

sobolevn Feb 17, 2021

Choose a reason for hiding this comment

sobolevn Feb 17, 2021

Choose a reason for hiding this comment

Zac-HD Feb 17, 2021

Choose a reason for hiding this comment

Zac-HD commented Feb 17, 2021 • edited

Zac-HD commented Mar 4, 2021

Zac-HD commented Feb 16, 2021 •

edited

sobolevn Feb 16, 2021 •

edited

Zac-HD Feb 17, 2021 •

edited

Zac-HD commented Feb 17, 2021 •

edited