[test] add a line matcher object #12219

picnixz · 2024-03-31T17:46:49Z

Here's the line matching feature.

TL;DR: Usage is

app.stderr(flavor='re').assert_match('.*my message')

for the warnings, for instance. It first removes the colors automatically and then searches for a line matching the above pattern. The 'flavor' could perhaps default to 're', or there could be a dedicated assert_re_match method, but that's how you would use the object in general.
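
To make the behaviour described above concrete, here is a minimal sketch of the idea, not the PR's actual implementation. The names strip_ansi, matching_lines and assert_line_matches are hypothetical, and the ANSI regex only covers simple color sequences.

```python
import re

# Matches simple ANSI color sequences such as "\x1b[31m" or "\x1b[39;49;00m".
_ANSI_COLOR = re.compile(r'\x1b\[[0-9;]*m')


def strip_ansi(text: str) -> str:
    """Remove ANSI color escape sequences from *text* (hypothetical helper)."""
    return _ANSI_COLOR.sub('', text)


def matching_lines(output: str, pattern: str) -> list[str]:
    """Return the cleaned lines of *output* that match the regex *pattern*."""
    regex = re.compile(pattern)
    lines = strip_ansi(output).strip().splitlines()
    # re.match anchors at the start of the line, hence the leading '.*' below.
    return [line for line in lines if regex.match(line)]


def assert_line_matches(output: str, pattern: str) -> None:
    """Assert that at least one cleaned line of *output* matches *pattern*."""
    assert matching_lines(output, pattern), f'no line matching {pattern!r}'


stderr = '\x1b[31mWARNING: my message\x1b[39;49;00m\nother line\n'
assert_line_matches(stderr, '.*my message')
```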

There are some options with default values, but I'm not sure what those defaults should be. For now, the line matcher object for the test application uses the default options, but maybe I can pick better defaults for it.

@chrisjsewell, I'd suggest you have a look at the tests (tests/test_testing/test_matcher.py) to see how it would be used. It's still WIP, but I would like some feedback on what to expose. With the current API you can more or less match whatever you want, however you want, but it's better if you don't need to add options or if you have dedicated methods. Since you write extensions, I'd be glad if you could tell me which functionalities you would definitely want.

In the tests I used the most precise match information (namely the Line + offset) but you can ignore the offset and simply match against strings (see the tests for Line and Block objects).

I'm still wondering how to have a "nice" error context when the match fails... well, if you don't need the regex support you can just write something like app.stderr().lines() == [...]. Maybe I can also expose a simple interface for simple matching.

picnixz · 2024-03-31T17:53:13Z

FYI: some changes from open PRs are included here, so the diff will be smaller once they are merged. In addition, with the current implementation, it's easy for me to add some "shortcut" functions (the main logic is essentially done, except for the nice diff with regex patterns).

picnixz · 2024-04-01T17:38:12Z

So, after some investigation, it's quite hard to have a "nice" diff because of regexes. I'll probably make another PR for that one because it's much more complex than what I thought at first (I think I'll take some inspiration from difflib, though it's not on my tasklist now).

I'll wait for some feedback before coming back to that PR.

EDIT: I've found various typos in the documentation so I'll update it tomorrow.

sphinx/testing/_matcher/buffer.py

picnixz · 2024-04-02T17:31:18Z

After considering #12216, I'll update this PR as well.

picnixz · 2024-04-10T07:43:47Z

Sure! I just pushed some docs because there were multiple typos, and I refactored things a bit. By the way, thanks to that I also noticed a bug in autodoc... so I fixed it (I've got 3 PRs that need to be merged before this one just for the docs to be rendered correctly).

jayaddison · 2024-04-20T13:58:20Z

TL;DR: Usage is

app.stderr(flavor='re').assert_match('.*my message')

Checking some of the design reasoning for the method signature:

A reason to prefer the caller to select a flavor is that otherwise, a matcher pattern of type str could ambiguously be either a regex or a simple string match (app.stderr.assert_match('A*')).
Using keyword arguments to select the flavor (app.stderr.assert_match(string='[not a regex]')) might be non-obvious, and would prevent positional single-argument usage like the example above.
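
The ambiguity mentioned above can be shown with the standard re module alone. This is only an illustration of why the same str reads differently as a regex and as a literal; it does not use the matcher API.

```python
import re

line = 'AAA'
pattern = 'A*'

# Read as a regex, 'A*' means "zero or more 'A'": it matches 'AAA', and it
# also matches (an empty prefix of) a string that contains no 'A' at all.
assert re.match(pattern, line) is not None
assert re.match(pattern, 'BBB') is not None

# Read literally, the two characters 'A' and '*' must appear verbatim.
assert pattern not in line
assert pattern in 'see A* search'

# re.escape expresses the literal reading as a regex.
assert re.search(re.escape(pattern), 'see A* search') is not None
assert re.search(re.escape(pattern), line) is None
```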

sphinx/testing/matcher/__init__.py

tests/test_testing/test_matcher_engine.py

picnixz · 2024-04-20T16:15:42Z

A reason to prefer the caller to select a flavor is that otherwise, a matcher pattern of type str could ambiguously be either a regex or a simple string match (app.stderr.assert_match('A*')).

I don't know which flavor to have by default... I would like to say "yay, it's maybe better to have a pure string flavor" because you usually want to have an exact match (I think most of our tests have exact matching and I think people don't want to escape possible meta-characters...).

One possibility that I had in mind is just to have other methods:

assert_match
assert_equal

But I'm not very happy with the equal itself... (ideally, it's assert_that_there_is_a_line_equal_to_one_of_the_strings_or_patterns)

Using keyword arguments to select the flavor (app.stderr.assert_match(string='[not a regex]')) might be non-obvious, and would prevent positional single-argument usage like the example above.

I think assert_match(expect, flavor=...) is more natural but I'm open to suggestions. Alternatively, we could have assert_match(regex='whatever to be considered a regex', string='whatever string it should be', fnmatch='whatever fnmatch pattern'), but I'm not sure whether more flavors should be supported. I tried to keep the interface as flavor-agnostic as possible and to delegate the conversion of string-like objects to re.Pattern objects to whatever you deem fit. Maybe I could even use predicate-based matching, because I don't think I use anything other than pattern.match.
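
As a sketch of what "converting string-like objects to pattern objects" could look like, the snippet below maps each flavor to a compiled regex using only the standard library. The names Flavor, to_pattern and line_matches are hypothetical, and treating the literal flavor as whole-line equality is an assumption, not something confirmed by the PR.

```python
import fnmatch
import re
from typing import Literal

Flavor = Literal['literal', 're', 'fnmatch']


def to_pattern(expect: str, flavor: Flavor) -> re.Pattern[str]:
    """Compile *expect* according to *flavor* (hypothetical helper)."""
    if flavor == 'literal':
        # Assumption: a literal must equal the whole line, hence the \Z anchor.
        return re.compile(re.escape(expect) + r'\Z')
    if flavor == 'fnmatch':
        # fnmatch.translate converts a shell-style pattern to a regex string.
        return re.compile(fnmatch.translate(expect))
    if flavor == 're':
        return re.compile(expect)
    raise ValueError(f'unknown flavor: {flavor!r}')


def line_matches(line: str, expect: str, flavor: Flavor) -> bool:
    """Check *line* with pattern.match only, whatever the flavor."""
    return to_pattern(expect, flavor).match(line) is not None


assert line_matches('a.b', 'a.b', 'literal')
assert not line_matches('axb', 'a.b', 'literal')
assert line_matches('axb', 'a.b', 're')
assert line_matches('index.rst', '*.rst', 'fnmatch')
```

With this shape, the matching code only ever sees a compiled pattern, so adding a flavor means adding one branch to the conversion.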

jayaddison · 2024-04-23T22:08:41Z

I think assert_match(expect, flavor=...) is more natural but I'm open to suggestions.

I think I'd probably prefer a developer experience of:

assert_contains(string) to check for lines containing string.
assert_matches(pattern) to check for lines that match a regular expression pattern.

That is: binding the flavour to the method name. That's partly because it could help me to think about and write / read code, but also there are some second-order downstream static analysis/dependency benefits (tooling/analysts can infer to some degree that assert_contains does not enter finite automata logic originated from Sphinx, for example).

AA-Turner · 2024-04-24T06:41:56Z

I am probably missing some context here -- can someone point me to a brief overview of what this PR is for? The body text of the description simply says here is the matcher!

I'm aware Bénédikt has his thesis coming up soon, so no rush, just would be interested in the background given it's a 3000 line PR.

A

picnixz · 2024-04-24T06:56:05Z

I am probably missing some context here

Oh yes, I think I never created an issue for that and just rushed to the implementation as if it were a hackathon. So, here's the context: we were doing some cleanups about colorization and co and I observed that many tests always have the same logic, namely:

lines = strip_colors(app.status.getvalue()).strip().splitlines()

and then, they match the lines one by one. All those lines could be replaced by a single function, let's say app.get_status_lines(), and that would be it. But since I had time on my hands (and I wanted to have something for my local dev as well), I ended up implementing a line matcher where you can do more than just match string by string: you can also match blocks in one go, match with regexes, and so on, and get a nicer diff. The original idea was based on the pytester object that I used for testing plugins.
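
To illustrate what "matching blocks in one go" could mean, here is a minimal sketch that looks for a contiguous run of expected lines. The function find_block is a hypothetical name and only does exact string comparison; the PR's matcher is more general.

```python
from collections.abc import Sequence


def find_block(lines: Sequence[str], block: Sequence[str]) -> int:
    """Return the index where *block* starts in *lines*, or -1 if absent."""
    width = len(block)
    for start in range(len(lines) - width + 1):
        # Compare a window of the same size as the block.
        if list(lines[start:start + width]) == list(block):
            return start
    return -1


lines = ['building [html]', 'writing output', 'build succeeded', 'done']
assert find_block(lines, ['writing output', 'build succeeded']) == 1
assert find_block(lines, ['build succeeded', 'writing output']) == -1
```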

Technically speaking, the original issue could be solved by just adding two methods to SphinxTestApp that would just give you the lines. I can do a small PR if you think the 3k lines are too much. Actually, the interface of the matcher object is flexible enough that we can gradually add more methods if needed, but I essentially built it so that the most common operations are done by default, namely "remove colors -> strip -> split lines without keeping line breaks" (well, this default sequence is only for the matcher object for the test application, because people might want to use that matcher for any other kind of string technically).
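
The default sequence "remove colors -> strip -> split lines without keeping line breaks" can be sketched as a small helper with switchable steps. The name clean_lines and its keyword options are hypothetical, and the ANSI regex only handles simple color sequences.

```python
import io
import re

# Matches simple ANSI color sequences such as "\x1b[01m" or "\x1b[39;49;00m".
_ANSI_COLOR = re.compile(r'\x1b\[[0-9;]*m')


def clean_lines(
    text: str, *, strip_colors: bool = True, strip: bool = True, keepends: bool = False
) -> list[str]:
    """Apply the cleaning steps in order and return the resulting lines."""
    if strip_colors:
        text = _ANSI_COLOR.sub('', text)
    if strip:
        text = text.strip()
    return text.splitlines(keepends)


# A StringIO stands in for the application's status stream here.
status = io.StringIO('\x1b[01mbuilding [html]\x1b[39;49;00m\ndone\n\n')
assert clean_lines(status.getvalue()) == ['building [html]', 'done']
```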

I also think that it could be useful for matching autodoc outputs. I created a class for formatting expected RST output for enumeration test cases and I thought "how nice it would be to be able to do that but for other autodoc outputs as well", so I think the matcher object can be used in conjunction with those factory classes in a more user-friendly API. At least, tests would be easier to write (and probably shorter). Also, while the pytest output is fine, sometimes it's hard to detect exactly where the error happened in the huge diff (the line matcher object would "highlight" the erroneous blocks; for now the logic is simple enough but I intended to do it using diff-like algorithms).
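
One way to picture the "highlight the erroneous blocks" idea with a diff-like algorithm is difflib.SequenceMatcher from the standard library. This is only a sketch for plain strings; differing_blocks is a hypothetical name and regex patterns are not handled.

```python
import difflib
from collections.abc import Sequence


def differing_blocks(
    expected: Sequence[str], actual: Sequence[str]
) -> list[tuple[str, list[str], list[str]]]:
    """Return (tag, expected_part, actual_part) for every non-equal block."""
    matcher = difflib.SequenceMatcher(a=expected, b=actual, autojunk=False)
    return [
        (tag, list(expected[i1:i2]), list(actual[j1:j2]))
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != 'equal'
    ]


expected = ['a', 'b', 'c', 'd']
actual = ['a', 'x', 'c', 'd']
assert differing_blocks(expected, actual) == [('replace', ['b'], ['x'])]
```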

I'm aware Bénédikt has his thesis coming up soon, so no rush, just would be interested in the background given it's a 3000 line PR.

Yes, I'm currently writing it!

picnixz · 2024-04-24T06:57:01Z

@jayaddison

That is: binding the flavour to the method name. That's partly because it could help me to think about and write / read code, but also there are some second-order downstream static analysis/dependency benefits (tooling/analysts can infer to some degree that assert_contains does not enter finite automata logic originated from Sphinx, for example).

The argument is sound. I can make the flavor-agnostic method private and expose dispatcher methods.
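
A possible shape for "a private flavor-agnostic method plus public dispatcher methods" is sketched below. The class name LineMatcherSketch and the private _assert_any are hypothetical; assert_contains and assert_matches follow the names suggested in the review. The private method takes a predicate, so the substring check never touches the re module.

```python
import re
from collections.abc import Callable, Sequence


class LineMatcherSketch:
    """Hypothetical matcher: the flavor is bound to the public method name."""

    def __init__(self, lines: Sequence[str]) -> None:
        self.lines = list(lines)

    def _assert_any(self, predicate: Callable[[str], bool], what: str) -> None:
        # Flavor-agnostic core: it only knows about a per-line predicate.
        assert any(predicate(line) for line in self.lines), f'no line {what}'

    def assert_contains(self, string: str) -> None:
        """Assert that some line contains *string* verbatim."""
        self._assert_any(lambda line: string in line, f'contains {string!r}')

    def assert_matches(self, pattern: str) -> None:
        """Assert that some line matches the regular expression *pattern*."""
        regex = re.compile(pattern)
        self._assert_any(
            lambda line: regex.match(line) is not None, f'matches {pattern!r}'
        )


matcher = LineMatcherSketch(['WARNING: [not a regex]', 'build succeeded'])
matcher.assert_contains('[not a regex]')
matcher.assert_matches(r'build \w+')
```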

picnixz added 10 commits March 30, 2024 11:05

enhance ANSI functions

30a4671

remove # type: ignore[attr-defined] for colors

45ea39b

add tests for ANSI strippers

86c3efe

add matcher objects for tests

55099de

enhance ANSI functions

0fe03c0

remove # type: ignore[attr-defined] for colors

2c1df50

add tests for ANSI strippers

73f3d88

Merge branch 'fix/ansi-functions' into feat/line-matcher

d4a277b

update

fb92ab5

cleanup

f4309fb

picnixz added 7 commits April 1, 2024 13:35

split tests and utils into modules

b9ce7ec

Merge remote-tracking branch 'upstream/master' into fix/ansi-functions

6972a22

Merge branch 'fix/ansi-functions' into feat/line-matcher

c48e6ac

add explicit typing-extensions dependency

4c3566a

cleanup

bef6b17

fixup

bf8de84

Update documentation and make buffers more efficient.

9fd372a

picnixz added 8 commits April 2, 2024 14:24

fix various bugs

5232421

fixup

3695617

fix bugs

c8cb941

fix bugs

83b5003

simplify ANSI handling

7edf273

remove complicated stuff

dc37de5

fixup

7ce24f3

fixup

846edef

revert

5752a51

picnixz force-pushed the feat/line-matcher branch from f654399 to 5752a51 Compare April 10, 2024 07:41

fixup

0b0b5a1

This was referenced Apr 10, 2024

Improve sphinx.util.inspect [part 1] #12256

Merged

7.3.0 release plan #12242

Closed

picnixz added 3 commits April 15, 2024 09:52

Merge branch 'master' into feat/line-matcher

f66885c

Merge branch 'master' into feat/line-matcher

310f589

cleanup

b85e80a

jayaddison reviewed Apr 20, 2024

sphinx/testing/matcher/__init__.py

jayaddison reviewed Apr 20, 2024

tests/test_testing/test_matcher_engine.py

picnixz added 13 commits April 28, 2024 00:35

Merge branch 'master' into feat/line-matcher

5a2d07b

Update test_matcher_buffer.py

488343f

Update test_matcher_cleaner.py

b3967c4

Update test_matcher_buffer.py

dbeba16

Update __init__.py

963c4ce

improve python-code roles

810ca5a

simplify op-codes logic

843e0c7

use type aliases

35673f0

changed 'none' flavor to 'literal'

6c50e61

Update API according to comments

acdc53f

update tests

35fe86b

remove unused code

466bcc3

Merge branch 'master' into feat/line-matcher

1e2cf99
