Add a Hypothesis plugin #2097

Zac-HD · 2020-11-07T00:20:57Z

This patch adds a plugin which teaches Hypothesis how to generated examples of Pydantic's custom field types (which closes #2017). Note that there is no runtime impact or dependency; this module is imported by Hypothesis so it only exists at test-time and only when Hypothesis is both installed and actually being used.

Unit tests for the changes exist
Tests pass on CI and coverage remains at 100%
Documentation reflects the changes where applicable
changes/<pull request or issue id>-<github username>.md file added describing change
(see changes/README.md for details)

I've ended up dropping the integration for URLs, and for constrained lists and sets... but everything else is ready to go!

Why not support constrained lists and sets?

In short, I couldn't get them to play nicely with Hypothesis' type registry - because the runtime objects are subclasses of a parametrized generic type, and aren't always distinguished by our introspection logic 😭

Why I decided to leave our URLs (with code)

I spent a long time tweaking support for URLs, but ultimately gave it up: it's easy to generate valid URLs, unless you want to generally really strange ones which will find lots of bugs. I decided that it was better to make users register their own - and make the tradeoff explicit - than to cover up the complexity and have automatic but weak tests. This is a standard design choice for Hypothesis, sadly, so I've included my code so far below for the benefit of any future contributor who wants to pick it up.

def idna_encodable(s: str) -> bool:
    # We only need this because the regex patterns aren't fully precise; but
    # rejection sampling is a LOT easier to implement than precise patterns.
    try:
        s.encode('idna')
    except Exception:
        hypothesis.reject()  # type: ignore[no-untyped-call]
    return True


@resolves(pydantic.AnyUrl)
def resolve_anyurl(cls):  # type: ignore[no-untyped-def]
    domains = st.one_of(
        st.from_regex(ascii_domain_regex(), fullmatch=True),
        st.from_regex(int_domain_regex(), fullmatch=True).filter(idna_encodable),
    )
    if cls.tld_required:

        def has_tld(s: str) -> bool:
            assert isinstance(s, str)
            match = ascii_domain_regex().fullmatch(s) or int_domain_regex().fullmatch(s)
            return bool(match and match.group('tld'))

        hosts = domains.filter(has_tld)
    else:
        hosts = domains | st.from_regex(
            r'(?P<ipv4>(?:\d{1,3}\.){3}\d{1,3})' r'|(?P<ipv6>\[[A-F0-9]*:[A-F0-9:]+\])',
            fullmatch=True,
        )

    return st.builds(
        cls.build,
        scheme=(
            st.sampled_from(sorted(cls.allowed_schemes))
            if cls.allowed_schemes
            else st.from_regex(r'(?P<scheme>[a-z][a-z0-9+\-.]+)', fullmatch=True)
        ).filter(idna_encodable),
        user=st.one_of(
            st.nothing() if cls.user_required else st.none(),
            st.from_regex(r'(?P<user>[^\s:/]+)', fullmatch=True).filter(idna_encodable),
        ),
        password=st.none() | st.from_regex(r'(?P<password>[^\s/]*)', fullmatch=True).filter(idna_encodable),
        host=hosts,
        port=st.none() | st.integers(0, 2 ** 16 - 1).map(str),
        path=st.none() | st.from_regex(r'(?P<path>/[^\s?]*)', fullmatch=True).filter(idna_encodable),
        query=st.none() | st.from_regex(r'(?P<query>[^\s#]+)', fullmatch=True).filter(idna_encodable),
        fragment=st.none() | st.from_regex(r'(?P<fragment>\S+)', fullmatch=True).filter(idna_encodable),
    ).filter(lambda url: cls.min_length <= len(url) <= cls.max_length)


st.register_type_strategy(pydantic.AnyUrl, resolve_anyurl)
st.register_type_strategy(pydantic.AnyHttpUrl, resolve_anyurl)
st.register_type_strategy(pydantic.HttpUrl, resolve_anyurl)
st.register_type_strategy(pydantic.PostgresDsn, resolve_anyurl)
st.register_type_strategy(pydantic.RedisDsn, resolve_anyurl)

def gen_url_models():
    class AnyUrlModel(pydantic.BaseModel):
        anyurl: pydantic.AnyUrl

    class AnyHttpUrlModel(pydantic.BaseModel):
        anyhttp: pydantic.AnyHttpUrl

    class HttpUrlModel(pydantic.BaseModel):
        http: pydantic.HttpUrl

    class PostgresDsnModel(pydantic.BaseModel):
        postgres: pydantic.PostgresDsn

    class RedisDsnModel(pydantic.BaseModel):
        redis: pydantic.RedisDsn

    yield from (AnyUrlModel, AnyHttpUrlModel, HttpUrlModel, PostgresDsnModel, RedisDsnModel)


@pytest.mark.parametrize('model', gen_url_models())
@settings(suppress_health_check=[HealthCheck.filter_too_much, HealthCheck.too_slow])
@given(data=st.data())
def test_can_construct_urls_model(data, model):
    # This is a separate test because we want a minimal health-check exemption
    instance = data.draw(st.from_type(model))
    assert isinstance(instance, model)

`Literal[None]` requires Python 3.7+ or a recent version of Hypothesis

Because I literally just fixed that this evening - since it requires a backport on a security-only Python version, we delayed working out how to support it until we knew it would actually be used before the 3.6 EOL later this year.

codecov · 2020-11-07T08:27:34Z

Codecov Report

Merging #2097 (6cece5c) into master (d0baf0f) will decrease coverage by 0.11%.
The diff coverage is 96.03%.

@@             Coverage Diff             @@
##            master    #2097      +/-   ##
===========================================
- Coverage   100.00%   99.88%   -0.12%     
===========================================
  Files           21       22       +1     
  Lines         4202     4323     +121     
  Branches       855      873      +18     
===========================================
+ Hits          4202     4318     +116     
- Misses           0        5       +5

Impacted Files	Coverage Δ
pydantic/_hypothesis_plugin.py	`95.68% <95.68%> (ø)`
pydantic/types.py	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update d0baf0f...6cece5c. Read the comment docs.

PrettyWood · 2020-11-26T22:32:22Z

Amazing work @Zac-HD! So glad to see this PR open!
Could you please check the coverage to make sure it remains at 100% please?

Zac-HD · 2020-11-27T01:18:48Z

@PrettyWood - thanks! I'm looking forward to seeing what people do with it 😃

There was one deliberately unreachable line (assertion that we returned from a loop body), which I've unrolled so we get 100% coverage without needing lots of pragmas.

samuelcolvin

Just a few small things, overall I think this is looking great.

docs/examples/hypothesis_property_based_test.py

docs/hypothesis_plugin.md

pydantic/_hypothesis_plugin.py

pydantic/color.py

setup.py

tests/test_hypothesis_plugin.py

Zac-HD · 2020-12-01T11:25:10Z

OK @samuelcolvin - I think I'm done, and have reverted the #2155 patch in favor of a test-only solution 😄

tests/test_hypothesis_plugin.py

lsorber · 2020-12-20T10:48:54Z

@Zac-HD I was wondering, would it be possible to add generic support for constrained types like ConstrainedStr. For example:

st.register_type_strategy(
    ConstrainedStr,
    lambda T: st.from_regex(T.regex, fullmatch=True) if T.regex is not None else st.text(min_size=T.min_length, max_size=T.max_length)
)

Unfortunately, that type strategy is not picked up on for subclasses of ConstrainedStr:

class MyString(ConstrainedStr):
    regex = "[A-Z]{2,8}"

st.from_type(MyString).example()  # Returns empty string

This doesn't need to hold back this PR, but it is related so I thought I'd post it here.

Zac-HD · 2020-12-20T11:38:48Z

I was all ready to explain why we couldn't, but in fact there is a way... it's just a little more complicated than you might expect. First, the obvious approach of registering ConstrainedStr doesn't do anything for child classes. That's correct, if unfortunate in this case - otherwise we could just return builds(object) for everything!

So the trick is to register a strategy for each child class when that class is created, which can be done from the __init__ method of a metaclass. The second trick is to ensure that this is a noop unless the user is already using Hypothesis, and that can be done with a WeakSet and a plugin (see https://github.com/Parquery/icontract/pull/181/files plus https://github.com/mristin/icontract-hypothesis/pull/5/files for an example).

I'll update this PR in a week or two, or if you'd like to merge it sooner I can open a follow-up instead 😁

PrettyWood · 2021-01-03T11:31:02Z

Great job again @Zac-HD! I'll probably use it as soon as v1.8 is released 🚀
For conset and conlist, I think it will be possible to support them in the plugin once rewritten with Generic[T]. We are currently waiting for cython to support it

Makefile

pydantic/_hypothesis_plugin.py

Zac-HD · 2021-01-04T00:36:59Z

Thanks for the review @PrettyWood! I've added your suggestions, along with even more comments to explain what's happening 😄

tests/requirements-linting.txt

samuelcolvin · 2021-02-11T12:34:02Z

this is awesome, thank you so much! 🚀 🙏 🥳

I'm working through PRs now, v1.8 coming soon.

Zac-HD · 2021-02-11T12:35:58Z

Woohoo! Pydantic is also awesome, I'm so excited about combining it with Hypothesis 😁

Can't wait to see what people do with this either

Zac-HD force-pushed the hypothesis-plugin branch 4 times, most recently from 55517f8 to e7a11a5 Compare November 7, 2020 08:26

Zac-HD force-pushed the hypothesis-plugin branch 7 times, most recently from c54f1ba to 8232803 Compare November 8, 2020 07:54

Zac-HD force-pushed the hypothesis-plugin branch from 8232803 to 318bc56 Compare November 27, 2020 01:02

samuelcolvin reviewed Nov 29, 2020

View reviewed changes

Zac-HD mentioned this pull request Nov 30, 2020

Tighten color regex #2155

Closed

4 tasks

Zac-HD force-pushed the hypothesis-plugin branch 7 times, most recently from b835b9d to 1b10c1e Compare December 1, 2020 11:18

lsorber reviewed Dec 3, 2020

View reviewed changes

tests/test_hypothesis_plugin.py Show resolved Hide resolved

Zac-HD force-pushed the hypothesis-plugin branch 2 times, most recently from 0d5f569 to eb7aaca Compare December 30, 2020 13:56

Zac-HD force-pushed the hypothesis-plugin branch 2 times, most recently from 81efff5 to 4e193cf Compare January 3, 2021 11:05

PrettyWood reviewed Jan 3, 2021

View reviewed changes

Makefile Outdated Show resolved Hide resolved

PrettyWood reviewed Jan 3, 2021

View reviewed changes

pydantic/_hypothesis_plugin.py Outdated Show resolved Hide resolved

PrettyWood reviewed Jan 3, 2021

View reviewed changes

pydantic/_hypothesis_plugin.py Outdated Show resolved Hide resolved

Zac-HD force-pushed the hypothesis-plugin branch from 4e193cf to 134942d Compare January 4, 2021 00:11

Zac-HD added 3 commits January 4, 2021 11:20

Configure Hypothesis

b712371

Hypothesis plugin docs

589f017

Add Hypothesis plugin

af5c8e3

Zac-HD force-pushed the hypothesis-plugin branch from 134942d to af5c8e3 Compare January 4, 2021 00:20

PrettyWood reviewed Jan 4, 2021

View reviewed changes

tests/requirements-linting.txt Show resolved Hide resolved

Zac-HD force-pushed the hypothesis-plugin branch 2 times, most recently from 032870b to af5c8e3 Compare January 4, 2021 00:53

PrettyWood added the ready for review label Jan 19, 2021

Zac-HD mentioned this pull request Jan 25, 2021

Contracts in API with icontract tiangolo/fastapi#1996

Closed

Zac-HD mentioned this pull request Feb 3, 2021

Class with parameter of type of another class HypothesisWorks/hypothesis#2851

Closed

Merge branch 'master' into Zac-HD-hypothesis-plugin

6cece5c

samuelcolvin merged commit 771b0d3 into pydantic:master Feb 11, 2021

Zac-HD deleted the hypothesis-plugin branch February 11, 2021 12:36

samuelcolvin mentioned this pull request Feb 11, 2021

make hypothesis optional for testing #2343

Merged

Bobronium mentioned this pull request Feb 13, 2021

New test_hypothesis_plugin tests failing locally #2357

Closed

3 tasks

samuelcolvin mentioned this pull request Feb 22, 2021

fix(mypy): remove complaints about most custom _pydantic_ types #2099

Merged

4 tasks

lsorber mentioned this pull request Mar 4, 2021

Hypothesis plugin does not work for ConstrainedStr #2473

Closed

This was referenced Mar 6, 2021

Bump pydantic from 1.7.3 to 1.8.1 in /backend Vaarna/vaarna#139

Merged

Bump pydantic from 1.6.1 to 1.8.1 in /emmet-core rkingsbury/emmet#25

Closed

dependabot bot mentioned this pull request Mar 17, 2021

Bump pydantic[dotenv] from 1.7.3 to 1.8.1 enchant97/web-portal#1

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a Hypothesis plugin #2097

Add a Hypothesis plugin #2097

Zac-HD commented Nov 7, 2020 •

edited

codecov bot commented Nov 7, 2020 •

edited

PrettyWood commented Nov 26, 2020

Zac-HD commented Nov 27, 2020

samuelcolvin left a comment

Zac-HD commented Dec 1, 2020

lsorber commented Dec 20, 2020

Zac-HD commented Dec 20, 2020

PrettyWood commented Jan 3, 2021

Zac-HD commented Jan 4, 2021

samuelcolvin commented Feb 11, 2021

Zac-HD commented Feb 11, 2021

Add a Hypothesis plugin #2097

Add a Hypothesis plugin #2097

Conversation

Zac-HD commented Nov 7, 2020 • edited

codecov bot commented Nov 7, 2020 • edited

Codecov Report

PrettyWood commented Nov 26, 2020

Zac-HD commented Nov 27, 2020

samuelcolvin left a comment

Choose a reason for hiding this comment

Zac-HD commented Dec 1, 2020

lsorber commented Dec 20, 2020

Zac-HD commented Dec 20, 2020

PrettyWood commented Jan 3, 2021

Zac-HD commented Jan 4, 2021

samuelcolvin commented Feb 11, 2021

Zac-HD commented Feb 11, 2021

Zac-HD commented Nov 7, 2020 •

edited

codecov bot commented Nov 7, 2020 •

edited