Efficient Hypothesis strategies #1503

Zac-HD · 2024-02-21T08:30:08Z

This pull request fixes #404, which I opened a few years ago about performance issues caused by your rejection sampling, prompted by this Stack Overflow question.

Recent Hypothesis versions can usually rewrite filters expressed as partial(operator.xxx, bound), and so this style is considerably more efficient in most cases. The only downside is that it can take a few minutes to get used to the partial() calls being "backwards", so lambda x: x < y becomes partial(op.gt, y) (via lambda x: y > x).
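To make the "backwards" argument order concrete, here is a minimal sketch using only the standard library. The names `bound`, `less_than_lambda`, and `less_than_partial` are illustrative and do not come from this PR; the point is only that both callables express the same predicate.

```python
import operator
from functools import partial

bound = 10  # hypothetical upper bound, chosen for illustration

# The intuitive spelling of "x is less than bound".
less_than_lambda = lambda x: x < bound

# partial(operator.gt, bound)(x) evaluates operator.gt(bound, x),
# i.e. bound > x, which is the same predicate as x < bound.
less_than_partial = partial(operator.gt, bound)

samples = [-5, 9, 10, 11]
assert [less_than_lambda(x) for x in samples] == [less_than_partial(x) for x in samples]

# With Hypothesis installed, the partial form would be passed to a strategy
# like this (left as a comment so the sketch needs no third-party imports):
#   st.integers().filter(partial(operator.gt, bound))
```

The first argument given to partial() becomes the left-hand operand, which is why the operator has to be flipped (gt instead of lt) relative to the lambda.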

In the process, I also fixed two regex-related bugs where you'd see different behavior between the first and subsequent filters:

- str_matches_strategy used fullmatch for the first filter, but match for subsequent filters, allowing generation of data with a disallowed suffix.
- For the first filter, str_startswith_strategy and str_endswith_strategy prepended/appended a regex boundary to the pattern. However, if the pattern includes alternation (e.g. a|b), this boundary would only be applied to the first/last branch, and thus invalid data could be generated. Placing the user's pattern inside a group resolves this problem.
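Both bugs can be reproduced with the standard re module alone. The following is a rough sketch, not the code from this PR: the exact way pandera builds its patterns isn't shown here, so the variable names and the plain string concatenation are assumptions made for illustration.

```python
import re

pattern = "a|b"  # a user-supplied pattern containing alternation

# Alternation binds more loosely than an anchor, so "^a|b" parses as
# "(^a)|(b)": the boundary only applies to the first branch.
naive_startswith = re.compile("^" + pattern)
# Wrapping the user's pattern in a non-capturing group applies the
# boundary to every branch.
grouped_startswith = re.compile("^(?:" + pattern + ")")

assert naive_startswith.search("xxb") is not None  # wrongly accepted
assert grouped_startswith.search("xxb") is None    # correctly rejected
assert grouped_startswith.search("bxx") is not None

# The same applies at the end: "a|b$" parses as "(a)|(b$)".
naive_endswith = re.compile(pattern + "$")
grouped_endswith = re.compile("(?:" + pattern + ")$")

assert naive_endswith.search("axx") is not None    # wrongly accepted
assert grouped_endswith.search("axx") is None      # correctly rejected

# match only anchors at the start, so a trailing suffix slips through;
# fullmatch requires the whole string to match.
assert re.match("ab", "abXYZ") is not None
assert re.fullmatch("ab", "abXYZ") is None
```

A non-capturing group is used in the sketch so that wrapping the pattern does not shift the numbering of any groups the user's own pattern defines.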

Finally, I've updated the minimum Hypothesis version to the one required for efficient length filtering, and included some regex patterns for which the corresponding Hypothesis issue is still open, so that they'll become efficient for your users as soon as we ship that.

Signed-off-by: Zac Hatfield-Dodds <zac.hatfield.dodds@gmail.com>

Signed-off-by: cosmicBboy <niels.bantilan@gmail.com>

codecov · 2024-02-22T17:49:12Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 94.29%. Comparing base (4df61da) to head (3df43d4).
Report is 15 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #1503   +/-   ##
=======================================
  Coverage   94.29%   94.29%           
=======================================
  Files          91       91           
  Lines        7024     7029    +5     
=======================================
+ Hits         6623     6628    +5     
  Misses        401      401


cosmicBboy

Thanks @Zac-HD !

Zac-HD · 2024-02-23T00:27:43Z

Woohoo! I wonder if we'll get user reports of their tests suddenly working much faster 😁

Zac-HD and others added 2 commits February 22, 2024 12:00

Efficient Hypothesis strategies

78c6776

Signed-off-by: Zac Hatfield-Dodds <zac.hatfield.dodds@gmail.com>

update requirements files

3df43d4

Signed-off-by: cosmicBboy <niels.bantilan@gmail.com>

cosmicBboy force-pushed the bugfix/hypothesis-strategies branch from 683db8e to 3df43d4 Compare February 22, 2024 17:17

cosmicBboy approved these changes Feb 22, 2024


cosmicBboy merged commit 10cac40 into unionai-oss:main Feb 22, 2024
74 checks passed

Zac-HD deleted the bugfix/hypothesis-strategies branch February 23, 2024 00:26

tmcclintock mentioned this pull request Apr 16, 2024

Hypothesis examples are all the same #1579

Closed

