Improve performance of small tests that use rejection sampling #2030
Conversation
Force-pushed from `7040de6` to `be14bad`.
Despite the failing build, this should actually pass and be ready for review. It's only failing because Cloudflare issues are currently breaking much of the internet.
Force-pushed from `be14bad` to `7579868`.
```diff
@@ -70,6 +70,7 @@ def test_unsat_sets_of_samples(x):
         assert False


+@settings(suppress_health_check=[HealthCheck.too_slow, HealthCheck.filter_too_much])
```
I'm not really comfortable with this since it seems to defeat the purpose of the test, but after merging #2031 and rebasing I don't think it will be needed.
Yup, agreed. I think it's not totally defeating the point of the test, but it definitely weakens it. Either way, it's no longer necessary.
Force-pushed from `7579868` to `903e1cc`.

Force-pushed from `903e1cc` to `1b0867c`.
LGTM!
Fixes #1864.
I think this problem must always have been latent in Hypothesis, though a bug in the previous tree implementation somehow masked it, because the issue is intrinsic to what we were doing: rejection sampling on small state spaces forces the necessary novel prefix to grow progressively longer, creating an accidentally quadratic problem where generating the Nth example takes O(N) time.
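The accidental quadratic can be illustrated with a toy model (not Hypothesis's implementation): drawing distinct values from a small space by plain rejection. Once most of the space has been seen, each new example discards more and more draws, so the expected cost of the Nth example grows with N.

```python
import random


def draws_for_unique(n, space, seed=0):
    """Count total draws needed to rejection-sample n distinct values.

    Each draw picks uniformly from `space` values; draws that repeat an
    already-seen value are discarded and retried. The k-th new value
    takes space / (space - k) draws on average, so the total blows up
    as n approaches the size of the state space.
    """
    prng = random.Random(seed)
    seen = set()
    draws = 0
    while len(seen) < n:
        draws += 1
        value = prng.randrange(space)
        if value not in seen:  # accept only novel values
            seen.add(value)
    return draws


# Filling most of a 100-value space costs far more than one draw per
# example, because late examples are almost always discarded.
print(draws_for_unique(50, 100), draws_for_unique(95, 100))
```

With the same seed, the run that asks for more unique values strictly extends the cheaper run, so the cost comparison is deterministic even though the draws themselves are random.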
This PR fixes it by implicitly capping all rejection sampling loops at 20 iterations: `ConjectureData` now has logic for marking any longer run of consecutive discards as invalid.

Earlier versions of this PR also had an independent performance improvement to `generate_novel_prefix`, but it turned out to change the distribution enough that I decided to pull it out for now (it caused us to miss some branches in coverage that we were previously hitting).
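A minimal sketch of the discard-capping idea, using illustrative names only (`ToyData`, `Invalid`, `MAX_CONSECUTIVE_DISCARDS` are hypothetical, not Hypothesis's actual `ConjectureData` API): the data object counts consecutive discards and aborts the test case once the cap is exceeded, so every rejection loop is implicitly bounded without changing the loop itself.

```python
import random

# Illustrative cap, mirroring the 20-iteration limit described above.
MAX_CONSECUTIVE_DISCARDS = 20


class Invalid(Exception):
    """Raised to abandon a test case whose draws keep being discarded."""


class ToyData:
    """Toy stand-in for a ConjectureData-like object (hypothetical API)."""

    def __init__(self, prng):
        self.prng = prng
        self.consecutive_discards = 0

    def draw_value(self, space):
        return self.prng.randrange(space)

    def note_discard(self):
        # Called whenever a rejection-sampling loop throws a draw away.
        self.consecutive_discards += 1
        if self.consecutive_discards > MAX_CONSECUTIVE_DISCARDS:
            raise Invalid("too many consecutive discards")

    def note_accept(self):
        self.consecutive_discards = 0


def rejection_sample(data, space, predicate):
    # The cap lives inside the data object, so this loop never needs
    # its own iteration limit.
    while True:
        value = data.draw_value(space)
        if predicate(value):
            data.note_accept()
            return value
        data.note_discard()


data = ToyData(random.Random(0))
# A satisfiable predicate succeeds as usual...
assert rejection_sample(data, 10, lambda v: v % 2 == 0) % 2 == 0
# ...while an unsatisfiable one is cut off after the cap.
try:
    rejection_sample(data, 10, lambda v: False)
except Invalid:
    print("aborted after the discard cap")
```

Resetting the counter on every accepted draw is what makes the cap apply to *consecutive* discards rather than to the total, matching the description above.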