Fix average_size calculation #3160

Merged: 1 commit merged into HypothesisWorks:master from denser-small-arrays on Nov 28, 2021

Conversation

@Zac-HD (Member) commented Nov 23, 2021

Closes #3143.

@Zac-HD Zac-HD added the internals Stuff that only Hypothesis devs should ever see label Nov 23, 2021
@Zac-HD Zac-HD force-pushed the denser-small-arrays branch 4 times, most recently from 220868a to 0c43f1b Compare November 24, 2021 09:26
@Zac-HD Zac-HD requested a review from Zalathar November 24, 2021 22:32
@jebob (Contributor) commented Nov 26, 2021

I suspect a gradient-descent method might run faster and be more reliable; the current approach can choke on relatively small ints (with target=10000 and max_size=10001 you can get float overflows when computing the average for large increments). I can't currently test this Python locally, but trying nasty edge cases in Excel shows it can handle targets up to 2000 in a few iterations.

# Gradient derivation:
#   average_size = (1.0 / (1 - p_continue) - 1) * (1 - p_continue ** max_size)
#   d(average)/dp = (1 - p_continue ** max_size) * d/dp((1 - p_continue) ** -1 - 1)
#                   + (1.0 / (1 - p_continue) - 1) * d/dp(1 - p_continue ** max_size)
#                 = (1 - p_continue ** max_size) * (1 - p_continue) ** -2
#                   - (1.0 / (1 - p_continue) - 1) * max_size * p_continue ** (max_size - 1)

def _calc_p_continue(desired_avg, max_size):
    # (A @staticmethod in context; written at module level here so the
    # helper functions below resolve.)
    p_continue = 1 - 1.0 / (1 + desired_avg)
    if p_continue == 0 or max_size == float("inf"):
        return p_continue
    # For small max_size, the infinite-series p_continue is a poor approximation,
    # and while we can't solve the polynomial exactly, a few rounds of iteration
    # quickly get us a good approximate solution in almost all cases
    # (sometimes exact!).
    err = desired_avg - _p_continue_to_avg(p_continue, max_size)
    for _ in range(10):
        # Should converge in <5 iterations for nearly all cases.
        if abs(err) < desired_avg * 0.001:
            # Good enough
            break
        gradient = _p_continue_to_gradient(p_continue, max_size)
        p_continue += err / gradient
        err = desired_avg - _p_continue_to_avg(p_continue, max_size)
    else:
        raise AssertionError("failed to converge within ten iterations")
    assert 0 < p_continue < 1, p_continue
    return p_continue


def _p_continue_to_avg(p_continue, max_size):
    """Return the average_size generated by this p_continue and max_size."""
    return (1.0 / (1 - p_continue) - 1) * (1 - p_continue ** max_size)


def _p_continue_to_gradient(p_continue, max_size):
    """Return the gradient (with respect to p_continue) of the average size."""
    # While the true gradient is well behaved around 1, this function is not.
    # Approximating to near-one is good enough in most cases.
    # TODO: if the true value of p_continue is above this cap we won't converge.
    p_continue = min(p_continue, 0.9999)
    return (
        (1 - p_continue ** max_size) / (1 - p_continue) ** 2
        - (1.0 / (1 - p_continue) - 1) * max_size * p_continue ** (max_size - 1)
    )

@Zac-HD Zac-HD force-pushed the denser-small-arrays branch 5 times, most recently from 7db8071 to 8b30b67 Compare November 28, 2021 11:54
Co-Authored-By: Robert Howlett <9222111+jebob@users.noreply.github.com>
@Zac-HD (Member, Author) commented Nov 28, 2021

I've replaced my previous heuristic approach with a binary search, which is simple enough that it's clear on inspection that it converges and maintains the bounds we want at each step. The constant factors might be a bit larger, but it's still fast in practice, and @lru_cache makes it faster again.

@jebob, I tried out your gradient-descent code, but couldn't get it numerically stable enough to respect strict bounds under testing. Regardless, I feel that we've co-written this fix, and have given you commit and coauthor credit accordingly.
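For readers following the thread, here is a minimal sketch of the binary-search approach described above. It is an illustration reconstructed from this discussion, not necessarily the exact merged code; `_p_continue_to_avg` includes an assumed saturation guard (consistent with the saturation test quoted later in the review), and the 0.01 tolerance is an assumed value.

from functools import lru_cache


def _p_continue_to_avg(p_continue, max_size):
    """Return the average_size generated by this p_continue and max_size."""
    if p_continue >= 1:
        return max_size  # saturates: every draw continues until max_size
    return (1.0 / (1 - p_continue) - 1) * (1 - p_continue ** max_size)


@lru_cache()
def _calc_p_continue(desired_avg, max_size):
    """Find p_continue whose average size is close to, and never above, desired_avg."""
    assert desired_avg <= max_size
    if desired_avg == max_size:
        return 1.0
    # Infinite-series estimate: exact for unbounded max_size, approximate otherwise.
    p_continue = 1 - 1.0 / (1 + desired_avg)
    if p_continue == 0 or max_size == float("inf"):
        return p_continue
    # Over the reals this estimate underestimates the average for finite
    # max_size, but float rounding can flip that; nudge it back below the
    # target so it is a valid lower bound for the bisection.
    while _p_continue_to_avg(p_continue, max_size) > desired_avg:
        p_continue -= 0.0001
    # Bisect between the lower bound and 1.0.  The candidate only ever moves
    # to midpoints whose average is <= desired_avg, so the upper bound on the
    # average is maintained at every step.
    hi = 1.0
    while desired_avg - _p_continue_to_avg(p_continue, max_size) > 0.01:
        mid = (p_continue + hi) / 2
        if _p_continue_to_avg(mid, max_size) <= desired_avg:
            p_continue = mid
        else:
            hi = mid
    assert 0 < p_continue < 1
    return p_continue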

@jebob (Contributor) commented Nov 28, 2021

@Zac-HD Thanks for the co-authorship!

Stability is essential, so I agree binary search is the way to go.

@jebob (Contributor) left a review:

LGTM other than the two comments below.

    # gets us a good approximate solution in almost all cases (sometimes exact!).
    while _p_continue_to_avg(p_continue, max_size) > desired_avg:
        # This is impossible over the reals, but *can* happen with floats.
        p_continue -= 0.0001
@jebob (Contributor):

In this case I think we can just return p_continue (or do nothing, thus skipping the zeroth iteration of the while loop)? If 1 - 1.0 / (1 + desired_avg) is close enough to the correct answer that floating-point error causes p_continue to be an overestimate, then we're close enough for practical purposes.
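To make the floating-point concern concrete, here is a small self-contained search for cases where the initial estimate overshoots (illustrative; whether and where it finds violations depends on the platform's rounding). Over the reals, the finite-max_size average at this estimate is strictly below desired_avg, so any hit is pure rounding error:

import random


def _p_continue_to_avg(p_continue, max_size):
    return (1.0 / (1 - p_continue) - 1) * (1 - p_continue ** max_size)


random.seed(0)
for _ in range(100_000):
    desired_avg = random.uniform(0.1, 1000.0)
    max_size = random.randint(1, 10_000)
    p0 = 1 - 1.0 / (1 + desired_avg)  # infinite-series initial estimate
    if _p_continue_to_avg(p0, max_size) > desired_avg:
        print(f"overestimate: desired_avg={desired_avg!r}, max_size={max_size}")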

@Zac-HD (Member, Author):

I agree that the estimate is close enough that we could just use the initial p_continue.

However, it's cheap to make this adjustment, and that allows us to test that our binary search always maintains a strict upper bound.
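A sketch of what such a test could look like (hypothetical test code: it assumes the helpers are importable from hypothesis.internal.conjecture.utils, per the cu alias in the test quoted below; the PR's actual tests may differ):

from hypothesis import given, strategies as st
from hypothesis.internal.conjecture import utils as cu


@given(
    desired_avg=st.floats(min_value=0.1, max_value=100.0),
    max_size=st.integers(min_value=1, max_value=1_000),
)
def test_calc_p_continue_respects_upper_bound(desired_avg, max_size):
    # The pre-adjustment plus the bisection invariant mean the realised
    # average must never exceed the requested average.
    target = min(desired_avg, max_size)
    p_continue = cu._calc_p_continue(target, max_size)
    assert cu._p_continue_to_avg(p_continue, max_size) <= target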



def test_p_continue_to_average_saturates():
    assert cu._p_continue_to_avg(1.1, 100) == 100
@jebob (Contributor):

Can we replace this special case with an @example on test_p_continue_to_average?

@Zac-HD (Member, Author):

We do actually have such an example, but we also need this standalone test to reliably get full coverage from our conjecture-coverage task (the other test is in nocover).
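For readers unfamiliar with the pattern, an @example-decorated property test along these lines folds the saturation case into the main test (illustrative code, not the repo's exact test; the strategy bounds are assumptions):

from hypothesis import example, given, strategies as st
from hypothesis.internal.conjecture import utils as cu


@example(p_continue=1.1, max_size=100)  # pins the saturation branch explicitly
@given(
    p_continue=st.floats(min_value=0.0, max_value=0.99),
    max_size=st.integers(min_value=1, max_value=1_000),
)
def test_p_continue_to_average(p_continue, max_size):
    average = cu._p_continue_to_avg(p_continue, max_size)
    assert 0 <= average <= max_size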

@Zac-HD (Member, Author) commented Nov 28, 2021

Thanks again for the review, and all your help with this!

@Zac-HD Zac-HD merged commit 84afe88 into HypothesisWorks:master Nov 28, 2021
@Zac-HD Zac-HD deleted the denser-small-arrays branch November 28, 2021 14:09
Closed by this pull request: Our internal average_size handling is incorrect (#3143)