Boundary of search space on pairs of floats is reliably limited #2671

rsokl · 2020-11-18T19:04:09Z

Generating a pair of floats with @given(x=st.floats(x_min, x_max), y=st.floats(y_min, y_max)) fails to ever generate cases along the "vertical strip" (x_min, y), other than the point (x_min, y_min).

This is not true for the transpose – the "horizontal strip" (x, y_min) is explored in a robust way.

The following tests demonstrate this:

import hypothesis.strategies as st
from hypothesis import given

@given(x=st.floats(0, 10), y=st.floats(0, 10))
def test_explore_xmin_y(x, y):
    # always passes, even if `max_examples` is cranked up
    if x == 0:
        assert y == 0


@given(x=st.floats(0, 10), y=st.floats(0, 10))
def test_explore_x_ymin(x, y):
    # reliably fails
    if y == 0:
        assert x == 0

I found this because I noticed that a test that I had written should have been failing along (x_min, y) other than at (x_min, y_min), but it never did. I was surprised that simply swapping the order of my parameters got the test to fail.

The text was updated successfully, but these errors were encountered:

Zac-HD · 2020-11-18T19:43:05Z

I think this might be related to our zero-blocks heuristics? @DRMacIver or @Zalathar can probably say more.

rsokl · 2020-11-18T20:24:07Z

To summarize this visually, the following plots 10,000 ceil'd (x, y) values generated by x=st.floats(0, 10), y=st.floats(0, 10) (on a log(cnt + 1) scale)

Zalathar · 2020-11-22T08:54:09Z

I had a brief look into this, and the impression I have so far is that our strategy for bounded floats doesn't have any explicit bias towards zeros (unlike the unbounded-floats strategy).

And since Conjecture currently doesn't go out of its way to generate low-level zeros (IIRC), the end result is that zeros end up being astronomically rare outside of the guaranteed all-zeros example.

(I haven't yet investigated what's going on behind the skew towards one axis working and the other axis not working.)

Zac-HD · 2020-11-22T09:15:33Z

Hmm, probably related to #1704 then - in that we may want to reengineer the bounded floats strategy.

rsokl · 2020-11-23T17:21:22Z

And since Conjecture currently doesn't go out of its way to generate low-level zeros (IIRC), the end result is that zeros end up being astronomically rare outside of the guaranteed all-zeros example.

~~@Zalathar I don't quite understand how this would explain the asymmetry that we see. i.e. Why would we see zeroes along the "vertical" strip but not the horizontal?~~

Sorry, just saw your statement in parenthesis

Zalathar · 2020-11-25T11:51:37Z

OK, I believe the observed asymmetry happens as a side-effect of #2523.

After generating a novel prefix, the engine first tries zero-extending it, to get an estimate of how long a “short” example should be. In the case of two bounded-float arguments, this process is pretty much guaranteed to generate a non-minimal value for the first, and a minimal value for the second.

This matches my observation that the (correctly) reliably-failing test would always fail on its second example (i.e. the first one after the all-zeros example).

(Aside: I managed to greatly confuse myself by assuming that Hypothesis generates argument values in left-to-right program order, which turns out to not be the case in general. Something to keep an eye on when doing this sort of low-level detective work.)

Zac-HD · 2021-03-13T01:54:02Z

I think a good general solution to this would be to add another mutation pass. Currently we just try to duplicate segments (causing the bright diagonal line in @rsokl's figure above), but shuffling equivalent segments would also make sense - albeit probably with different heuristics about when to stop.

Zalathar · 2021-03-13T02:12:01Z

There were two things that I had vaguely intended to do about this, but never ended up prioritizing:

Re-engineer the bounded-floats strategy to be more similar to the unbounded-floats strategy (especially for large ranges), so there is less risk of unintuitive behaviour when switching between the two.
Reintroduce some of the fancier mutator features or biased-random features (e.g. deliberately generating zeros with higher probability) that we used to have.

For the latter, my understanding is that David mostly removed them in the spirit of “let's rip out some annoying complexity and see if we get away with it”. So if this issue constitutes evidence that we didn't actually get away with it, then it would make sense to thoughtfully bring some of them back.

Zac-HD · 2021-03-13T02:21:47Z

Sounds good!

Re-engineer the bounded-floats strategy to be more similar to the unbounded-floats strategy (especially for large ranges), so there is less risk of unintuitive behaviour when switching between the two.

I'm actually planning out a #2878-style merging of our bounded and unbounded floating-point strategies, with the intent of improving the distribution and shrinking behaviour - especially for 32bit and 16bit floats where rejection sampling is hilariously inefficient.

Reintroduce some of the fancier mutator features or biased-random features (e.g. deliberately generating zeros with higher probability) that we used to have.

I'm interested in building mutators which work for very long runs (~millions of test cases) without e.g. blowing through memory limits, to use in HypoFuzz, including many ideas which don't 'pay for their overhead' in Hypothesis' typically short runs. Agreed that the biased-random tricks might be worth bringing back though.

Zac-HD · 2022-06-20T18:00:36Z

Fixed by #3327 - we now add the min, next_up(min), min+1, and symetrically for max to our "interesting" floats which are generated more often.

Zac-HD added enhancement it's not broken, but we want it to be better internals Stuff that only Hypothesis devs should ever see labels Nov 18, 2020

Zac-HD mentioned this issue Feb 28, 2022

st.integers() never generates examples close to the max_value #2942

Closed

Zac-HD closed this as completed Jun 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Boundary of search space on pairs of floats is reliably limited #2671

Boundary of search space on pairs of floats is reliably limited #2671

rsokl commented Nov 18, 2020 •

edited

Zac-HD commented Nov 18, 2020

rsokl commented Nov 18, 2020 •

edited

Zalathar commented Nov 22, 2020

Zac-HD commented Nov 22, 2020

rsokl commented Nov 23, 2020 •

edited

Zalathar commented Nov 25, 2020

Zac-HD commented Mar 13, 2021

Zalathar commented Mar 13, 2021

Zac-HD commented Mar 13, 2021

Zac-HD commented Jun 20, 2022

Boundary of search space on pairs of floats is reliably limited #2671

Boundary of search space on pairs of floats is reliably limited #2671

Comments

rsokl commented Nov 18, 2020 • edited

Zac-HD commented Nov 18, 2020

rsokl commented Nov 18, 2020 • edited

Zalathar commented Nov 22, 2020

Zac-HD commented Nov 22, 2020

rsokl commented Nov 23, 2020 • edited

Zalathar commented Nov 25, 2020

Zac-HD commented Mar 13, 2021

Zalathar commented Mar 13, 2021

Zac-HD commented Mar 13, 2021

Zac-HD commented Jun 20, 2022

rsokl commented Nov 18, 2020 •

edited

rsokl commented Nov 18, 2020 •

edited

rsokl commented Nov 23, 2020 •

edited