Improve Hypothesis's ability to shrink sums #1403

DRMacIver · 2018-07-18T14:54:00Z

This improves Hypothesis's ability to shrink examples which depend on the sum of two values.

The motivating example for this is the following:

import numpy as np
import hypothesis.strategies as st
import hypothesis.extra.numpy as nps
from hypothesis import given

int16s = nps.from_dtype(np.dtype('int16'))

bounded_lists = st.lists(int16s, max_size=1).filter(lambda x: sum(x) < 256)

problems = st.tuples(
    bounded_lists,
    bounded_lists,
    bounded_lists,
    bounded_lists,
    bounded_lists,
)

@given(problems)
def test_sum_does_not_overflow(p):
    assert sum([x for sub in p for x in sub], np.int16(0)) < 5 * 256

This fails because you can get an overflow if the sum is large and negative. The ideal minimum example for it in Hypothesis's ordering is ([], [], [], [-1], [-32768]), but previously we rarely found that.
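As a rough illustration of why that example fails, here is a minimal pure-Python sketch of the wraparound. It assumes 16-bit two's-complement wrapping on every addition; `wrap_int16` and `int16_sum` are hypothetical helpers written for this note, not part of NumPy or Hypothesis.

```python
def wrap_int16(value):
    """Reduce an integer to the signed 16-bit range (assumed wraparound)."""
    return (value + 2**15) % 2**16 - 2**15


def int16_sum(lists):
    """Sum every element of every sublist, wrapping after each addition."""
    total = 0
    for sub in lists:
        for x in sub:
            total = wrap_int16(total + x)
    return total


example = ([], [], [], [-1], [-32768])

# Every sublist passes the filter, since each individual sum is below 256.
assert all(sum(sub) < 256 for sub in example)

# -1 + -32768 = -32769, which is out of range and wraps to 32767.
wrapped = int16_sum(example)
print(wrapped, wrapped < 5 * 256)  # prints "32767 False"
```

The filter only bounds each list from above, so nothing stops the total from going below -32768 and wrapping to a large positive number.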

This example comes from the SmartCheck paper. I'm running some evaluations based in part on that paper, and it annoyed me that Hypothesis didn't normalize this example. This PR is part one of two in fixing that.

Zac-HD

Generally looks good to me, with a few minor comments. Merge at your own discretion 😄

(I also love the motivation for this change!)

Zac-HD · 2018-07-19T01:20:36Z

hypothesis-python/RELEASE.rst

+
+    @given(st.integers(), st.integers())
+    def test_does_not_exceed_100(m, n):
+        assert m + n <= 100


To fail with m=0, n=100, this must use < not <=

Whoops, yes, thanks.
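A quick check of the point above, as a sketch: `holds` is a hypothetical helper that evaluates the release-note assertion with either comparison.

```python
def holds(m, n, strict):
    """Evaluate the release-note assertion with `<` (strict) or `<=`."""
    return m + n < 100 if strict else m + n <= 100


# With `<=`, m=0, n=100 satisfies the assertion, so it is not a counterexample.
assert holds(0, 100, strict=False)

# With `<`, the same pair violates the assertion, so the test fails on it.
assert not holds(0, 100, strict=True)
```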

Zac-HD · 2018-07-19T01:57:53Z

hypothesis-python/src/hypothesis/internal/conjecture/engine.py

+                            except OverflowError:
+                                return False
+                            return self.incorporate_new_buffer(attempt)
+                        if trial(m - 1, n + 1) and m > 1:


I think this should be m >= 1, so that m can be driven all the way to 0, right? It also looks like putting the comparison before the call to trial might avoid trying to incorporate a bad buffer. (These probably cancel each other out in practice, but I'd find it easier to follow the other way.)

No, this is correct. We know m > 0. If m = 1 we want to run trial(0, n + 1) and then stop regardless of what it returns. I'll add a comment to clarify the logic.

Aaah, right. That does make sense, but a comment would certainly be good!
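For readers following the thread, the logic discussed above can be sketched roughly as follows. This is a simplified stand-alone illustration, not the engine code: `shrink_pair_retaining_sum` is a hypothetical name, `trial(x, y)` stands in for the real `trial` (which rewrites two blocks and calls `incorporate_new_buffer`), and the follow-up search is assumed to be a plain linear scan.

```python
def shrink_pair_retaining_sum(m, n, trial):
    """Lower m while keeping m + n fixed, for as long as trial() accepts.

    trial(x, y) is a hypothetical stand-in that returns True when the
    pair (x, y) still triggers the failure.
    """
    assert m > 0  # the pass only runs when there is something to lower

    # Cheap probe first: move 1 from m to n.
    if not trial(m - 1, n + 1):
        return m, n
    m, n = m - 1, n + 1
    total = m + n

    # If m started at 1 it is now 0, so the loop body never runs and we
    # stop after the single probe. This mirrors the `m > 1` check.
    while m > 0 and trial(m - 1, total - (m - 1)):
        m -= 1
    return m, total - m


calls = []


def sum_at_least_100(x, y):
    calls.append((x, y))
    return x + y >= 100


print(shrink_pair_retaining_sum(1, 99, sum_at_least_100))  # prints "(0, 100)"
print(calls)  # prints "[(0, 100)]"
```

Starting from m = 1, the only attempt made is `trial(0, n + 1)`, which matches the explanation that the pass stops after that call regardless of its result.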

DRMacIver added 2 commits July 18, 2018 15:37

Add a pass that minimizes a block while retaining a shared sum

9050a0c

Add release notes

57a0bab

DRMacIver changed the title ~~Handle Hypothesis's ability to shrink sums~~ Improve Hypothesis's ability to shrink sums Jul 18, 2018

DRMacIver mentioned this pull request Jul 18, 2018

Add shrink pass for reordering examples #1404

Merged

Guard against shrinking from m=0

5e44109

Zac-HD approved these changes Jul 19, 2018


DRMacIver merged commit 89d664c into master Jul 19, 2018
