Add `withCounterexample` to get a re-useable handle on counterexamples #376

MaximilianAlgehed · 2024-03-21T14:59:26Z

The lack of actual values returned by quickCheck when a property fails has been a long standing friction. This PR introduces a simple combinator withCounterexample :: (Typeable a, Show a, Testable prop) => a -> prop -> Property that lets you "export" data to the Failure result of a property.

closes #117
closes #310

MaximilianAlgehed · 2024-03-21T15:05:18Z

The thing I don't like here is the use of the word "counterexample" as that already has a meaning in QuickCheck. I'm very open to suggestion!!

nick8325 · 2024-03-21T19:40:41Z

I like it!

But also a while back I made a wrapper around QuickCheck which changes the API to have typed counterexamples:

https://hackage.haskell.org/package/quickcheck-with-counterexamples

So I don't know if we should: (1) merge your patch, (2) merge my thingy into main QuickCheck, (3) both. What do you think? Your approach is much simpler and has a far smaller API surface, but needs a bit more work from the user to use it (but maybe not so much).

MaximilianAlgehed · 2024-03-21T20:05:31Z

Maybe we can do both?

What I would love is if you could use the result of QuickCheck failing to re-run the same property with the counterexample - or with the counterexample slightly modified. That would open up a whole new way of doing testing whereby you would both be able to recheck the property after fixing things, modify the counterexample you have to better understand what's causing the property to fail, etc. etc.

With this light-weight approach we can get most of that given a bit of work from the user without changing the API much, but if we are willing to take the plunge on your approach that would allow us to make things much more seamless.

To reduce the burden on the user of this approach I would try first of all to integrate it with forAll - either by default or ad an optional thing.

Another thing I would like to try is splitting the implementation of Property so that the points of quantification are explicit, that way one could implement replaying a property given [Counterexample] on versions of QuickCheck built with Typeable.

The third thing if we really want to take the plunge is to make the Property type indexed. The thing I worry about with that is that it would completely overhaul the interface of QhickCheck and also risk making things difficult to keep Haskell98 compatible.

One might think that type indexing and wrapping things with existentials or similar could reduce the amount of type casting necessary to deal with replaying a counterexample, but I worry that won't be the case so long as we wish to keep the API at least somewhat similar.

So to sum up my views: I like the fact that this is a conservative extension to the API, I really like your approach making things nice and typed and making all quantification points explicit and repayable with a counterexample, and I think we can figure out a way of getting the best of both worlds. We should just settle on a couple of questions:

Do we want to automatically include a withCounterexample when you use forAll?
Do we think its possible to change the implementation of Property so that we can replay a property given a counterexample?
Is there anything else that we want to add the the API that would be conservative? (I can imagine a quickCheckCounterexample :: Testable p => p -> IO (Maybe [Counterexample]) foe example)

MaximilianAlgehed · 2024-03-23T10:39:41Z

I discovered a seriously cool feature in ghci that I didn't know about today. It appears it will resolve the type of existentially quantified variables with Typeable for you! I implemented quickCheckCounterexample on this fork but for untyped Property and amazingly ghci appears to know the type of the existentially quantified variable:

$> :t quickCheckCounterexample
quickCheckCounterexample
  :: Testable prop => prop -> IO Counterexamples
$> c :! _ <- quickCheckCounterexample $ \ x -> withCounterexample (x :: Double) (x > 0)
*** Failed! Falsified (after 1 test):  
0.0
$> :t c
c :: Double

Interestingly, it appears to only work for <- bindings, not let:

$> let a :! _ = (0 :: Int) :! NoCounterexamples
<< Big ugly type error >>

It also doesn't work for data types defined in the shell, but it does work for data types defined in the source code.

The cool thing about this is that it might allow us a fairly good middle ground between the strongly typed approach in your thing @nick8325 and a smaller API footprint while not making it a PITA to work with in ghci. I think that's pretty cool!

nick8325 · 2024-03-23T18:16:19Z

Cool! A couple of comments for now!

When it comes to re-running the same test case or a slightly modified version of it - I don't think that withCounterexample helps there, does it?
If we want to automatically store typed counterexamples, we would need to change the types of forAll etc to have a Typeable constraint. In the past this would have been a big and annoying breaking change so I didn't seriously consider it, but nowadays GHC automatically generates a Typeable instance for every type so it might be reasonable.

MaximilianAlgehed · 2024-03-23T18:35:54Z

I think you're right on the re-runnig point - with the current design of Property. But that can possibly change to include the binding-sites.

On the second point I agree and I would put that change under an ifndef NO_AUTO_TYPEABLE pragma to make sure it doesn't become annoying for versions of GHC that don't support automatic typeable.

MaximilianAlgehed · 2024-03-23T18:37:04Z

Regarding changing the Property type - this might require both existential quantification and might not be entirely feasible. For that reason I don't think it's immediately obvious it can be done without a massive headache. But we can investigate it / put it on the "maybe later" pile.

MaximilianAlgehed · 2024-03-28T09:16:48Z

I'm uhming and ahing a bit about the idea of putting Typeable on forAll for the sake of this. You'd also have to put it on Testable and in all sorts of other places. That would be a major breaking change and I don't think it's worth delaying this conservative extension for the sake of figuring out how to minimize the damage of a big breaking change. I propose this: we merge this change (perhaps with some cleanup? I'll go over it again) and we make an issue about investigating always producing Counterexamples in the future with a nod to the fact that it would break a lot of stuff.

We could also introduce the interface in quickcheck-with-counterexamples as a module in QuickCheck that builds on this extension to implement what I presume you do with an IORef now in the future. Does that sound like a good idea @nick8325 ?

add recheck to quickly re-test a property

MaximilianAlgehed · 2024-04-04T08:44:50Z

To avoid naming confusion I've renamed the new Counterexample Witness instead.

nick8325 · 2024-04-17T18:07:04Z

+1 to naming things Witness instead of Counterexample, and I'm pro this pull request in general, but some comments by way of bikeshedding:

There is already (the not wonderfully named) counterexample :: Testable prop => String -> prop -> Property. Should the new combinator perhaps be called witness for consistency?
The Cex constructor should be renamed to e.g. Witness
What is the purpose of the Witnesses type (as opposed to e.g. just exposing a [Witness])?
Witnesses is not exported from Test.QuickCheck
Witnesses is missing a Show instance
I'm not thrilled about the proliferation of quickCheck* functions (and e.g. we would also need verboseCheckWitness and verboseCheckWithWitness - and, given that there's no function of type Result -> Witnesses, maybe even quickCheckWitnessResult, quickCheckWitnessWithResult, verboseCheckWitnessResult, verboseCheckWitnessWithResult! )

Suggestion: we could get rid of one of Witness and Witnesses. So either:

Drop Witnesses, and just expose witness :: [Witness] to the user, or
Drop Witness, so that Result would have a field witness :: Witnesses

Then rather than quickCheckWitness and its many variants, users can just use quickCheckResult together with witnesses, and this could be documented in withWitness.

MaximilianAlgehed · 2024-04-18T07:44:19Z

Drop Witnesses, and just expose witness :: [Witness] to the user, or

Done. This is probably the least bad option. As per the docs on witness now, you can do:

-- >>> [Wit x] <- fmap witnesses . quickCheckResult $ \ x -> witness x $ x == (0 :: Int)
-- *** Failed! Falsified (after 2 tests):
-- 1
-- >>> x
-- 1
-- >>> :t x
-- x :: Int

MaximilianAlgehed requested review from UlfNorell and nick8325 March 21, 2024 14:59

MaximilianAlgehed mentioned this pull request Mar 21, 2024

Feature Request: Get failed value (instead of outputting to IO) #117

Closed

MaximilianAlgehed linked an issue Mar 21, 2024 that may be closed by this pull request

Feature Request: Get failed value (instead of outputting to IO) #117

Closed

Add withCounterexample to get a re-useable handle on counterexamples +

1bad5f4

add recheck to quickly re-test a property

MaximilianAlgehed force-pushed the PR-withCounterexample branch from 118c9f9 to 1bad5f4 Compare April 2, 2024 08:08

This was referenced Apr 3, 2024

Version bump to 2.15 #391

Merged

Pretty-show-ing the counterexample. #328

Open

MaximilianAlgehed added 2 commits April 4, 2024 10:41

Counterexample -> Witness

b084a08

wip

144b64b

MaximilianAlgehed added 2 commits April 18, 2024 09:31

rename withWitness to witness and rename Cex to Wit

8a61a78

react to comments

2336786

UlfNorell approved these changes Apr 18, 2024

View reviewed changes

MaximilianAlgehed merged commit ded925d into master Apr 18, 2024
108 checks passed

MaximilianAlgehed deleted the PR-withCounterexample branch April 18, 2024 08:04

stevana mentioned this pull request Apr 19, 2024

Witnesses of labelledExamples? #395

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `withCounterexample` to get a re-useable handle on counterexamples #376

Add `withCounterexample` to get a re-useable handle on counterexamples #376

MaximilianAlgehed commented Mar 21, 2024 •

edited

Loading

MaximilianAlgehed commented Mar 21, 2024

nick8325 commented Mar 21, 2024

MaximilianAlgehed commented Mar 21, 2024

MaximilianAlgehed commented Mar 23, 2024

nick8325 commented Mar 23, 2024

MaximilianAlgehed commented Mar 23, 2024 •

edited

Loading

MaximilianAlgehed commented Mar 23, 2024

MaximilianAlgehed commented Mar 28, 2024

MaximilianAlgehed commented Apr 4, 2024

nick8325 commented Apr 17, 2024

MaximilianAlgehed commented Apr 18, 2024 •

edited

Loading

Add withCounterexample to get a re-useable handle on counterexamples #376

Add withCounterexample to get a re-useable handle on counterexamples #376

Conversation

MaximilianAlgehed commented Mar 21, 2024 • edited Loading

MaximilianAlgehed commented Mar 21, 2024

nick8325 commented Mar 21, 2024

MaximilianAlgehed commented Mar 21, 2024

MaximilianAlgehed commented Mar 23, 2024

nick8325 commented Mar 23, 2024

MaximilianAlgehed commented Mar 23, 2024 • edited Loading

MaximilianAlgehed commented Mar 23, 2024

MaximilianAlgehed commented Mar 28, 2024

MaximilianAlgehed commented Apr 4, 2024

nick8325 commented Apr 17, 2024

MaximilianAlgehed commented Apr 18, 2024 • edited Loading

Add `withCounterexample` to get a re-useable handle on counterexamples #376

Add `withCounterexample` to get a re-useable handle on counterexamples #376

MaximilianAlgehed commented Mar 21, 2024 •

edited

Loading

MaximilianAlgehed commented Mar 23, 2024 •

edited

Loading

MaximilianAlgehed commented Apr 18, 2024 •

edited

Loading