What4 eval3 #887

robdockins · 2020-11-02T19:12:47Z

Another attempt at improving the situation WRT evaluating terms more fully before passing them down to the Crucible symbolic simulator.

These patches cause the ResolveSetupValue phase to evaluate terms using the What4 SAWCore backend, but limit the definitions it will unfold to those from the prelude or those defined locally with "let", relying on the new structured name infrastructure in SAWCore to reliably make this determination.

src/SAWScript/Crucible/LLVM/ResolveSetupValue.hs

src/SAWScript/Builtins.hs

robdockins · 2020-11-16T23:57:49Z

@brianhuffman, I think this is ready to merge (pending CI), if you have a minute to review.

robdockins · 2020-11-17T17:45:02Z

@andreistefanescu, looks like there is a fairly serious slowdown in the s2n/hmac proofs on this branch, and a less serious slowdown in SIKE. Any off-the-cuff ideas why that might be?

robdockins · 2020-12-04T20:57:15Z

Some profiling and trial-and-error seem to indicate that most of the slowdown here is due to the exception-handling overheads arising from this line:

https://github.com/GaloisInc/saw-core/blob/8eefd675590ef81f819355af25d155d692e01345/saw-core-what4/src/Verifier/SAW/Simulator/What4.hs#L1165

I'll have to find some other way to implement this.

robdockins · 2020-12-04T23:49:00Z

Upon some more experimentation, I think exception handling isn't actually the issue. Instead, it seems to be that fewer expressions are being evaluated to concrete values because evaluation bails when it sees a non-prelude identifier. As a result, we are getting less exact results going into the simulator, and more override variants have to be tried.

So... I guess we need to make sure to evaluate concretely whenever possible, but not expand values whose result is symbolic if we have to unfold a non-prelude symbol. I'm not sure how to accomplish that, except by the brute-force method of evaluating things fully and seeing if they are concrete, then, if the result is symbolic, checking whether we unfolded any disallowed symbols and discarding the result if so... which seems kind of wasteful, but maybe isn't so bad.

robdockins · 2020-12-08T19:18:50Z

Status update, short version: I think the current state of things is a bit of a local minimum here, and some more extensive refactoring will be required to do better.

Status update, longer version. The current strategy is basically: evaluate bitvector and boolean values that do not contain free variables (ExtCns), and paste any concrete values we compute into the symbolic simulator. If the computed value is not concrete, throw it away and make an opaque "binding", which creates a fresh What4 variable and records the corresponding SAWCore term in a table. What we'd like to do is remove the restriction that only terms without free variables are evaluated, so the simulator can notice, e.g., preconditions that interact with control-flow decisions, or useful simplifications (such as `and a (or a b)` reducing to `a`).
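A minimal sketch of the current strategy as described above. The tuple-based term encoding, the `Resolver` class, and the `fresh0`-style names are all hypothetical stand-ins for illustration; this is not the SAWCore or What4 API.

```python
from itertools import count

# Hypothetical term encoding (not SAWCore):
#   ("lit", n)    concrete bitvector value
#   ("ext", name) free variable (ExtCns)
#   ("and", a, b) / ("or", a, b) bitwise operations


def free_vars(term):
    """Collect the names of all free variables in a term."""
    tag = term[0]
    if tag == "lit":
        return set()
    if tag == "ext":
        return {term[1]}
    return free_vars(term[1]) | free_vars(term[2])


def eval_closed(term):
    """Evaluate a term that contains no free variables."""
    tag = term[0]
    if tag == "lit":
        return term[1]
    if tag == "ext":
        raise ValueError("eval_closed called on a term with free variables")
    left, right = eval_closed(term[1]), eval_closed(term[2])
    return left & right if tag == "and" else left | right


class Resolver:
    def __init__(self):
        self._fresh = count()
        self.bindings = {}  # fresh variable name -> original term

    def resolve(self, term):
        # Closed terms are evaluated and pasted in as concrete values.
        if not free_vars(term):
            return ("concrete", eval_closed(term))
        # Anything else becomes an opaque binding: a fresh variable
        # plus a table entry remembering the original term.
        name = f"fresh{next(self._fresh)}"
        self.bindings[name] = term
        return ("opaque", name)
```

Under this sketch, resolving `("and", ("ext", "a"), ("or", ("ext", "a"), ("ext", "b")))` yields an opaque `fresh0`, so the consumer never gets the chance to see that the term is equivalent to `a`.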

Attempt 1: fully evaluate every term. This basically works, but significantly slows down most proofs. For compositional verification, we typically want to treat subcomputations as opaque, which is provided by the current strategy.

Attempt 2: treat imported Cryptol terms as opaque. This basically works, but significantly slows down some proofs. We lose some opportunities for concrete evaluation with this strategy because we treat all imported terms as opaque, even if evaluation would result in concrete terms.

Attempt 3: fully evaluate every term and keep track of every constant unfolded. Only use the result if it is concrete, or if we never unfolded any constant we consider opaque. This is generating spurious counterexamples for reasons I don't understand. I suspect that the back-and-forth mapping between What4 and SAWCore is getting confused somewhere and distinct free variables are being assigned to things that should be the same.
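The acceptance rule in attempt 3 can be sketched as follows. This is an illustration under assumptions, not the code on this branch: the term encoding, `evaluate`, `evaluate_or_discard`, and the `defs` table are all hypothetical.

```python
# Hypothetical term encoding (not SAWCore):
#   ("lit", n)      concrete value
#   ("ext", name)   free variable
#   ("const", name) named constant, defined in a `defs` table
#   ("add", a, b)   addition


def evaluate(term, defs, unfolded):
    """Evaluate fully, recording every constant name that gets unfolded."""
    tag = term[0]
    if tag in ("lit", "ext"):
        return term
    if tag == "const":
        unfolded.add(term[1])
        return evaluate(defs[term[1]], defs, unfolded)
    if tag == "add":
        left = evaluate(term[1], defs, unfolded)
        right = evaluate(term[2], defs, unfolded)
        if left[0] == "lit" and right[0] == "lit":
            return ("lit", left[1] + right[1])
        return ("add", left, right)
    raise ValueError(f"unknown term tag: {tag!r}")


def evaluate_or_discard(term, defs, opaque_names):
    """Keep the result if it is concrete, or if no opaque constant was unfolded."""
    unfolded = set()
    result = evaluate(term, defs, unfolded)
    if result[0] == "lit" or not (unfolded & opaque_names):
        return result
    return None  # caller would fall back to an opaque binding
```

For example, with `defs = {"k": ("add", ("lit", 1), ("lit", 2))}` and `"k"` marked opaque, `("add", ("const", "k"), ("lit", 4))` is still accepted because it evaluates to a concrete value, whereas a symbolic result that required unfolding an opaque constant is discarded.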

What we would actually like to have: evaluate terms as far as possible, mapping external constants in a way that correctly respects round-tripping. When we encounter an "opaque" constant, evaluate it, but only use the result if it is concrete. Otherwise, replace the term with an uninterpreted function, as we do for external constants. All of this is made quite a bit trickier by the fact that SAWCore simulators are lazy, so the decision about what to do with results has to be delayed until the relevant thunks are eventually forced. I don't see an obvious way to shoehorn this strategy into the current SAWCore evaluator infrastructure, and the SAWCore<->What4 mappings are a bit scattered and disorganized. In addition, all this depends on the types involved in the functions and external constants being a sufficiently simple subset that they can be properly represented in What4. Handling all the corner cases properly is going to be tricky.
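To illustrate why laziness complicates this, here is a small sketch with a hypothetical `Thunk` class and a stand-in opaque constant `f` (neither is taken from the SAWCore evaluator). Any bookkeeping about which constants were unfolded is only complete once the thunk has actually been forced.

```python
class Thunk:
    """A delayed computation that runs at most once, on first force."""

    def __init__(self, compute):
        self._compute = compute
        self._forced = False
        self._value = None

    def force(self):
        if not self._forced:
            self._value = self._compute()
            self._forced = True
        return self._value


unfolded = set()  # names of opaque constants unfolded so far


def unfold_f():
    # Stand-in for evaluating the body of a hypothetical opaque constant "f".
    unfolded.add("f")
    return 42


lazy_result = Thunk(unfold_f)

# Evaluation has "returned" a thunk, but nothing has been unfolded yet,
# so inspecting `unfolded` at this point would give a misleading answer.
seen_before_force = set(unfolded)

value = lazy_result.force()
seen_after_force = set(unfolded)
```

In this sketch `seen_before_force` is empty while `seen_after_force` contains `"f"`, which is why a check performed immediately after evaluation returns would be premature.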

I think we probably need to refactor saw-core-what4 by folding the majority of crucible-saw into that package instead, and simplifying the various code paths so we can gain a lot more confidence in the correctness of the translations and round-tripping properties. This should at least let us get a proper implementation of "attempt 3" above. After that, we should figure out how to refactor the SAWCore evaluators to allow what we actually want to do. This should probably roll up the other simulator refactor we want to do, which is to pull in the Cryptol backend class, so we can uniformly represent many of the underlying primitive types and operations.

This was referenced Nov 3, 2020

Structured names #875

Closed

Crucible/LLVM: Override matching branches on concrete bitvector equalities #544

Closed

robdockins force-pushed the what4-eval3 branch 3 times, most recently from c6abb56 to 29b3121 on November 13, 2020 23:51

robdockins marked this pull request as ready for review November 14, 2020 00:08

robdockins requested review from andreistefanescu and brianhuffman November 14, 2020 00:08

robdockins force-pushed the what4-eval3 branch from a4be264 to 1451010 on November 16, 2020 18:41

robdockins commented Nov 16, 2020

src/SAWScript/Crucible/LLVM/ResolveSetupValue.hs (outdated)

robdockins commented Nov 16, 2020

src/SAWScript/Crucible/LLVM/ResolveSetupValue.hs (outdated)

robdockins commented Nov 16, 2020

src/SAWScript/Builtins.hs (outdated)

robdockins force-pushed the what4-eval3 branch from e2b75f7 to c95c185 on November 16, 2020 23:18

robdockins force-pushed the what4-eval3 branch from c95c185 to d178be6 on November 17, 2020 17:53

robdockins mentioned this pull request Nov 17, 2020

Structured names #910

Merged

robdockins force-pushed the what4-eval3 branch from d178be6 to 3074a67 on November 18, 2020 00:33

robdockins mentioned this pull request Nov 19, 2020

More evaluation for crucible_term and friends #855

Open

Evaluate saw-core terms into What4 more thoroughly when setting up the symbolic simulator. This currently breaks some tests, as there are some constructs that cannot be handled by this new code path.

e5955c7

robdockins force-pushed the what4-eval3 branch from 3074a67 to e5955c7 on December 4, 2020 18:16

robdockins marked this pull request as draft December 8, 2020 19:33

robdockins closed this Jan 15, 2021

RyanGlScott deleted the what4-eval3 branch March 22, 2024 14:56
