Let `measure-complexity` output the worst performing verification tasks by resource count #5631

keyboardDrummer · 2024-07-18T13:07:36Z

Description

- Let `measure-complexity` output the worst performing verification tasks by resource count
- Add an option `--top-x` to `measure-complexity` to configure how many of the worst tasks are shown
- Change the default of `--iterations` from 10 to 1

How has this been tested?

Add a CLI test

By submitting this pull request, I confirm that my contribution is made under the terms of the MIT license.

atomb

This is a very nice feature! There are a few places where I think slightly different terminology might be better, but I otherwise think it looks good.

atomb · 2024-07-19T15:32:33Z

Source/DafnyDriver/Commands/MeasureComplexityCommand.cs

+    CliCompilation cliCompilation,
+    IObservable<CanVerifyResult> verificationResults) {
+
+    PriorityQueue<VerificationTaskResult, int> worstPerformers = new();


Perfect place to use a priority queue!
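
For readers following the thread, here is a minimal sketch of why a priority queue fits this job. It is not the code from the PR: the `WorstPerformers` class, the `Top` method, and the tuple shape are hypothetical stand-ins for `VerificationTaskResult`. It relies only on the fact that .NET's `PriorityQueue<TElement, TPriority>` dequeues the lowest priority first, so evicting the head whenever the queue grows past the limit leaves exactly the most expensive tasks.

```csharp
using System;
using System.Collections.Generic;

public static class WorstPerformers {
  // Hypothetical helper: returns at most `limit` entries with the highest
  // resource counts, ordered from most to least expensive.
  public static List<(string Task, int ResourceCount)> Top(
      IEnumerable<(string Task, int ResourceCount)> results, int limit) {
    // PriorityQueue is a min-heap: Dequeue removes the lowest priority,
    // so the head is always the cheapest task currently retained.
    var worstPerformers = new PriorityQueue<(string Task, int ResourceCount), int>();
    foreach (var result in results) {
      worstPerformers.Enqueue(result, result.ResourceCount);
      if (worstPerformers.Count > limit) {
        worstPerformers.Dequeue(); // evict the cheapest task
      }
    }

    // Draining yields ascending order; reverse to report the worst first.
    var decreasingWorst = new List<(string Task, int ResourceCount)>();
    while (worstPerformers.Count > 0) {
      decreasingWorst.Add(worstPerformers.Dequeue());
    }
    decreasingWorst.Reverse();
    return decreasingWorst;
  }
}
```

Memory stays bounded by `limit` no matter how many verification results stream in, which is the property that makes the data structure a good match here.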

atomb · 2024-07-19T15:35:21Z

Source/DafnyDriver/Commands/MeasureComplexityCommand.cs

+    }
+
+    foreach (var performer in decreasingWorst) {
+      await output.WriteLineAsync($"Verification task on line {performer.Task.Token.line} in file {performer.Task.Token.filename} consumed {performer.Result.ResourceCount} resources");


This is a really long message to parse. I'd prefer something like `file.dfy(9): <some description of context, maybe function/method/lemma name> used 4389 RU`. If you leave it as-is, the term "resources" is kind of vague. I'd maybe use "RU" to match what the IDE does?

keyboardDrummer · 2024-07-23 (edited)

Having the line number in there is a must. We will move towards `--isolate-assertions` being the default, so having a function/method/lemma name won't say much.

Updated it to:

    Starting verification of iteration 1/1 with seed 0
    measure-complexity.dfy(5,18): Error: assertion might not hold
    The most demanding 100 verification tasks consumed these resources:
    measure-complexity.dfy(8,18): 9984
    measure-complexity.dfy(7,18): 9065
    measure-complexity.dfy(8,15): 8745
    ...

I'm not a fan of abbreviations in user interfaces. "resources" is understandable to someone who has not read the manual, while "RU" is not, and "resource units" gives me no more information than "resources". Alternatively, we could call it "complexity". Then it would say "The most complex 100 verification tasks had this complexity:", which I think would be more intuitive than "resources". I'd be in favor of that change, but then we should make it everywhere.
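
As an illustration of the updated report layout quoted above, the following sketch builds the header line followed by one `file(line,column): count` entry per task. The `ResourceReport` class, the `Format` method, and the tuple fields are hypothetical names chosen for this example; the actual change writes to the command's output asynchronously and reads positions from the task's token.

```csharp
using System.Collections.Generic;
using System.Text;

public static class ResourceReport {
  // Hypothetical formatter mirroring the layout quoted in this thread.
  // `limit` is the configured number of tasks to show, and `decreasingWorst`
  // is assumed to be sorted from most to least expensive already.
  public static string Format(
      int limit,
      IEnumerable<(string File, int Line, int Column, int ResourceCount)> decreasingWorst) {
    var report = new StringBuilder();
    report.Append($"The most demanding {limit} verification tasks consumed these resources:\n");
    foreach (var entry in decreasingWorst) {
      // Same file(line,column) prefix that Dafny uses for error locations.
      report.Append($"{entry.File}({entry.Line},{entry.Column}): {entry.ResourceCount}\n");
    }
    return report.ToString();
  }
}
```

Reusing the diagnostic-style location prefix keeps each entry short, and most editors can jump straight to it.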

atomb · 2024-07-19T15:40:30Z

Source/DafnyDriver/Commands/MeasureComplexityCommand.cs

  }

+  private static readonly Option<uint> TopX = new("--top-x", () => 10U,


I feel like there must be a better name for this. But I haven't been able to come up with one yet!

keyboardDrummer · 2024-07-23

I changed it to `--worst-amount`.

atomb · 2024-07-19T15:42:13Z

Source/IntegrationTests/TestFiles/LitTests/LitTest/cli/measure-complexity.dfy.expect

@@ -0,0 +1,30 @@
+Starting verification of iteration 1/1 with seed 0
+TestFiles/LitTests/LitTest/cli/measure-complexity.dfy(5,18): Error: assertion might not hold
+Verification task on line 8 in file measure-complexity.dfy consumed 9984 resources


I think these lines will be fragile (and might cause the nightly to fail, since Z3 tends to have different resource use on different platforms). Could you move it from an .expect file to CHECK: directives in the source file, with wildcards for the actual numbers?
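
The idea behind this request, checking the shape of each line rather than the exact resource count, can be sketched in C# with a regular expression. This is only an illustration of the principle and does not show the syntax of the test suite's `CHECK:` directives; the `OutputShape` class and its pattern are hypothetical and target the `file(line,column): count` layout adopted later in this thread.

```csharp
using System.Text.RegularExpressions;

public static class OutputShape {
  // Matches "<anything>.dfy(<line>,<column>): <count>" and accepts any
  // digits for the count, so platform-dependent Z3 numbers cannot break it.
  private static readonly Regex TaskLine = new(@"^.*\.dfy\(\d+,\d+\): \d+$");

  public static bool IsTaskLine(string line) => TaskLine.IsMatch(line);
}
```

A check of this kind still fails if the location prefix or the overall layout changes, but it passes whatever resource count a given platform reports.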

Add test and tweak output

3d3fb0e

keyboardDrummer requested a review from atomb July 18, 2024 13:07

Add release note

0254a9f

keyboardDrummer enabled auto-merge (squash) July 18, 2024 13:08

Run formatter

866684f

atomb requested changes Jul 19, 2024

Do not look at actual resource counts

3b0697e

keyboardDrummer requested a review from atomb July 23, 2024 09:21

keyboardDrummer added 2 commits July 23, 2024 11:22

Updates

2cfe46f

Merge remote-tracking branch 'origin/master' into maximumResourceCounts

d1354e7

keyboardDrummer force-pushed the maximumResourceCounts branch from 199bb77 to d1354e7 Compare July 23, 2024 09:23

keyboardDrummer added 4 commits July 23, 2024 12:35

Run formatter

b40c06c

Report total resources

50e2076

Update expect file

014d89a

Add total

788d449

atomb approved these changes Jul 24, 2024

Merge branch 'master' into maximumResourceCounts

39688c6

keyboardDrummer merged commit e397b4f into dafny-lang:master Jul 24, 2024
21 checks passed

keyboardDrummer deleted the maximumResourceCounts branch July 24, 2024 16:18
