[ test ] Fix spec001 test #3156

dunhamsteve · 2023-12-02T22:28:12Z

Description

The spec001 test appears to be broken in CI because the numbers in the machine names are different in CI in the test output. This is noted in #3155. Inspired by 1aff26e, I'm rewriting the numbers in resolved, arg, and conArg machine names to be XXX in the test output.

mattpolzin · 2023-12-03T00:37:38Z

The only question in my mind is whether this test output was supposed to be useful in spotting a change that actually caused argument numbers to change not just from x to y but from x to sometimes y and sometimes z (i.e. inconsistently). I really don't know, maybe that's not at all a perceived benefit because it's so unlikely and/or would be caught by other tests if it occurred.

dunhamsteve · 2023-12-03T07:21:26Z

I took a closer look at why these numbers are different.

Where they are different, the args and conArgs are 138 higher and $resolved is 2 higher. For resolved, I think this can happen by two more names in the global context.

I bisected locally and it looks like f2e6dc4 introduced the change:

FAIL f2e6dc431 Add dependent congruence of arity 1 and 2 for heterogeneous equality [G. Allais]
PASS 82e4fd9da Add foldrImplGoLemma to Data.Vect (#2835) [GitHub]

The change adds two functions to the Prelude, which explains $resolved. And the args and conArgs changes look like they're machine generated names inside the prelude, e.g. (this is a git "word diff")

(Just [-{arg:8742})-]{+{arg:8880})+} a b)

I don't think we can avoid a bunch of these numbers changing any time we add something to the prelude. Dunno if there is a better solution, other than updating the tests whenever things change. At least the fact that specialization is happening, and the general shape of things are captured in the output.

buzden · 2023-12-03T08:20:14Z

whether this test output was supposed to be useful in spotting a change that actually caused argument numbers to change

I think since there is no specification on what values those numbers have to be and anyway are changed blindly in any change in prelude and/or imported libraries, it is definitely okay for now. As soon as the logic of numbering is changed to be more stable, is can be revised.

The only thing, I think, that can be improved is not just hiding those numbers but bijecively mapping them to new ones with a stable numbering (say, local to particular test) so that we can check in a test that equal machine names indeed stay equal in the output, and different ones indeed stay different. But, I think, this is not a thing that should be required for this PR.

dunhamsteve · 2023-12-03T19:07:41Z

A more complicated awk script might be able to assign new numbers as it finds them in the log (and maintain the mapping of scene stuff). Ideally the $resolved stuff would handled by use of toFullNames.

So now I've got one of those situations where I've thought through a solution and want to implement it.

mattpolzin · 2023-12-03T21:02:20Z

@dunhamsteve that's an impressive awk script! I hesitate to be the one to officially approve or merge this, but I'm quite happy with your solution for my part!

gallais · 2023-12-04T07:36:18Z

Inspired by 1aff26e, I'm rewriting the numbers in resolved, arg, and conArg machine names to be XXX in the test output.

Should we have a unique function in testutils implementing that cleanup strategy uniformly?

buzden · 2023-12-04T09:16:30Z

Should we have a unique function in testutils implementing that cleanup strategy uniformly?

I think we defintely should!

BTW, @dunhamsteve, can I take your code to use in the tests in my own projects, outside the compiler?

dunhamsteve · 2023-12-04T19:23:49Z

Feel free to use the script elsewhere.

In testutils, would this look like a clean_names function and the user still has to do:

{
...commands
} | clean_names

Or would it be a special version of the idris2 function that runs idris and applies the filtering?

gallais · 2023-12-04T20:13:30Z

I'd be in favour of clean_names, yes.

dunhamsteve · 2023-12-05T05:19:37Z

I've moved the function into testutils. It is clean_names and used as described above. I had to silence the linter on one line of code, because it was upset that I was hiding $ references from the shell.

gallais · 2023-12-05T09:07:44Z

Can we use it for the test cases in find tests/ -name "run" | xargs grep " sed "?

dunhamsteve · 2023-12-06T19:58:16Z

We can use it as-is for many of them, for the others, we’d have to add the [0-9]+:[0-9]+ regex. I guess that is for the FC printing? Probably okay to rewrite those too in all cases. I’ll try to find some time tonight for that.

gallais · 2023-12-06T21:42:15Z

I don't mind if the golden values change. Uniformity first and foremost.

dunhamsteve · 2023-12-08T01:42:54Z

Ok, I've added support for:

   ttc/NNNNNNNNNN
   Foo.Bar:NN:NN--NN:NN
   P:xyz:NNNNN

Which covers most of the other sed cases.

There was one case that was special for src/TTImp/ProcessDef.idr, because it printed a bare resolved number rather than the $resolved name. I decided to address that with (getFullName (Resolved n)) rather than (Resolved n). I figured the real name would be more useful to users.

gallais · 2023-12-08T14:10:43Z

Fantastic!

buzden · 2024-03-07T10:09:48Z

@dunhamsteve I finally used your script from this PR for tests of my library, and I found out that it renames values unexpectedly. For example, I have the following diff after that:

                (MkTy <DerivedGen.X>[0, 1, 2, 3]
                      (IPi.  (MW ExplicitArg : IVar Data.Fuel.Fuel)
-                         -> (MW ExplicitArg {arg:3630} : IVar Prelude.Types.Nat)
-                         -> (MW ExplicitArg {arg:3633} : IVar Prelude.Types.Nat)
-                         -> (MW ExplicitArg {arg:3636} : IVar Prelude.Types.Nat)
-                         -> (MW ExplicitArg {arg:3639} : IVar Prelude.Types.Nat)
+                         -> (MW ExplicitArg {arg:1} : IVar Prelude.Types.Nat)
+                         -> (MW ExplicitArg {arg:1} : IVar Prelude.Types.Nat)
+                         -> (MW ExplicitArg {arg:1} : IVar Prelude.Types.Nat)
+                         -> (MW ExplicitArg {arg:1} : IVar Prelude.Types.Nat)
                          -> (IApp. IVar Test.DepTyCheck.Gen.Gen
                                  $ IVar Test.DepTyCheck.Gen.Emptiness.MaybeEmpty
                                  $ (IApp. IVar DerivedGen.X
-                                        $ IVar {arg:3630}
-                                        $ IVar {arg:3633}
-                                        $ IVar {arg:3636}
-                                        $ IVar {arg:3639}))))
+                                        $ IVar {arg:1}
+                                        $ IVar {arg:1}
+                                        $ IVar {arg:1}
+                                        $ IVar {arg:1}))))
         IDef <DerivedGen.X>[0, 1, 2, 3]
              [ PatClause (IApp. IVar <DerivedGen.X>[0, 1, 2, 3]

Am I right that your script should have renamed those variables to {arg:1}, {arg:2}, {arg:3} and {arg:4} respectively, not all of them to {arg:1}?

dunhamsteve · 2024-03-07T19:57:37Z

Yeah, that was my intent. That they would get unique names, but a consistent mapping for a given name if it recurs later in the input. I can take a look after work (PST).

dunhamsteve · 2024-03-07T20:22:34Z

"There are 2 hard problems in computer science: cache invalidation, naming things, and off-by-1 errors."

Couldn't resist taking a look now. It seems that it was ignoring the last digit of the name. (I printed m as part of the name to sort this out):

diff --git a/tests/testutils.sh b/tests/testutils.sh
index 04b588cb2..d96684324 100755
--- a/tests/testutils.sh
+++ b/tests/testutils.sh
@@ -57,7 +57,7 @@ _awk_clean_name='
     while (match($0, /(P:[A-z]+:|arg:|conArg:|ttc[\\\/][0-9]+|[$]resolved)[0-9]+|[A-z.]+:[0-9]+:[0-9]+--[0-9]+:[0-9]+|[A-z]+[.][0-9]+:[0-9]+/)) {
         rs = RSTART
         rl = RLENGTH
-        m = substr($0, rs, rl - 1)
+        m = substr($0, rs, rl)
         pfx = "XXX"
         if (match(m,/^(\$resolved|arg:|conArg:|ttc[\\\/]|P:[A-z]+:|[A-z.]+:|[A-z]+[.])/)) { pfx = substr(m, RSTART, RLENGTH) }
         if (!(m in mapping)) {

I had to update a couple of expected files after making this fix. I can PR after CI runs. Thanks for catching this.

[ test ] make spec001 test less fragile

e6612b6

[ text ] Replace arg/conArg/$resolved with consistent numbers in spec001

f0b8437

[ test ] Move clean_names into testutils.sh

f66ee2c

[ tests ] Use clean_names where possible

9b71197

gallais merged commit 59e00c5 into idris-lang:main Dec 8, 2023
22 checks passed

dunhamsteve deleted the fix-spec001 branch December 9, 2023 04:28

dunhamsteve mentioned this pull request Mar 7, 2024

[ test ] fix clean_names function in testutils.sh #3227

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ test ] Fix spec001 test #3156

[ test ] Fix spec001 test #3156

dunhamsteve commented Dec 2, 2023

mattpolzin commented Dec 3, 2023

dunhamsteve commented Dec 3, 2023

buzden commented Dec 3, 2023

dunhamsteve commented Dec 3, 2023

mattpolzin commented Dec 3, 2023

gallais commented Dec 4, 2023

buzden commented Dec 4, 2023

dunhamsteve commented Dec 4, 2023

gallais commented Dec 4, 2023

dunhamsteve commented Dec 5, 2023

gallais commented Dec 5, 2023 •

edited

Loading

dunhamsteve commented Dec 6, 2023

gallais commented Dec 6, 2023

dunhamsteve commented Dec 8, 2023

gallais commented Dec 8, 2023

buzden commented Mar 7, 2024

dunhamsteve commented Mar 7, 2024

dunhamsteve commented Mar 7, 2024

[ test ] Fix spec001 test #3156

[ test ] Fix spec001 test #3156

Conversation

dunhamsteve commented Dec 2, 2023

Description

mattpolzin commented Dec 3, 2023

dunhamsteve commented Dec 3, 2023

buzden commented Dec 3, 2023

dunhamsteve commented Dec 3, 2023

mattpolzin commented Dec 3, 2023

gallais commented Dec 4, 2023

buzden commented Dec 4, 2023

dunhamsteve commented Dec 4, 2023

gallais commented Dec 4, 2023

dunhamsteve commented Dec 5, 2023

gallais commented Dec 5, 2023 • edited Loading

dunhamsteve commented Dec 6, 2023

gallais commented Dec 6, 2023

dunhamsteve commented Dec 8, 2023

gallais commented Dec 8, 2023

buzden commented Mar 7, 2024

dunhamsteve commented Mar 7, 2024

dunhamsteve commented Mar 7, 2024

gallais commented Dec 5, 2023 •

edited

Loading