Some small grammar fixes #871

Nadrieril · 2019-12-24T18:49:46Z

I recently caught up with the latest few months of standard updates. This uncovered a few errors in tests and one case where the order of alternatives in the grammar was causing my PEG parser to choke.
Since we now have `merge` for `Optional`, I also altered the grammar to allow `Some` as a label for records and unions. I guess I could have allowed other keywords too, but I chose to be conservative.

sjakobi · 2019-12-24T18:57:09Z

standard/dhall.abnf

 any-label = label

+; Allow specifically `Some` in record and union labels.
+any-label-or-some = any-label / Some


Why doesn't None need to be included too?

Because Some is a keyword, but None is a builtin function, and builtins are already allowed as labels of records/unions.

Ah, thanks! That distinction wasn't so clear to me.
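The keyword/builtin distinction above can be sketched roughly as follows. This is an illustration only: the keyword set is a small hypothetical subset rather than the full list from `dhall.abnf`, the function names simply mirror the grammar rules, and character-level validation of labels is left out.

```python
# Illustrative subset only (assumption): the authoritative list is in dhall.abnf.
# Note that "None" is deliberately absent: it is a builtin, not a keyword.
KEYWORDS = {"if", "then", "else", "let", "in", "merge", "missing", "Some"}


def is_any_label(name):
    """Sketch of `any-label`: builtins are fine, keywords are rejected."""
    return name not in KEYWORDS


def is_any_label_or_some(name):
    """Sketch of `any-label-or-some`: additionally admits the keyword `Some`."""
    return is_any_label(name) or name == "Some"


for candidate in ["None", "Some", "merge"]:
    print(candidate, is_any_label(candidate), is_any_label_or_some(candidate))
```

Under these assumptions `None` is already accepted by the plain rule, `Some` is accepted only by the extended rule, and other keywords such as `merge` stay rejected by both.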

sjakobi · 2019-12-24T19:00:18Z

tests/parser/success/preferMissingNoSpacesB.diag

@@ -1 +1 @@
-[3, 9, [24, null, 0, 7], ["foo", 0]]
+["missing//foo", 0]


That's not how the Haskell implementation parses it. Could you explain why this change is correct (and why the Haskell implementation is wrong)?

I believe that both parses are technically allowed (i.e. the grammar is ambiguous). However, the text of the grammar states "prefer the first parse" and "prefer alternatives that parse as many repetitions as possible", which can be summed up as "parse greedily". Taking that into account, we get that when parsing whatever// foo, the whole of whatever// should be consumed as an identifier (since slashes are allowed in identifiers).

Moreover, I'm quite confident that this is the correct answer, since I derived a parser directly from the ABNF (taking greediness into account) and that's the output I got.

Yeah, my reading of the grammar is that this should be parsed as a single label

Ok, seems like I just misread the grammar then! :)

If it had turned out to be ambiguous, I would have suggested that we change that, but fortunately it doesn't seem to be the case. :)
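The greedy reading discussed in this thread can be sketched with a simplified label rule. The regex below is an assumption standing in for the real `simple-label` rule (a letter or underscore, then letters, digits, `-`, `/` or `_`), and it ignores keyword handling and quoted labels entirely.

```python
import re

# Simplified stand-in for the label rule (assumption, not the real ABNF):
# slashes are allowed after the first character, so they extend the label.
SIMPLE_LABEL = re.compile(r"[A-Za-z_][A-Za-z0-9_/-]*")


def greedy_label(text):
    """Return the longest label prefix of `text`, or None if there is none."""
    match = SIMPLE_LABEL.match(text)
    return match.group(0) if match else None


print(greedy_label("missing//foo"))    # no spaces: the slashes join the label
print(greedy_label("missing // foo"))  # with spaces: the label stops early
```

With no whitespace, the whole of `missing//foo` is consumed as one label, which matches the new expected output `["missing//foo", 0]`. With spaces around `//`, the label ends at `missing` and the operator can be parsed separately.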

SiriusStarr · 2019-12-24T19:19:15Z

standard/dhall.abnf

@@ -295,9 +298,9 @@ unbraced-escape =
 ;
 ; See the `valid-non-ascii` rule for the exact ranges that are not allowed
 braced-codepoint =
-      1*3HEXDIG ; %x000-FFF
+      ("1" / "2" / "3" / "4" / "5" / "6" / "7" / "8" / "9" / "A" / "B" / "C" / "D" / "E" / "F" / "10") unicode-suffix; (Planes 1-16)


Regardless of the other changes, this tweak should be merged. I brainfarted when I wrote this and didn't think about the optimal order to avoid backtracking.
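To illustrate why the order matters for a PEG parser, here is a minimal sketch of ordered choice. The three patterns are simplified stand-ins for the alternatives of `braced-codepoint` (they skip the excluded code point ranges), the helper names are hypothetical, and it assumes the fix moves the short `1*3HEXDIG` alternative after the longer ones, as the commit title suggests.

```python
import re

# Simplified stand-ins for the ABNF alternatives (assumptions, not the real rules).
SHORT = re.compile(r"[0-9A-Fa-f]{1,3}")                        # 1*3HEXDIG
PLANE_0 = re.compile(r"[0-9A-Fa-f]{4}")                        # four hex digits
PLANES_1_16 = re.compile(r"(?:10|[1-9A-Fa-f])[0-9A-Fa-f]{4}")  # planes 1-16


def peg_choice(alternatives, text, pos):
    """PEG ordered choice: commit to the first alternative that matches."""
    for alternative in alternatives:
        match = alternative.match(text, pos)
        if match:
            return match.end()
    return None


def parse_braced(alternatives, text):
    """Parse "{" codepoint "}" and return the code point text, or None."""
    if not text.startswith("{"):
        return None
    end = peg_choice(alternatives, text, 1)
    if end is None or text[end:] != "}":
        return None
    return text[1:end]


short_first = [SHORT, PLANE_0, PLANES_1_16]
long_first = [PLANES_1_16, PLANE_0, SHORT]

print(parse_braced(short_first, "{1F600}"))
print(parse_braced(long_first, "{1F600}"))
print(parse_braced(long_first, "{41}"))
```

With the short alternative first, the choice commits after consuming `1F6`, the closing brace is not found, and the parse fails because a PEG never revisits the choice. With the longest alternative first, both `{1F600}` and `{41}` parse.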

Gabriella439 · 2019-12-24T21:55:24Z

tests/parser/success/preferMissingNoSpacesB.diag

@@ -1 +1 @@
-[3, 9, [24, null, 0, 7], ["foo", 0]]
+["missing//foo", 0]


Yeah, my reading of the grammar is that this should be parsed as a single label

Nadrieril · 2019-12-24T22:05:36Z

Added an additional tiny tweak, because one of the tests didn't respect the spec up to variable name equality, whereas the rest of the tests do.

Nadrieril added 4 commits December 24, 2019 18:49

Fix order of branches in grammar for PEG parsers

3709922

missing//foo parses as an identifier

7c7b39e

Some is a keyword and must be escaped

5029bdd

Allow Some in records and unions

23b5f70

sjakobi reviewed Dec 24, 2019

SiriusStarr reviewed Dec 24, 2019

Gabriella439 approved these changes Dec 24, 2019

Tweak variable name to respect spec

bbdb535

sjakobi approved these changes Dec 25, 2019

Nadrieril merged commit e2d08eb into dhall-lang:master Dec 27, 2019

Nadrieril deleted the fixes branch December 27, 2019 13:45

philandstuff mentioned this pull request Jan 28, 2020

Test cases for parsing "missing" #788

Merged
