Make grammar creation deterministic #89

aphedges · 2023-04-08T23:03:26Z

I was encountering a bug in our agent where the same input DAIDE was being parsed properly by daidepp in some cases but not in others. I assumed it was the fault of the agent, but I ended up creating a minimal reproduction using only daidep:

from daidepp import PRP, create_daide_grammar, daide_visitor

ALL_GRAMMAR = create_daide_grammar(level=160, string_type="all", test=True)
msg = "PRP(XDO((ENG AMY LVP) MTO WAL))"
parse_tree = ALL_GRAMMAR.parse(msg)
parsed_msg = daide_visitor.visit(parse_tree)
print(f"parsed_msg = {parsed_msg!r}")
print(f"type(parsed_msg) = {type(parsed_msg)}")
assert isinstance(parsed_msg, PRP)

The string PRP(XDO((ENG AMY LVP) MTO WAL)) should always be parsed to an object of type PRP, but it was often being parsed to a list instead. Running the above script repeatedly will show the output can differ.

I ended up figuring out that daidepp was generating grammars non-deterministically across runs. Sometimes the grammar ended up leading to a different, incorrect parse, and I was able to even show that (with a low probability) parsing can fail entirely. I have modified this library to generate the grammar deterministically. I also fixed a related bug that prevented proper parsing of arrangements and made some smaller code changes to improve some problems I found during my work.

Notes

To repeatedly run the above Python script, I used the following Bash script:

#!/usr/bin/env bash

for ((i = 0; i < 10; i++)); do
  python daide_test.py
done

To view the grammar being generated, I used the following patch:

diff --git a/src/daidepp/grammar/grammar_utils.py b/src/daidepp/grammar/grammar_utils.py
index d2b8c5e..cc69d97 100644
--- a/src/daidepp/grammar/grammar_utils.py
+++ b/src/daidepp/grammar/grammar_utils.py
@@ -45,6 +45,7 @@ def create_daide_grammar(
     level: Union[DAIDELevel, List[DAIDELevel]] = 30,
     allow_just_arrangement: bool = False,
     string_type: Literal["message", "arrangement", "all"] = "message",
+    test: bool = False,
 ) -> DAIDEGrammar:
     """Create a DAIDEGrammar object given a level of DAIDE.

@@ -71,6 +72,10 @@ def create_daide_grammar(
         string_type = "arrangement"

     grammar_str = _create_daide_grammar_str(level, string_type)
+    if test:
+        from pathlib import Path
+        from time import time_ns
+        Path("grammars", f"{string_type}_{level}_{time_ns()}.txt").write_text(grammar_str)
     grammar = DAIDEGrammar(grammar_str)
     return grammar

It assumes that a directory named grammars already exists. This specific diff used 6c10f70 as its base.

The docstrings say the parameter is deprecated, so I have removed it everywhere but external interfaces.

There was an `if` statement in `_create_grammar_str_from_dict()` to ensure `arrangement` would count as a start symbol. However, when `string_type` was set to `arrangement`, the line could never be run because the `if` statement was only executed when `string_type == message`. I modified the condition to run for either of these values of `string_type`.

The symbols are read into sets, which removes the original ordering present in the grammars. The order of symbols matters when parsing, so this would cause parsing to sometimes return different results or fail entirely on the same input. Re-sorting the symbols after the set operations prevents the failures and makes the output consistent across runs.

aphedges · 2023-04-15T23:05:49Z

Thanks for merging!

By the way, I did intend for this to merged without squashing and wrote the commit history carefully for that. For future PRs (it's obviously too late for this one now), should I note that if it's what I want? I had expected it here because previous PRs weren't squashed. I'm just not really sure about the conventions for this repo.

This reverts commit ff56a90.

mjspeck · 2023-04-15T23:16:41Z

Thanks for merging!

By the way, I did intend for this to merged without squashing and wrote the commit history carefully for that. For future PRs (it's obviously too late for this one now), should I note that if it's what I want? I had expected it here because previous PRs weren't squashed. I'm just not really sure about the conventions for this repo.

Sorry about that! Lemme double check about whether that would conflict with our auto-versioning system. If it's fine I could revert the merge and we could re-merge without the squash if you'd like. I'll also update the PR template for people to specify whether they want to squash or not.

aphedges · 2023-04-15T23:18:26Z

Sorry about that! Lemme double check about whether that would conflict with our auto-versioning system. If it's fine I could revert the merge and we could re-merge without the squash if you'd like.

It's not a big deal. I don't think it's worth the effort to do it now.

Thanks for the offer, though!

aphedges added 5 commits April 7, 2023 21:01

Remove unused function in grammar_utils.py

d4a5e25

Fix type annotation for _sort_grammar_keys()

64ce909

Remove internal use of allow_just_arrangement

932c95e

The docstrings say the parameter is deprecated, so I have removed it everywhere but external interfaces.

mjspeck self-requested a review April 15, 2023 17:01

mjspeck merged commit ff56a90 into SHADE-AI:main Apr 15, 2023

mjspeck added a commit that referenced this pull request Apr 15, 2023

Revert "Make grammar creation deterministic (#89)"

b04158f

This reverts commit ff56a90.

aphedges mentioned this pull request Jun 5, 2023

Make keyword-based grammar creation deterministic #96

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make grammar creation deterministic #89

Make grammar creation deterministic #89

aphedges commented Apr 8, 2023

aphedges commented Apr 15, 2023

mjspeck commented Apr 15, 2023 •

edited

Loading

aphedges commented Apr 15, 2023 •

edited

Loading

Make grammar creation deterministic #89

Make grammar creation deterministic #89

Conversation

aphedges commented Apr 8, 2023

Notes

aphedges commented Apr 15, 2023

mjspeck commented Apr 15, 2023 • edited Loading

aphedges commented Apr 15, 2023 • edited Loading

mjspeck commented Apr 15, 2023 •

edited

Loading

aphedges commented Apr 15, 2023 •

edited

Loading