Turn static parser on by default with the ability to shut it off. #3939

nathaniel-may · 2021-09-22T14:50:12Z

resolves #3377

Description

This change turns the static parser on by default, but also adds the ability to turn it off by introducing --no-static-parser which can be set via the cli, or an env var (STATIC_PARSER=false) and in the profile config. The sampling logic used to help determine the correctness of the experimental parser is disabled as part of this PR, but the code paths remain intact. The plan is that the "experimental parser" will be the next version of the static parser that includes new features that need to be sampled for correctness in the wild. For this same reason, I chose to duplicate logic between the two functions run_static_parser and run_experimental_parser so that we can easily add a next iteration of the static parser behind the experimental flag with minimal changes.

To test this change, I heavily rely on parsing debug log lines to get at the internals of the function because this pattern is used in other difficult-to-test places as well. I chose to use numerical codes in these log lines so that the user-facing wording can change. I have no good reason behind my choice of the 1600 block of numbers for these codes. One disadvantage of this pattern as pointed out by @jtcohen6 is that debug log lines are actually user-facing, and this will generate one per model file which could be a lot. In a future change we could consider adding a new log level for tests to reduce noise for users debugging their dbt runs.

Reviewers

The feedback I would like the most is for the quality of my test coverage. Because of the sensitivity of the correctness of this feature, I would like to cover any gaps that you might see.

render_update is a little bit complex now, and I would like any recommendations you have around making the flow simpler.

Checklist

I have signed the CLA
I have run this code in development and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
I have updated the CHANGELOG.md and added information about my change to the "dbt next" section.

nathaniel-may · 2021-09-22T21:12:31Z

I have to sign off, but the only thing I can't get to pass yet is the testing the DBT_NO_STATIC_PARSER environment flag in test_postgres_env_no_static_parser. It looks like it's not actually setting the flag to true.

Opening the PR for reviews though so I can address any concerns when I return from PTO.

jtcohen6

Broadly, looking really good! No blockers from me, just a couple of questions

core/dbt/main.py

core/dbt/parser/models.py

jtcohen6 · 2021-09-27T15:16:10Z

core/dbt/parser/models.py

            logger.debug(
-                f"1602: parser fallback to jinja because of extractor failure for {node.path}"
+                f"1602: parser fallback to jinja rendering on {node.path}"


Now that we're turning on the experimental parser by default, worth calling out: we see a lot of this debug message in full re-parses. You figure ~40% of models, in a project with 1k models in it, means ~400 log lines like this. That's not necessarily a bad thing, and it's definitely preferable to the other (very misleading) log line that we currently print once per node (#3137)

ahhh good call out. This is why I do not like that we're cornered into parsing the log for testing in this area of the code which is the only reason this message is here. We could add another log level "Test" that we put this stuff in for tests to read so we don't muddy the debug log.

Ah, I see how it's needed for tests. Ok, it's decidedly debug-level logging, I don't feel a need to be too precious about taking up space. We can reserve the right to swing back later and cut it.

jtcohen6 · 2021-09-27T15:18:07Z

core/dbt/parser/models.py

+            )
+            return "cannot_parse"
+
+    def run_experimental_parser(


It looks like this is an exact dupe of the logic in run_static_parser, except subbing out "static" for "experimental." I get that, at a future point where we have more experimental features to sample/enable, we'll want to do other things here. Is this how we want the code to be in the meantime? Honest q, I'm happy with either answer

If the team wanted this to logic to be deduplicated, I can totally get behind that. However, the rationale for this decision was to leave the scaffolding for what this feature actually needs to maximize maintainability. It's not just for the rest of the team, but also for future me when I forget what the plan was when we go to change this a few months down the line 😅

nathaniel-may · 2021-09-28T17:26:31Z

To resolve my previous comment, @gshank helped me figure out how to handle the environment variable issue I was having. Everything should work exactly as intended now.

gshank

This looks good. I do agree that all of those debug lines in the logs could make for a lot of noise. Could you open a testing ticket to add a 'test' log output? Just so we don't forget about it.

nathaniel-may · 2021-09-29T19:54:33Z

#3977

cla-bot bot added the cla:yes label Sep 22, 2021

nathaniel-may changed the title ~~bump sampling back to 100~~ turn static parser on by default Sep 22, 2021

nathaniel-may force-pushed the static-parser-by-default branch 3 times, most recently from 308f591 to e46c6c0 Compare September 22, 2021 20:21

nathaniel-may requested review from gshank, jtcohen6 and leahwicz September 22, 2021 21:13

nathaniel-may marked this pull request as ready for review September 22, 2021 21:13

nathaniel-may changed the title ~~turn static parser on by default~~ Turn static parser on by default with the ability to shut it off. Sep 22, 2021

jtcohen6 reviewed Sep 27, 2021

View reviewed changes

nathaniel-may force-pushed the static-parser-by-default branch 3 times, most recently from 6a8b9a5 to d8e1588 Compare September 28, 2021 17:04

turn on static parser by default and add --no-static-parser flag

6925ceb

nathaniel-may force-pushed the static-parser-by-default branch from e1e4f6c to 6925ceb Compare September 28, 2021 17:21

nathaniel-may requested review from jtcohen6 and kwigley September 28, 2021 17:24

jtcohen6 approved these changes Sep 29, 2021

View reviewed changes

nathaniel-may removed the request for review from kwigley September 29, 2021 13:59

nathaniel-may mentioned this pull request Sep 29, 2021

Add Jinja Sampling to Stable Static Parser #3970

Merged

4 tasks

gshank approved these changes Sep 29, 2021

View reviewed changes

nathaniel-may mentioned this pull request Sep 29, 2021

Add "test" log level #3977

Open

nathaniel-may merged commit 38eb46d into develop Sep 29, 2021

nathaniel-may deleted the static-parser-by-default branch September 29, 2021 20:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Turn static parser on by default with the ability to shut it off. #3939

Turn static parser on by default with the ability to shut it off. #3939

nathaniel-may commented Sep 22, 2021 •

edited

Loading

nathaniel-may commented Sep 22, 2021 •

edited

Loading

jtcohen6 left a comment

jtcohen6 Sep 27, 2021

nathaniel-may Sep 28, 2021

jtcohen6 Sep 29, 2021

jtcohen6 Sep 27, 2021

nathaniel-may Sep 28, 2021

nathaniel-may commented Sep 28, 2021

gshank left a comment

nathaniel-may commented Sep 29, 2021

Turn static parser on by default with the ability to shut it off. #3939

Turn static parser on by default with the ability to shut it off. #3939

Conversation

nathaniel-may commented Sep 22, 2021 • edited Loading

Description

Reviewers

Checklist

nathaniel-may commented Sep 22, 2021 • edited Loading

jtcohen6 left a comment

Choose a reason for hiding this comment

jtcohen6 Sep 27, 2021

Choose a reason for hiding this comment

nathaniel-may Sep 28, 2021

Choose a reason for hiding this comment

jtcohen6 Sep 29, 2021

Choose a reason for hiding this comment

jtcohen6 Sep 27, 2021

Choose a reason for hiding this comment

nathaniel-may Sep 28, 2021

Choose a reason for hiding this comment

nathaniel-may commented Sep 28, 2021

gshank left a comment

Choose a reason for hiding this comment

nathaniel-may commented Sep 29, 2021

nathaniel-may commented Sep 22, 2021 •

edited

Loading

nathaniel-may commented Sep 22, 2021 •

edited

Loading