SNOW-1869362: Plan plotter improvements #2813

sfc-gh-aalam · 2024-12-29T00:06:49Z

Which Jira issue is this PR addressing? Make sure that there is an accompanying issue to your PR.

Fixes SNOW-1869362
Fill out the following pre-review checklist:
- I am adding a new automated test(s) to verify correctness of my new code
  - If this test skips Local Testing mode, I'm requesting review from @snowflakedb/local-testing
- I am adding new logging messages
- I am adding a new telemetry message
- I am adding new credentials
- I am adding a new dependency
- If this is a new feature/behavior, I'm adding the Local Testing parity changes.
- I acknowledge that I have ensured my changes to be thread-safe. Follow the link for more information: Thread-safe Developer Guidelines
Please describe how your code solves the related issue.

Made the following improvements with this PR:

- Add a threshold so that only plans whose score is above the threshold are plotted.
- Note the name of each WITH query block.
- Print the name of SnowflakeCreateTable.
- Print the name of SelectableEntity.
- Shade WITH query blocks in gray.

sfc-gh-jrose · 2025-01-02T19:34:20Z

tests/integ/test_large_query_breakdown.py

        os.environ["ENABLE_SNOWPARK_LOGICAL_PLAN_PLOTTING"] = str(enabled)
+        os.environ["SNOWPARK_LOGICAL_PLAN_PLOTTING_THRESHOLD"] = str(
+            plotting_score_threshold
+        )


This might be better done with something like:
with mock.patch.dict(os.environ, {...}):
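A minimal sketch of that suggestion, assuming only the two environment variable names shown in the diff. The helper names `read_plotting_env` and `run_with_plotting_env` are hypothetical and exist only to make the example self-contained. `mock.patch.dict` restores `os.environ` when the `with` block exits, so no manual cleanup in a `finally` clause is needed.

```python
import os
from unittest import mock


def read_plotting_env():
    """Return the two plotting-related variables as currently set (or None)."""
    return (
        os.environ.get("ENABLE_SNOWPARK_LOGICAL_PLAN_PLOTTING"),
        os.environ.get("SNOWPARK_LOGICAL_PLAN_PLOTTING_THRESHOLD"),
    )


def run_with_plotting_env(enabled, plotting_score_threshold):
    """Hypothetical helper: apply the overrides only inside the with block."""
    overrides = {
        "ENABLE_SNOWPARK_LOGICAL_PLAN_PLOTTING": str(enabled),
        "SNOWPARK_LOGICAL_PLAN_PLOTTING_THRESHOLD": str(plotting_score_threshold),
    }
    with mock.patch.dict(os.environ, overrides):
        # In the real test, large_query_df.collect() would run here.
        return read_plotting_env()
```

After the call returns, both variables are back to whatever they were before, even if the body raised.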

sfc-gh-helmeleegy · 2025-01-02T19:57:57Z

tests/integ/test_large_query_breakdown.py

    try:
        os.environ["ENABLE_SNOWPARK_LOGICAL_PLAN_PLOTTING"] = str(enabled)
+        os.environ["SNOWPARK_LOGICAL_PLAN_PLOTTING_THRESHOLD"] = str(
+            plotting_score_threshold
+        )
        tmp_dir = tempfile.gettempdir()

        with patch("graphviz.Graph.render") as mock_render:
            large_query_df.collect()


Can we perhaps add a comment explaining that the actual complexity for large_query_df falls somewhere between 0 and 10M?

sfc-gh-helmeleegy

Looks good, thanks!

sfc-gh-yzou · 2025-01-02T22:11:18Z

src/snowflake/snowpark/_internal/compiler/utils.py

@@ -381,15 +383,27 @@ def plot_plan_if_enabled(root: LogicalPlan, filename: str) -> None:
    ):
        return

+    if int(


What is this plotting threshold used for? It seems to restrict plotting by complexity score. Maybe call it SNOWPARK_LOGICAL_PLAN_PLOTTING_COMPLEXITY_THRESHOLD to be clearer.

sfc-gh-yzou · 2025-01-02T22:12:16Z

src/snowflake/snowpark/_internal/compiler/utils.py

            if node is None:
                return "EMPTY_SOURCE_PLAN"  # pragma: no cover
            addr = hex(id(node))
            name = str(type(node)).split(".")[-1].split("'")[0]
-            return f"{name}({addr})"
+            suffix = ""
+            if isinstance(node, SnowflakeCreateTable):


Add a comment here explaining the different name formats that are printed.
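As a rough illustration of what such a comment could document, here is a sketch of the label logic with a type-specific suffix. The classes below are stand-ins, not the real Snowpark classes, and the `table_name` attribute and the ` :: ` separator are assumptions made for this example.

```python
class SnowflakeCreateTable:
    """Stand-in for the real plan node; the table_name attribute is assumed."""

    def __init__(self, table_name):
        self.table_name = table_name


class Project:
    """Stand-in for any other plan node."""


def get_name(node):
    """Build a plot label: type name, object address, and an optional suffix."""
    if node is None:
        return "EMPTY_SOURCE_PLAN"
    addr = hex(id(node))
    name = str(type(node)).split(".")[-1].split("'")[0]
    suffix = ""
    if isinstance(node, SnowflakeCreateTable):
        # Table-creating nodes: append the unqualified table name (assumed format).
        suffix = f" :: {node.table_name[-1]}"
    # All other nodes: type name and address only.
    return f"{name}({addr}){suffix}"
```

A short comment on each branch, as above, tells the reader which node types get extra detail in the plot.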

sfc-gh-yzou · 2025-01-02T22:19:16Z

src/snowflake/snowpark/_internal/compiler/utils.py

@@ -381,15 +383,27 @@ def plot_plan_if_enabled(root: LogicalPlan, filename: str) -> None:
    ):
        return

+    if int(
+        os.environ.get("SNOWPARK_LOGICAL_PLAN_PLOTTING_THRESHOLD", 0)


Let's simply make the default threshold -1, to make it clear that by default all nodes are plotted.

Was there a reason we want to add this threshold?

Yeah. In my tests, I generally want to plot and debug "big" plans, but sometimes the plans get overwritten by smaller plans if they are present somewhere. That's why I added this variable. I don't think this is the best way; I'm open to suggestions.

Can you be more specific about "sometimes the plans get overwritten by smaller plans if they are present somewhere"? I'm not quite getting this part. What information do you want to get to help your debugging process?
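For reference, the threshold gate under discussion can be sketched as follows. This is an illustration under assumptions, not the merged code: `should_plot` is a hypothetical helper, the strict `>` comparison is assumed, and the default is a parameter so that both the `0` from the diff and the suggested `-1` can be compared.

```python
import os

THRESHOLD_ENV_VAR = "SNOWPARK_LOGICAL_PLAN_PLOTTING_THRESHOLD"


def should_plot(complexity_score: int, default_threshold: int = 0) -> bool:
    """Hypothetical gate: plot only plans whose score exceeds the threshold."""
    threshold = int(os.environ.get(THRESHOLD_ENV_VAR, default_threshold))
    # Strict comparison is an assumption; the PR text only says "above a threshold".
    return complexity_score > threshold
```

Under this strict comparison, a default of -1 lets every plan with a non-negative score through, while a default of 0 would skip a plan whose score is exactly 0.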

init

229933f

sfc-gh-aalam added the NO-CHANGELOG-UPDATES label (This pull request does not need to update CHANGELOG.md) Dec 29, 2024

sfc-gh-aalam added 2 commits December 28, 2024 17:50

highlight with query blocks

0f8b069

fix codecov and pyright

7733028

sfc-gh-aalam marked this pull request as ready for review January 2, 2025 19:13

sfc-gh-aalam requested review from a team as code owners January 2, 2025 19:13

sfc-gh-aalam requested review from sfc-gh-jdu, sfc-gh-yixie and sfc-gh-jrose January 2, 2025 19:13

sfc-gh-jrose approved these changes Jan 2, 2025

sfc-gh-helmeleegy reviewed Jan 2, 2025

sfc-gh-helmeleegy approved these changes Jan 2, 2025

sfc-gh-yzou reviewed Jan 2, 2025

address comments

f97df29

sfc-gh-aalam merged commit e75b506 into main Jan 3, 2025
40 checks passed

sfc-gh-aalam deleted the aalam-SNOW-1869362-improve-plan-plotter branch January 3, 2025 18:23

github-actions bot locked and limited conversation to collaborators Jan 3, 2025
