Feat: Generate per-dialect options on init project #3733

VaggelisD · 2025-01-28T13:37:02Z

Dynamically generate the engine-appropriate connection options through Pydantic's model_fields. The following conventions are applied:

If a field is not required, it's commented out
If a field has a (literal) default value, that value is appended on the right-hand side
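
A minimal sketch of the two conventions above, assuming Pydantic v2. `ExampleConnectionConfig` and `render_connection_options` are hypothetical names used only for illustration, not the code in this PR, and treating `None` as "no literal default" is an assumption as well.

```python
from typing import Optional

from pydantic import BaseModel
from pydantic_core import PydanticUndefined


class ExampleConnectionConfig(BaseModel):
    """Hypothetical stand-in for an engine's connection config."""

    host: str
    user: str
    port: int = 5432
    keepalives_idle: Optional[int] = None
    concurrent_tasks: int = 4


def render_connection_options(config_cls, engine_type, indent="      "):
    """Render YAML-style connection lines from a Pydantic model's fields."""
    lines = [f"{indent}type: {engine_type}"]
    for name, field in config_cls.model_fields.items():
        # Convention 1: optional fields are commented out.
        prefix = "" if field.is_required() else "# "
        # Convention 2: a literal default is appended after the colon.
        default = field.default
        has_literal = default is not PydanticUndefined and default is not None
        value = f" {default}" if has_literal else ""
        lines.append(f"{indent}{prefix}{name}:{value}")
    return "\n".join(lines)


print(render_connection_options(ExampleConnectionConfig, "postgres"))
```

Because the lines are derived from `model_fields`, a field added to the config class shows up in the generated template without any change to the template itself.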

sqlmesh/cli/example_project.py

tests/cli/test_cli.py

macros/__init__.py

sqlmesh/core/config/connection.py

sungchun12 · 2025-01-28T17:48:07Z

I tested this out locally and realized it's not helpful unless you explain in plain terms what each config means, paired with an example. I recommend you update the template with this look and feel. You can leverage our existing docs to fill out the config descriptions.

This mainly needs to be done for: snowflake, databricks, bigquery, postgres, duckdb, redshift.

It will also help to remember that most serious users will want to read our docs to verify which connection config settings matter to them. Let's save them the extra step and print the link to the terminal: https://sqlmesh.readthedocs.io/en/stable/integrations/overview/#execution-engines

Nice to have: dynamically print a link in the terminal to the exact engine's connection options, based on the database. Example for sqlmesh init postgres -> https://sqlmesh.readthedocs.io/en/stable/integrations/engines/postgres/#connection-options

Actually, as I was writing this, I realized the "nice to have" path may be the best of both worlds: you add the commented-out configs and then dynamically print the link to the relevant engine. It's much easier to onboard in that flow than with YAML cluttered by long comments. Let me know what you think about making the nice-to-have path the need-to-have one.

gateways:
  local:
    connection:
      type: bigquery
      # concurrent_tasks: 1
      # register_comments: True
      # pre_ping: 
      # pretty_sql: 
      # method: BigQueryConnectionMethod.OAUTH
      # project: 
      # execution_project: # The name of the GCP project to bill for the execution of the models. If not set, the project associated with the model will be used. type: string, ex: my-project
      # quota_project: 
      # location: 
      keyfile: # Path to the keyfile to be used with service-account method # type: string, ex: '/source/keyfile.json'
      # keyfile_json: 
      # token: 
      # refresh_token: 
      # client_id: 
      # client_secret: 
      # token_uri: 
      # scopes: 
      # job_creation_timeout_seconds: 
      # job_execution_timeout_seconds: 
      # job_retries: 1
      # job_retry_deadline_seconds: 
      # priority: 
      # maximum_bytes_billed:

VaggelisD · 2025-01-28T18:02:11Z

> Actually, as I was writing this, I realized the "nice to have" path may be the best of both worlds: you add the commented-out configs and then dynamically print the link to the relevant engine. It's much easier to onboard in that flow than with YAML cluttered by long comments. Let me know what you think about making the nice-to-have path the need-to-have one.

This is what I was about to recommend too. I think the dynamic generation is needed to stay up to date with the latest options, and outputting the documentation link (as in "check here for more details") should do the trick.

So I'll simply keep the fields empty and/or commented out for now, to maintain the existing implementation, if everyone's fine with this.
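
The documentation link discussed above could be built along these lines. This is only a sketch: `connection_docs_url` is a hypothetical helper, and it assumes every engine's docs page follows the same URL pattern as the postgres example quoted in this thread, which has not been checked for the other engines.

```python
from typing import Optional

# Base URL taken from the links quoted in this conversation.
DOCS_BASE_URL = "https://sqlmesh.readthedocs.io/en/stable/integrations"


def connection_docs_url(engine_type: Optional[str] = None) -> str:
    """Return the docs link to print after `sqlmesh init`."""
    if engine_type:
        # Assumption: the engine type doubles as the docs page slug.
        return f"{DOCS_BASE_URL}/engines/{engine_type}/#connection-options"
    # Fall back to the general execution engines overview.
    return f"{DOCS_BASE_URL}/overview/#execution-engines"


print(f"Check here for more details: {connection_docs_url('postgres')}")
```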

georgesittas

LGTM. I think we may need to update a few existing doc pages if this is merged, to avoid showing the outdated config template.

VaggelisD marked this pull request as draft January 28, 2025 13:37

VaggelisD force-pushed the vaggelisd/project_init branch from d3cb328 to c5414fd Compare January 28, 2025 13:38

georgesittas reviewed Jan 28, 2025

sqlmesh/cli/example_project.py Outdated

sqlmesh/cli/example_project.py Outdated

tests/cli/test_cli.py Outdated

sqlmesh/cli/example_project.py Outdated

georgesittas reviewed Jan 28, 2025

macros/__init__.py Outdated

georgesittas reviewed Jan 28, 2025

sqlmesh/core/config/connection.py Outdated

VaggelisD force-pushed the vaggelisd/project_init branch from c7e1221 to 6b9bd8c Compare January 29, 2025 17:44

VaggelisD added 5 commits January 30, 2025 18:52

Feat: Generate per-dialect options on init project

b9bb000

PR Feedback 1

c4066b1

PR Feedback 2

5713ef2

Rename gateway

26304bf

Fix tests

dc3213c

VaggelisD force-pushed the vaggelisd/project_init branch 3 times, most recently from bf43bb7 to 62f8494 Compare January 30, 2025 16:59

Add doc link generation & more tests

1bac001

VaggelisD force-pushed the vaggelisd/project_init branch from 62f8494 to 1bac001 Compare January 30, 2025 17:11

VaggelisD marked this pull request as ready for review January 30, 2025 17:31

VaggelisD requested a review from a team January 30, 2025 17:31

georgesittas approved these changes Jan 30, 2025

sqlmesh/cli/example_project.py Outdated

sqlmesh/cli/example_project.py Outdated

georgesittas requested a review from a team January 30, 2025 18:04

PR Feedback 3

bb6be58

VaggelisD force-pushed the vaggelisd/project_init branch from 1c84f83 to bb6be58 Compare January 30, 2025 18:32

tobymao approved these changes Jan 30, 2025

VaggelisD merged commit ef7893d into main Jan 31, 2025
21 checks passed

VaggelisD deleted the vaggelisd/project_init branch January 31, 2025 08:28

treysp pushed a commit that referenced this pull request Jan 31, 2025

Feat: Generate per-dialect options on init project (#3733)

ef2c8cb

treysp mentioned this pull request Feb 6, 2025

Fix: specify init duckdb database so quickstart works #3800

Merged
