CLI to interact with the column-level lineage calculated by dbt Cloud and visible in dbt Explorer. Presented at Coalesce 2024.
The slides of the presentation are available here and the recording will be added shortly.
- clone this repo
- install
poetry
- type
poetry install
to install the relevant dependencies - type
poetry run dbt-cll-evaluator ...
to use the tool
poetry run dbt-cll-evaluator --help
will list the different options available in the tool
All commands require you to provide a dbt Cloud API token with access to the Discovery API. You can use a Personal Access Token or a Service Token.
We also need to provide the dbt Cloud environment ID from which we want to retrieve the information. You will likely want to pick the Production environment ID of the dbt Cloud project you are interested in.
Finally, today, the API returns the column-level linage information for all columns upstream and downstream of a give model (e.g. it returns the CLL for all the +my_model+
). This means that we currently can't get the entire column level lineage at once and we need to provide a node_id
to start from (e.g. model.my_dbt_project.my_model
).
All the commands that support a --fix
option will also require you to provide the path where your dbt project is located.