TurboVault4dbt is an open-source tool that automatically generates dbt models according to our datavault4dbt templates. It takes the metadata describing your Data Vault 2.0 from one of the supported metadata storages and creates ready-to-process dbt models.
TurboVault4dbt requires a metadata analysis that is done by hand and stored in a supported metadata storage. Python must also be installed, because TurboVault4dbt is written in Python.
To use the generated models, a dbt project is required. Additionally, our dbt package datavault4dbt must be installed, because the generated dbt models call macros from this package.
You can find DDL scripts and templates for the metadata tables and the Excel sheet here.
Your metadata needs to be stored in the following five tables/worksheets:
Currently, TurboVault4dbt supports metadata input from:
- Snowflake
- BigQuery
- Google Sheets
- Excel
Our developers are constantly working on adding new connectors for more databases.
To install TurboVault4dbt, follow the instructions on this page.
You can configure the connection to your metadata storage in config.ini. Further explanation of the configuration input can be found here.
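As a rough illustration of how an INI-style file such as config.ini can be read, here is a minimal sketch using Python's standard `configparser` module. The section name `Snowflake` and the keys `account_identifier`, `database`, and `schema` are assumptions made up for this example; they are not taken from TurboVault4dbt's actual configuration, so check the configuration documentation for the real names.

```python
import configparser

# Hypothetical config.ini content. The section and key names are
# illustrative assumptions, not TurboVault4dbt's documented schema.
SAMPLE_CONFIG = """
[Snowflake]
account_identifier = my_account
database = METADATA_DB
schema = TURBOVAULT
"""


def load_section(config_text: str, section: str) -> dict:
    """Parse INI text and return one section as a plain dict."""
    parser = configparser.ConfigParser()
    parser.read_string(config_text)
    if not parser.has_section(section):
        raise KeyError(f"config has no [{section}] section")
    return dict(parser[section])


settings = load_section(SAMPLE_CONFIG, "Snowflake")
print(settings["database"])
```

To read a file on disk instead of an in-memory string, `parser.read("config.ini")` can be used in place of `read_string`.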
To execute TurboVault4dbt, you need Python installed. Run the script that matches the storage holding your metadata, e.g. turbovault_snowflake.py for Snowflake, turbovault_bigquery.py for BigQuery, and so on.
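The mapping from metadata storage to script can be sketched as a small lookup. This is only an illustration: the helper `build_command` is hypothetical, and the table lists only the two script names mentioned above, so the scripts for the other storages would need to be added from the repository.

```python
import sys

# Only the two script names mentioned in the text are listed here.
SCRIPT_BY_STORAGE = {
    "snowflake": "turbovault_snowflake.py",
    "bigquery": "turbovault_bigquery.py",
}


def build_command(storage: str) -> list[str]:
    """Return the interpreter and script name for a metadata storage."""
    key = storage.strip().lower()
    if key not in SCRIPT_BY_STORAGE:
        raise ValueError(f"no script known for storage {storage!r}")
    # sys.executable is the Python interpreter running this snippet.
    return [sys.executable, SCRIPT_BY_STORAGE[key]]


command = build_command("Snowflake")
print(" ".join(command))
```

The resulting list could then be passed to `subprocess.run(command, check=True)` from the directory that contains the TurboVault4dbt scripts.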
Then, a GUI will open that looks like this:
On the left side you can select which object types you want to generate. These are:
The right side lists all available source objects inside the connected metadata storage. You can select as many of them as you like.
Now you can click on "start" and TurboVault4dbt will generate all necessary dbt models that work with datavault4dbt!