-
Notifications
You must be signed in to change notification settings - Fork 1.7k
This issue was moved to a discussion.
You can continue the conversation there. Go to discussion →
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Column level lineage #3226
Comments
@dataexpertz-blr Thanks for opening, I'm surprised there wasn't already an issue for this :) It's something we're hearing and talking about a lot these days. MechanismsI view column-level lineage as existing in two orders of complexity:
Use casesAs with any compelling feature, column-level lineage feels both immensely valuable and a bit vague. If dbt could produce an EXPLAIN-style plan, of every single SQL function performed to produce a single column, that would be very cool, and also tricky to read and reason about as a human being. So I do find it useful to think concretely about the kinds of things we'd hope to enable here:
I'm curious to hear what other things come to mind! |
+4 for the use case you've mentioned @jtcohen6 , I'd like to add the following, IMO important use case:
Instead of building this yourself, you could also think about integrating DBT with the only data lineage focused SAAS product I have heard of: https://getmanta.com/integrations/ |
we are planning to use DBT Cloud for our project and this table by table lineage is really a killer feature which we would like to have as This would make data lineage and debugging much much easier |
Hi 👋 , does anyone know how Datafold + DBT claims to provide this? |
Datafold use their own lineage capabilities to do this — while they do read in your dbt project, the column level lineage is part of their platform (it's an awesome feature though!) |
This issue was moved to a discussion.
You can continue the conversation there. Go to discussion →
Describe the feature
Currently have table/view level lineage captured, can this be extended to field/column based?
Additional context
Not specific to a database, applies to the product.
Who will this benefit?
There could be multiple renames to a field and with multiple joins the complexity to track field level changes and source of a particular column becomes difficult. From an Ops perspective it always good to know where a field is coming from to quickly solve data issues.
The text was updated successfully, but these errors were encountered: