-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to customize originEntity. #1848
Comments
Actually, this is the full list of possible values:
This is a problem with enums in general. They don't expand well to open source (one cannot easily customize it for their use case; these values made sense at LI but probably not elsewhere). @mars-lan thoughts on how to resolve this? Both for the existing enums and going forward. I feel like, going forward, perhaps we should ban enums in new models? Enforce the fields to be strings. Then, the would-be-enum-values can be constants in a Java utility class somewhere (e.g. |
I think "data origin" isn't the correct choice here in the first place. @cobolbaby when you say "identify the database instance", do you mean "different database hosts with different tables", or "different database hosts with same tables via replication"? |
In my opinion, |
@cobolbaby please help us understand your use case better so we can make the correct recommendation here |
Data sets often contain some attributes used to indicate the source of the data:
In order to better show the business hierarchy in Datahub, I temporarily thought of establishing the following mapping relationship.
But the current origin is an enumerated constant. In the prod env, we don't have dev, only prod, so it doesn't make much sense to us. So I hope to skip this layer |
I believe this is somewhat related to #1853 (comment)? Each part of the Dataset URN (or any other URNs for that matter) has a specific use,
As a result, the correct mapping for your Postgres dataset should be
At this point you'll be asking "what about db host/instance?", which is why I was asking #1848 (comment). Internally we have an entity called We do plan to open source the |
Thank you for your suggestion. I will make some adjustments based on this standard. Close the issue for now. I will always pay attention to the progress of the |
Currently originEntity is defined as dev or prod. In actual, we prefer it to identify the database instance.
Now it is not clear how to customize this information. Looking forward to some suggestions.
The text was updated successfully, but these errors were encountered: