-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DataEntry should be refactored #520
Comments
i would suggest we don’t call this AbstractDataEntry, since Abstract has specific meaning in our general dispatch use. Why not just DataEntry and it can derive from AbstractDataEntry if that abstraction is required? |
rest looks good to me! |
Agreed, that was more of a conceptual design. In practice these fields would have to be populated in |
guessing you mean “Data” :-) |
xref #97 (comment) |
What hashing function do we want to use and will it be fixed to only one? |
Do we want a timestamp? |
SHA2 is good I think? |
Notes from Slack:
|
I have a suggestion on this:
The idea behind immutable id => blob pairs is that the location of the blob does not matter. in fact its highly likely that there will be multiple copies of the blob flying all over the place (hence the immutable requirement), so it seems little awkward to store a single blob store location into the Entry? perhaps i’m understanding wrong and we should just make that bullet point a bit more explicit. |
targeting v0.9 or v0.10 now that we pushing for nullhypo stuff in v0.9? |
For 0.9 we should refactor the data entries to include more information.
The proposed data entry structure that every data source (e.g. MongoDB) should implement is:
We can't actually implement this in
AbstractDataEntry
, but we should enforce that these fields are populated.JT EDIT: I just swapped label and id for easier deprecation.
JT EDIT: source -> origin from slack comment
JT EDIT: added createdTimestamp
The text was updated successfully, but these errors were encountered: