_timestamp index attribute needs to be unique for original published data #231

Open
durack1 opened this issue Dec 12, 2023 · 2 comments

durack1 commented Dec 12, 2023

We currently have only a single timestamp attribute in the ESGF index, e.g. _timestamp = 2023-05-12T14:48:11.983Z. It needs to be unique to each dataset, so that when a replica (the same dataset, bit for bit) is republished, the original timestamp is preserved. The replica would instead get a new attribute, e.g. _timestampReplica1 = 2023-12-12T19:48:11.983Z, so that the original version/timestamp can be used persistently, e.g. in the data citation and license information.

Example from https://esgf-node.llnl.gov/search/input4mips/?institution_id=PCMDI&source_id=PCMDI-AMIP-1-1-9

@MartinaSt @wolfiex @SebastienDenvil @sashakames ping

@MartinaSt

Thoughts on clean-up:

  1. ESGF index
  2. Metagrid search for version implementation
  3. data citation instructions

@sashakames
Contributor

Just catching up on this. The issue title might be better rewritten to express the concern: _timestamp is always unique because it reflects the index-assigned timestamp at sub-second granularity. What we want is for the publisher to write the original creation time of the record to the replica record, IIRC from our chat at AGU?
E.g. original_timestamp
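
A minimal sketch of that idea in Python, for illustration only. It is not the publisher's actual code: `build_replica_record` is a hypothetical helper, the `original_timestamp` field is just the name floated above, and the `replica` flag and plain-dict record layout are assumptions. In practice the index assigns `_timestamp` itself, so `indexed_at` only simulates that step.

```python
from datetime import datetime, timezone


def build_replica_record(original: dict, indexed_at: datetime) -> dict:
    """Return a replica record that carries the original creation time.

    `original` is the record from the first publication; `indexed_at`
    stands in for the time the index assigns to the new replica record.
    """
    replica = dict(original)  # shallow copy, the original record is not mutated

    # Keep the earliest known timestamp: if the source record is itself a
    # replica, reuse its original_timestamp instead of its own _timestamp.
    replica["original_timestamp"] = original.get(
        "original_timestamp", original["_timestamp"]
    )

    # Simulate the index-assigned timestamp (UTC, millisecond precision).
    stamp = indexed_at.astimezone(timezone.utc).isoformat(timespec="milliseconds")
    replica["_timestamp"] = stamp.replace("+00:00", "Z")

    replica["replica"] = True  # assumed flag marking the record as a replica
    return replica


original = {
    "id": "example-dataset",  # hypothetical identifier
    "_timestamp": "2023-05-12T14:48:11.983Z",
    "replica": False,
}
replica = build_replica_record(
    original,
    datetime(2023, 12, 12, 19, 48, 11, 983000, tzinfo=timezone.utc),
)
print(replica["original_timestamp"], replica["_timestamp"])
```

With this layout `_timestamp` stays unique per index record, while citation or license tooling could read `original_timestamp` and get the same value from every copy of the dataset.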
