Improving the metadata on Bioversity Dataverse #8

Open
gubi opened this issue Oct 5, 2018 · 2 comments

gubi commented Oct 5, 2018

As requested via mail:

Looking at the Metadata tool output, we feel that there are two priority areas: adding AGROVOC term IDs to the keywords, and adding the geographical coverage to the metadata records.

Keywords

Would it be possible to extract all unique keywords from the Bioversity Dataverse, check these terms against AGROVOC and, if an exact match is found in AGROVOC, add the corresponding AGROVOC term-id to the relevant Dataverse entries?
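A minimal sketch of the keyword step, assuming the keywords have already been exported from Dataverse into plain dictionaries and that AGROVOC labels are available as a local label-to-ID lookup. The names `unique_keywords`, `propose_term_ids`, the record layout, and the term IDs are all hypothetical placeholders, not real AGROVOC values. It only produces proposals for a human to review and does not write anything back.

```python
from collections import defaultdict


def unique_keywords(datasets):
    """Map each normalized keyword to the ids of the datasets that use it."""
    index = defaultdict(set)
    for ds in datasets:
        for kw in ds["keywords"]:
            index[kw.strip().lower()].add(ds["id"])
    return index


def propose_term_ids(datasets, agrovoc_labels):
    """Return (dataset_id, keyword, term_id) proposals for exact label matches."""
    proposals = []
    for kw, ds_ids in sorted(unique_keywords(datasets).items()):
        term_id = agrovoc_labels.get(kw)
        if term_id is None:
            continue  # no exact match: leave the keyword untouched
        for ds_id in sorted(ds_ids):
            proposals.append((ds_id, kw, term_id))
    return proposals


# Illustrative data only: dataset ids and term ids are placeholders.
datasets = [
    {"id": "ds-1", "keywords": ["Maize", "food security"]},
    {"id": "ds-2", "keywords": ["maize ", "banana varieties of Uganda"]},
]
agrovoc_labels = {"maize": "c_PLACEHOLDER_1", "food security": "c_PLACEHOLDER_2"}

proposals = propose_term_ids(datasets, agrovoc_labels)
for row in proposals:
    print(row)
```

Matching here is exact after trimming whitespace and lowercasing; anything looser (plurals, synonyms) would need a separate decision.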

Coverage (geographical)

Would it be possible to use the existing metadata, e.g. the dataset title and description fields, to (semi-)automatically extract the country names and insert these in the geographical coverage field?
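A rough sketch of the semi-automatic part, assuming a plain list of country names is enough to start with. `COUNTRIES` is a tiny illustrative list (a real run would load a complete one, for example ISO 3166 names) and `suggest_countries` is a hypothetical helper. The output is a list of suggestions for review, not a value to insert blindly.

```python
import re

# Tiny illustrative list; a real run would load a complete country list.
COUNTRIES = ["Ethiopia", "India", "Niger", "Nigeria", "Papua New Guinea"]

# Longest names first, with word boundaries so "Niger" does not match
# inside "Nigeria" and "India" does not match inside "Indian".
_PATTERN = re.compile(
    r"\b("
    + "|".join(re.escape(c) for c in sorted(COUNTRIES, key=len, reverse=True))
    + r")\b",
    re.IGNORECASE,
)
_CANONICAL = {c.lower(): c for c in COUNTRIES}


def suggest_countries(title, description):
    """Return the sorted country names mentioned in the title or description."""
    text = f"{title}\n{description}"
    found = {_CANONICAL[m.group(1).lower()] for m in _PATTERN.finditer(text)}
    return sorted(found)


suggestions = suggest_countries(
    "Banana diversity survey in Nigeria",
    "Household data collected in Nigeria and ETHIOPIA between 2012 and 2014.",
)
print(suggestions)
```

Plain name matching will miss adjectives ("Ethiopian") and will misfire on ambiguous names, which is one reason the results need a human check.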

NB: any changes made by these scripts would still be checked and approved by a human before the modified records are posted/published!

gubi self-assigned this Oct 5, 2018

gubi commented Oct 24, 2018

Regarding the geospatial coverage, in the BioversityDataset there's a dedicated field: datasetVersion › metadataBlocks › geospatial › fields › ... . I guess this is the one you mean.
I don't think it is possible to fill it starting from a simple "string" value, unless we use AI or another external dataset. It is very complicated and requires a lot of text-parsing code.
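For reference, a small sketch of reading that field from an exported dataset record, assuming the record is already loaded as a Python dictionary. Only the path datasetVersion › metadataBlocks › geospatial › fields comes from the dataset; the helper name `geospatial_fields` and the contents of the sample record (field name, value shape, country) are assumptions for illustration.

```python
def geospatial_fields(dataset_json):
    """Return the list under datasetVersion > metadataBlocks > geospatial > fields, or []."""
    node = dataset_json
    for key in ("datasetVersion", "metadataBlocks", "geospatial", "fields"):
        if not isinstance(node, dict) or key not in node:
            return []  # block missing: treat the dataset as having no coverage yet
        node = node[key]
    return node if isinstance(node, list) else []


# Hypothetical record; the entry inside "fields" is assumed, not taken from real data.
record = {
    "datasetVersion": {
        "metadataBlocks": {
            "geospatial": {
                "fields": [
                    {
                        "typeName": "geographicCoverage",
                        "value": [{"country": {"typeName": "country", "value": "Peru"}}],
                    }
                ]
            }
        }
    }
}

fields = geospatial_fields(record)
print(len(fields))
```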

Could you please share the output so that I can analyze it?

gubi added a commit that referenced this issue Oct 25, 2018

gubi commented Nov 1, 2018

Created the https://github.com/gubi/bioversity_agrovoc-indexing repository.
Waiting for the ownership transfer.
