You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[the paragraph that follows need to be edited some more, as it still contains some inaccuracies; will do once I have a sec. but, to emphasize, this is a fairly exotic/uncommon issue]
Not a very common problem, but was the case with SRDA from #7624: their OAI server is supplying the record identifiers like this: 10.6141/TW-SRDA-AN010012-1 - i.e., without the doi: prefix. This is a valid doi, and resolving it, as in https://doi.org/10.6141/TW-SRDA-AN010012-1, works. However our code appears to default to hdl: (!) - and that doesn't work of course. We just need to make this configurable on the client level, which protocol to default to when the prefix is not supplied.
Upon further investigation:
It does not just default to handles; ImportGenericServiceBean has this code:
i.e., it will try to resolve every "other identifier" - i.e, identifiers that cannot be unambiguously identified as either DOI or Handle - as a handle. In IQSS production however, it makes harvesting impossible - because the attempts to resolve above take a long time (before timing out) - probably because of some firewall.
We want to be able to turn the lookup mechanism off (but we want to add it in place by default, since somebody must have included it for a reason), by being able to specify either "doi:" or "handle:", as the prefix to default to? - should be straightforward if we make this API-only functionality. (once again, this is a fairly exotic condition; never happens when harvesting from another dataverse)
The text was updated successfully, but these errors were encountered:
landreev
changed the title
Harvesting: processing oai_dc:identifiers without Global Id protocol prefix: defaults to hdl; potentially very inefficient
Harvesting: processing oai_dc:identifiers without Global Id protocol prefix: defaults to hdl in a potentially very inefficient way
Mar 26, 2024
[the paragraph that follows need to be edited some more, as it still contains some inaccuracies; will do once I have a sec. but, to emphasize, this is a fairly exotic/uncommon issue]
Not a very common problem, but was the case with SRDA from #7624: their OAI server is supplying the record identifiers like this:
10.6141/TW-SRDA-AN010012-1
- i.e., without the doi: prefix. This is a valid doi, and resolving it, as in https://doi.org/10.6141/TW-SRDA-AN010012-1, works. However our code appears to default to hdl: (!) - and that doesn't work of course. We just need to make this configurable on the client level, which protocol to default to when the prefix is not supplied.Upon further investigation:
It does not just default to handles; ImportGenericServiceBean has this code:
i.e., it will try to resolve every "other identifier" - i.e, identifiers that cannot be unambiguously identified as either DOI or Handle - as a handle. In IQSS production however, it makes harvesting impossible - because the attempts to resolve above take a long time (before timing out) - probably because of some firewall.
We want to be able to turn the lookup mechanism off (but we want to add it in place by default, since somebody must have included it for a reason), by being able to specify either "doi:" or "handle:", as the prefix to default to? - should be straightforward if we make this API-only functionality. (once again, this is a fairly exotic condition; never happens when harvesting from another dataverse)
The text was updated successfully, but these errors were encountered: