-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Maui Nui thoughts #3
Comments
@sunray1
Which fields are set to null is up to the individual institution. Null is not a requirement. It's just a common practice. For coordinates, fuzzy generalization is a common course of action. Ultimately, it's up to the provider. For datasets that I've published, we would delete decimalLatitude, decimalLongitude, county, locality, localityRemarks, verbatimLocality, and, sometimes, stateProvince for sensitive species. Determing which species are deemed "sensitive" is trickier than it may seem. Many default to IUCN, but that often omits local sensitivity. Again, this is ultimately up to the provider. For this reason (and a few others) global automated redaction has proven problematic. Ita also worth noting that redaction has different implications for different types of datasets. Camera trap data without coordinates is often not suitable for Ecological modelling, which is its primary use. Coordinates are fuzzy instead of removed for this reason. Also, images (and the associated metadata) of park rangers are never published. It can be a matter of life or death for those individuals. |
HI Chandra Multiple attributes in a record will be able to be obfuscated / withheld using the proposed model. I think that answers a few of the questions / use cases above. DWC uses vernacularName for common name regarding the questions:
|
Thanks Ben. |
That's what I thought, perfect! Definitely allows for all of the use cases I think
I think the concern here was more about the fact that adding a more specific reason might unintentionally draw more attention to those records vs not having enough information to distinguish a subcategory.
Perfect
Interesting, did not know that! Perhaps this could be noted somewhere in the docs. |
Of course just using a taxonomic name alone for a sensitive species is problematic. It must also include a region of sensitivity. Some taxa may be highly sensitive in one area (e.g. a species of Hakea in Australia) but may be a invasive species in another (e.g. South Africa). In the area where it is invasive, one would need to know precise localities, etc. |
@ArthurChapman agreed. I'm guessing when we get to the discussion about how we represent what changes have been made to the data - we have to indicate the record is sensitive according to XX list. That's kind of how we do it in the ALA. but a standard set of words or examples would be a good thing to have I think |
I haven't heard mention of dwc:dataGeneralizations in the conversation so far, but it is relevant. |
Definitely agree - we haven't got to the actual implementation yet. We're just trying to start to define the vocabularies first and test that they meet most scenarios using case studies. We might need to enlist your help once we get to that John! |
Ready and willing to be helpful when needed. |
I've been chatting about the state of the current vocabularies with some folks I'm working with out in Maui Nui and getting a sense of why they need to retract data and when - here are some thoughts from them:
And some questions:
The text was updated successfully, but these errors were encountered: