Possible source of additional annotations in the RDF file #671

Open
jameshowison opened this issue Oct 2, 2020 · 1 comment

Comments

@jameshowison
Contributor

For multiply annotated files, we started curation with the annotations from the annotator with the highest annotation count ("the top annotator"). In some cases the other annotator may have found mentions that the top annotator missed (even though they found fewer overall). It should be possible to check this as a source of additional mentions. Some of these may be included as elements in the TEI/XML file.
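A rough sketch of how that check could look, assuming each annotation is available as a dict with `file_id`, `annotator`, `start` and `end` (character offsets). Those field names and the function name `additional_mentions` are hypothetical and not taken from the actual RDF or TEI/XML schema. An annotation counts as "additional" here if it overlaps no span from the top annotator of the same file.

```python
from collections import Counter, defaultdict


def additional_mentions(annotations):
    """Return annotations by non-top annotators that overlap no top-annotator span.

    Field names (file_id, annotator, start, end) are illustrative assumptions.
    """
    by_file = defaultdict(list)
    for ann in annotations:
        by_file[ann["file_id"]].append(ann)

    extra = []
    for anns in by_file.values():
        counts = Counter(a["annotator"] for a in anns)
        if len(counts) < 2:
            continue  # file has a single annotator, nothing to compare
        # Ties are broken by first appearance in the input (Counter ordering).
        top = counts.most_common(1)[0][0]
        top_spans = [(a["start"], a["end"]) for a in anns if a["annotator"] == top]
        for a in anns:
            if a["annotator"] == top:
                continue
            # Half-open interval overlap test against every top-annotator span.
            overlaps = any(a["start"] < e and s < a["end"] for s, e in top_spans)
            if not overlaps:
                extra.append(a)
    return extra


sample = [
    {"file_id": "f1", "annotator": "A", "start": 0, "end": 5},
    {"file_id": "f1", "annotator": "A", "start": 10, "end": 15},
    {"file_id": "f1", "annotator": "A", "start": 20, "end": 25},
    {"file_id": "f1", "annotator": "B", "start": 11, "end": 14},
    {"file_id": "f1", "annotator": "B", "start": 30, "end": 36},
    {"file_id": "f2", "annotator": "A", "start": 0, "end": 4},
]
candidates = additional_mentions(sample)
```

Exact-offset matching would be stricter than the overlap test used here; which one is appropriate depends on how the annotators' spans were recorded.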

Putting this on the back burner for now as the numbers are likely to be low.

@caifand
Contributor

caifand commented Oct 12, 2020

It just occurred to me that we could potentially recycle the very small portion of annotations with mention_type != "software" and validate whether they are actually software (given that these categories were often labeled with lower certainty).
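Something like the following could pull out that review queue. It is only a sketch: `mention_type` is the field named above, while the list-of-dicts layout, the function name `non_software_candidates`, and the example type values are assumptions for illustration.

```python
from collections import Counter


def non_software_candidates(annotations):
    """Split out annotations whose mention_type is not "software" for manual review.

    Returns the candidate list plus a per-type count, to show how small
    the portion is. The record layout is an illustrative assumption.
    """
    candidates = [a for a in annotations if a.get("mention_type") != "software"]
    per_type = Counter(a.get("mention_type") for a in candidates)
    return candidates, per_type


# Hypothetical records; the type values other than "software" are made up.
records = [
    {"id": 1, "mention_type": "software"},
    {"id": 2, "mention_type": "algorithm"},
    {"id": 3, "mention_type": "software"},
    {"id": 4, "mention_type": "database"},
    {"id": 5, "mention_type": "algorithm"},
]
to_review, type_counts = non_software_candidates(records)
```

Records with no `mention_type` at all also land in the review list under this filter, which seems like the safer default for a validation pass.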


2 participants