Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create emailScoringStrategy #223

Closed
paulalbert1 opened this issue Jun 30, 2018 · 0 comments
Closed

Create emailScoringStrategy #223

paulalbert1 opened this issue Jun 30, 2018 · 0 comments
Assignees

Comments

@paulalbert1
Copy link
Contributor

paulalbert1 commented Jun 30, 2018

  1. Get any emails from article.email for targetAuthor. One author may have several emails in article.email.

  2. Get any emails from identity.email. One person may have several emails in identity.email.

  3. Get domain aliases from application.properties. For Weill Cornell, these are:

@nyp.org
@weill.cornell.edu
@med.cornell.edu
@mail.med.cornell.edu
  1. Look to see if there any cases where any emails from identity are contained within article.affiliation

  2. Else, look to see if there are any cases where identifier + domain aliases are contained within article.affiliation.

  3. If there is a match, output the following for each match:

emailMatch: [email protected]
emailMatchScore: 15
  1. If there are multiple target authors, select the author with the highest score.

Maintain score in application.properties.

Unit test PMID: 27670898 (jobuck)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants