Feat/bulk scoring #62

Merged
cristianpb merged 9 commits into dev from feat/bulk-scoring
May 20, 2020
@rhanka rhanka commented May 19, 2020

  1. move to mixed "should" matching:
  • blocking on a reduced scope ("should" with minimum_should_match), by default on name and date (at least one should match)
  • other fields are plain "should" clauses (not blocking, but involved in scoring)
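
A minimal sketch of what such a mixed query could look like, not the code from this PR. The `PersonRequest` shape, the field names, and the `buildQuery` helper are assumptions for illustration; only the `bool` / `should` / `minimum_should_match` structure and the limit of 10 hits come from the description.

```typescript
// Hypothetical request shape; field names are assumptions, not the project's schema.
interface PersonRequest {
  firstName?: string;
  lastName?: string;
  birthDate?: string;
  birthCity?: string;
  sex?: string;
}

type Clause = Record<string, unknown>;

// Emit a match clause only when the field was provided in the request.
function matchClause(field: string, value?: string): Clause[] {
  return value ? [{ match: { [field]: value } }] : [];
}

function buildQuery(req: PersonRequest): Clause {
  // Blocking scope: at least one of the name/date clauses has to match.
  const blocking = [
    ...matchClause("firstName", req.firstName),
    ...matchClause("lastName", req.lastName),
    ...matchClause("birthDate", req.birthDate),
  ];
  // Remaining fields never exclude a document; they only contribute to the score.
  const optional = [
    ...matchClause("birthCity", req.birthCity),
    ...matchClause("sex", req.sex),
  ];

  const bool: Record<string, unknown> = {};
  if (blocking.length > 0) {
    bool.must = [{ bool: { should: blocking, minimum_should_match: 1 } }];
  }
  if (optional.length > 0) {
    bool.should = optional;
  }
  // Keep the top 10 hits for later rescoring.
  return { size: 10, query: { bool } };
}
```

Nesting the blocking group under `must` keeps the outer `should` clauses optional, so a missing city or sex never filters a candidate out.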
  1. normalize the score using:
  • first and last name (handling swapped first/last names)
  • location (city, citycode, depcode, country)
  • date (handling range match)
  • sex

uses js-levenshtein
handles pruning for faster scoring (date, then name, then location)

based on the top 10 ES results.
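
The scoring and pruning order above could be sketched roughly as follows. This is an illustration under stated assumptions, not the PR's implementation: the `Person` shape, the `YYYYMMDD` and `YYYYMMDD-YYYYMMDD` date formats, the `PRUNE_THRESHOLD` value, and the sex-mismatch penalty are all hypothetical, and the inline `levenshtein` function stands in for the js-levenshtein package so the block is self-contained.

```typescript
// Hypothetical record shape; field names are assumptions.
interface Person {
  firstName: string;
  lastName: string;
  birthDate: string; // "YYYYMMDD", or "YYYYMMDD-YYYYMMDD" on the request side
  birthCity?: string;
  sex?: string;
}

// Stand-in for js-levenshtein: classic two-row edit distance.
function levenshtein(a: string, b: string): number {
  let prev = Array.from({ length: b.length + 1 }, (_, j) => j);
  for (let i = 1; i <= a.length; i++) {
    const curr = [i];
    for (let j = 1; j <= b.length; j++) {
      const cost = a[i - 1] === b[j - 1] ? 0 : 1;
      curr[j] = Math.min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost);
    }
    prev = curr;
  }
  return prev[b.length];
}

// Edit distance normalized to [0, 1], where 1 means identical.
function similarity(a: string, b: string): number {
  const x = a.trim().toLowerCase();
  const y = b.trim().toLowerCase();
  const longest = Math.max(x.length, y.length);
  return longest === 0 ? 1 : 1 - levenshtein(x, y) / longest;
}

// Single date or range; fixed-width digit strings compare lexicographically.
function dateScore(requested: string, actual: string): number {
  const [from, to = from] = requested.split("-");
  return from <= actual && actual <= to ? 1 : 0;
}

// Take the better of the straight and the swapped first/last name comparison.
function nameScore(req: Person, hit: Person): number {
  const straight = similarity(req.firstName, hit.firstName) * similarity(req.lastName, hit.lastName);
  const swapped = similarity(req.firstName, hit.lastName) * similarity(req.lastName, hit.firstName);
  return Math.max(straight, swapped);
}

const PRUNE_THRESHOLD = 0.5; // arbitrary cut-off, for illustration only

function score(req: Person, hit: Person): number {
  // Cheapest check first: a date miss skips all string comparisons.
  const date = dateScore(req.birthDate, hit.birthDate);
  if (date < PRUNE_THRESHOLD) return 0;
  const name = nameScore(req, hit);
  if (date * name < PRUNE_THRESHOLD) return 0;
  // Missing optional fields are treated as neutral.
  const location = req.birthCity && hit.birthCity ? similarity(req.birthCity, hit.birthCity) : 1;
  const sex = req.sex && hit.sex && req.sex !== hit.sex ? 0.5 : 1; // assumed penalty
  return date * name * location * sex;
}
```

Each of the top 10 hits would be passed through `score` and the list re-sorted; candidates pruned early simply get a score of 0.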

  1. concatenate the source CSV with the results to get a complete output, using the same separator as the input
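
One way the concatenation step could work, as a sketch only. The `appendResultColumns` and `quoteIfNeeded` helpers are hypothetical names, and the quoting rule (double quotes around values that contain the separator) is an assumption about the CSV dialect.

```typescript
// Quote a value only when it would otherwise break the row.
function quoteIfNeeded(value: string, separator: string): string {
  const needsQuotes = value.includes(separator) || value.includes('"') || value.includes("\n");
  return needsQuotes ? `"${value.replace(/"/g, '""')}"` : value;
}

// Append result columns to each untouched source line, reusing the input separator.
function appendResultColumns(
  sourceLines: string[],
  results: string[][],
  separator: string
): string[] {
  return sourceLines.map((line, i) => {
    const extra = (results[i] ?? []).map((v) => quoteIfNeeded(v, separator));
    return [line, ...extra].join(separator);
  });
}
```

Leaving the source line as-is means the original columns keep their exact formatting, and only the appended values need escaping.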

@rhanka rhanka force-pushed the feat/bulk-scoring branch from 5863520 to d866b7e Compare May 19, 2020 21:52
@rhanka rhanka requested a review from cristianpb May 19, 2020 21:52
This was linked to issues May 20, 2020
@cristianpb cristianpb mentioned this pull request May 20, 2020

@cristianpb cristianpb left a comment

👌

@cristianpb cristianpb merged commit 0c30df5 into dev May 20, 2020
@cristianpb cristianpb deleted the feat/bulk-scoring branch May 20, 2020 09:52
Development

Successfully merging this pull request may close these issues.

Should, not must ("Should et pas must")
Documentation about the score ("Documentation par rapport au score")
2 participants