-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Blocking using approximate nearest neighbours algorithms #22
Comments
Hi @BERENZ , Thanks for letting me know. This is really nice! I don't see it on CRAN yet. Still working on it? Let me know if it is on CRAN; I will then try go get a reference to your package somewhere in the documentation. One remark: in
this fails if
gives an error. Sorry for not getting back earlier. |
Hi @djvanderlaan, Thanks for bug reporting, as always I forgot about In addition, we use your package in the mecRecordLinkage an experimental package that implements: Lee, D., Zhang, L-C., and Kim, J.K. (2022). "Maximum entropy classification for record linkage," Survey Methodology, 48, 1-23. |
Hi @djvanderlaan, finally I had some time and fixed several issues with the |
I am writing to let you know that I have developed a small package called [
blocking
] (https://github.com/ncn-foreigners/blocking) that allows blocking of records based on approximate nearest neighbours algorithms (RcppAnnoy
,RcppHNSW
andmlpack
) and graphs (igraph
). The package includes the functionpair_ann
, which was developed on the basis ofpair_blocking
andpair_minsim
to allow direct integration into your package.Here is the code using the
reclin2
sample data:Feel free to test and comment. I plan to submit this package to CRAN in December.
The text was updated successfully, but these errors were encountered: