String kernels as a feature for names #174

Closed
marfox opened this issue Feb 6, 2019 · 4 comments
Labels
task Atomic activity

Comments

@marfox
Member

marfox commented Feb 6, 2019

String kernels compare strings by their shared character windows (contiguous substrings) and character jumps (subsequences with gaps), which could make them a useful feature for names.
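To make the idea concrete, here is a minimal sketch of one well-known string kernel, the gap-weighted subsequence kernel. It is only an illustration and not code from this project: the function names, the subsequence length `n` and the `decay` value are all assumptions. Each subsequence occurrence is down-weighted by how far it stretches, so matches with jumps count less than contiguous ones.

```python
from collections import defaultdict
from itertools import combinations
from math import sqrt


def subsequence_features(s, n, decay):
    """Map a string to weighted counts of its length-n subsequences.

    Each occurrence is weighted by decay ** span, where span is the number
    of characters from its first to its last matched position (inclusive),
    so occurrences with gaps ("jumps") contribute less than contiguous ones.
    """
    features = defaultdict(float)
    for idx in combinations(range(len(s)), n):
        span = idx[-1] - idx[0] + 1
        features["".join(s[i] for i in idx)] += decay ** span
    return features


def subsequence_kernel(s, t, n=2, decay=0.5):
    """Dot product of the two subsequence feature maps."""
    fs = subsequence_features(s, n, decay)
    ft = subsequence_features(t, n, decay)
    return sum(w * ft[u] for u, w in fs.items() if u in ft)


def normalized_kernel(s, t, n=2, decay=0.5):
    """Kernel value scaled so that identical strings score 1."""
    denom = sqrt(
        subsequence_kernel(s, s, n, decay) * subsequence_kernel(t, t, n, decay)
    )
    return subsequence_kernel(s, t, n, decay) / denom if denom else 0.0


# Hypothetical name pair, purely for illustration
print(normalized_kernel("jon", "john"))
```

This brute-force version enumerates every index combination, so it is only reasonable for short strings such as names; a dynamic-programming formulation would be needed for longer text.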

@marfox marfox added the discussion Extra attention is needed label Feb 6, 2019
@marfox marfox added this to the ML linker milestone Feb 6, 2019
@marfox marfox self-assigned this Feb 7, 2019
@marfox
Member Author

marfox commented Feb 12, 2019

scikit's built-in text analyzers may be relevant because they build character n-grams, see https://scikit-learn.org/stable/modules/feature_extraction.html#limitations-of-the-bag-of-words-representation

@marfox
Member Author

marfox commented Feb 12, 2019

> scikit's built-in text analyzers may be relevant because they build character n-grams, see https://scikit-learn.org/stable/modules/feature_extraction.html#limitations-of-the-bag-of-words-representation

Character n-grams are already taken into account by the cosine similarity feature, as per linker/feature_extraction.StringList#cosine_similarity
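For readers unfamiliar with the approach, the following is a standard-library sketch of cosine similarity over character n-gram counts. It does not reproduce `StringList#cosine_similarity`: the helper names, the 2-to-3 n-gram range, the lowercasing and the whitespace padding of the whole string are assumptions chosen for illustration.

```python
from collections import Counter
from math import sqrt


def char_ngrams(text, n_min=2, n_max=3):
    """Count character n-grams of a lowercased, space-padded string."""
    padded = f" {text.lower()} "
    return Counter(
        padded[i:i + n]
        for n in range(n_min, n_max + 1)
        for i in range(len(padded) - n + 1)
    )


def cosine_similarity(a, b, n_min=2, n_max=3):
    """Cosine of the angle between the two n-gram count vectors."""
    va = char_ngrams(a, n_min, n_max)
    vb = char_ngrams(b, n_min, n_max)
    # Counter returns 0 for n-grams that are missing from vb
    dot = sum(count * vb[gram] for gram, count in va.items())
    norm = sqrt(sum(c * c for c in va.values())) * sqrt(
        sum(c * c for c in vb.values())
    )
    return dot / norm if norm else 0.0


# Hypothetical name pairs, purely for illustration
print(cosine_similarity("Jon Smith", "John Smith"))
print(cosine_similarity("Jon Smith", "Mary Jones"))
```

Unlike a gap-weighted string kernel, this only rewards contiguous windows, so a dropped or inserted character breaks every n-gram that spans it.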

@marfox
Member Author

marfox commented Mar 8, 2019

Subtask of #214

@marfox marfox mentioned this issue Mar 8, 2019
10 tasks
@marfox marfox added task Atomic activity and removed discussion Extra attention is needed labels May 6, 2019
marfox pushed a commit that referenced this issue May 8, 2019
…scriptions anymore, it's now done at feature extraction
@marfox marfox assigned marfox and unassigned marfox May 8, 2019
@marfox
Member Author

marfox commented May 9, 2019

@marfox marfox closed this as completed May 9, 2019