Quality of matches: Extended acronyms #28
Yeah, mixing acronyms and consecutive letters in a query doesn't work well.
On Mon, Oct 17, 2016, 00:23 mrkishi [email protected] wrote:
The problem is something like 'ed' vs 'edit database': the d is ambiguous, so
On Mon, Oct 17, 2016, 00:31 Jean Christophe Roy [email protected] wrote:
Edlo counts as two segments of two letters; we are not able to teleport while
On Mon, Oct 17, 2016, 00:42 Jean Christophe Roy [email protected] wrote:
Could acronym space be extended to take more than 1 start-of-word letters into account? Without drastic changes to the algorithm, that is. Something that would represent the following rules:
First, before making a large change to the algorithm, I try to evaluate how far we are from a solution. Then it's a bit hard to answer whether something is possible, mostly because I'd hate to answer no. There are a few areas where I know the code that handles acronyms can be improved. I rely a lot on sequence length to sort useful matches from garbage (and there are multiple examples of garbage accidentally hitting acronym letters). A sequence of two is one letter away from an accidental match, hence my first recommendation to aim for a sequence of at least 3.
I'll add: thank you for the time you put into these reports.
Makes sense! For now, I'll train myself to use longer consecutive matches instead of acronyms. I'm the one who should be thanking you for such an awesome library. I wish I could help you with more than simple use-case discussions, but I'm afraid it'll take me quite a while to grasp the algorithms in play here (I'm trying to, nevertheless!). Also, I just saw your
Thank you, again 💯
Hi
One way that works well is typing all the acronym first letters, then if you need more refinement, you add parts of last letters.
Use case:
In the end it was natural language vs programming language. I'll try to respond to your other analysis later today.
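The "acronym first, then refine" approach described above could be checked with something like the sketch below. This is only one reading of it: I'm assuming "parts of last letters" means a contiguous piece of the last word, and `acronym_then_tail` is a hypothetical helper, not part of fuzzaldrin-plus.

```python
import re


def acronym_then_tail(query: str, candidate: str) -> bool:
    """True if `query` is the full acronym of `candidate`, optionally followed
    by a contiguous fragment taken from the rest of the last word.

    Assumption: words are separated by any non-alphanumeric characters.
    """
    words = [w for w in re.split(r"[^0-9a-zA-Z]+", candidate.lower()) if w]
    q = query.lower()
    acronym = "".join(w[0] for w in words)
    if not words or not q.startswith(acronym):
        return False
    tail = q[len(acronym):]
    # An empty tail means the query is just the plain acronym.
    return tail in words[-1][1:]
```

For example, `el` and `eltion` would both be accepted for `editor localization`, while `eldb` would not.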
@jeancroy Wow, I came here wondering if there was a way to get fuzzaldrin-plus to match out-of-order queries (i.e. "bar foo" to match "foo bar") and |
I came across another potential improvement.
Suppose we want to match `editor localization` from a long list of candidates:

We could start typing it from the start, and we'd find out there were conflicts up to `editor lo`. That's no good. We also realize it'd be foolish to try querying for `el`, as that's a very common acronym. Smart as we are, we decide to improve on the acronym and type:

With smug looks on our faces, we hit enter.
Scoring acronyms similarly to start-of-word matches is really great. However, it looks like it could be improved by also taking into account consecutive matching letters. In the previous example, `edlo` is a match of 4 consecutive middle-of-word letters, but it is also an acronym with 4 start-of-word letter matches.

I have no idea how hard it'd be to make this modification (I still have to dive deeper into the algorithm), but wouldn't it be a desirable addition?
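To illustrate what an "extended acronym" check might look like, here is a minimal Python sketch. It is not how fuzzaldrin-plus scores anything: `split_words` and `extended_acronym` are hypothetical names, and I'm assuming that every word has to contribute at least one leading letter and that the query must be fully consumed.

```python
import re
from typing import List, Optional


def split_words(candidate: str) -> List[str]:
    """Split a candidate into lowercase words on non-alphanumeric separators."""
    return [w for w in re.split(r"[^0-9a-zA-Z]+", candidate.lower()) if w]


def extended_acronym(query: str, candidate: str) -> Optional[List[int]]:
    """Return how many leading letters of each word the query uses,
    or None if the query is not a concatenation of word prefixes."""
    words = split_words(candidate)
    q = query.lower()

    def solve(wi: int, qi: int) -> Optional[List[int]]:
        if wi == len(words):
            return [] if qi == len(q) else None
        word = words[wi]
        # Longest prefix of this word that matches the query at position qi.
        n = 0
        while n < len(word) and qi + n < len(q) and word[n] == q[qi + n]:
            n += 1
        # Try the longest prefix first, then backtrack to shorter ones.
        for take in range(n, 0, -1):
            rest = solve(wi + 1, qi + take)
            if rest is not None:
                return [take] + rest
        return None

    return solve(0, 0)
```

Here `extended_acronym("edlo", "editor localization")` gives `[2, 2]` (two start-of-word letters per word), the plain acronym `el` gives `[1, 1]`, and `edlo` against `edit database` gives `None`. A scorer could then reward the per-word prefix lengths instead of treating `edlo` as two unrelated two-letter segments.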