Sort code autocompletion options by similarity based on input #65655

Mickeon · 2022-09-11T13:38:35Z

This PR no longer adds. Maybe another time.

To see how the original algorithm sort-of-behaved, click here:

Added in the String class, the algorithm assigns a likeness value between 0.0 and 1.0 to the String calling it, relying on the base String parameter.

If either Strings are empty it returns 0.0.
If base is not a subsequence of the String it returns 0.0.

The algorithm aims to give higher scores to shorter Strings that contain the base's characters closer to the beginning of the String, and closer between each other. For example:

String base = "pos"

String("pos").assign_score(base);                        // 1.0
String("post").assign_score(base);                       // 0.938
String("position").assign_score(base);                   // 0.844
String("current_animation_position").assign_score(base); // 0.260
String("potato").assign_score(base);                     // 0.0

Showcase

BEFORE	AFTER



A short example of the options moving as values closer to the base are prioritised:	And a mock-up of how this works, made in GDScript.

To make it really brief, this PR uses a combination String.similiary, the category system introduced in a previous PR, and some filtering to yield more predictable results, instead of scattering every completion option at seemingly random.

It also gives much higher priority to strings that contain the base in full, closer to the beginning or are perfect matches,

I may add more mock-ups at a later time. But I'm only one person, not writing on Godot 4 full-time yet. As such...

I encourage people to please try this out by pulling this branch!

Final notes:

This PR moves the autocompletion sorting all the way after all options have been filtered out, instead of before. I feel like there's no point sorting values if some are going to be discarded. Please do tell if there is something I'm missing.
~~Known issues:~~
- ~~This PR no longer highlights the characters that are part of the search. I do not know how to accomplish that right now.~~
~~I do need proper names for the internal function, do suggest, please!~~
~~If necessary I can open a proposal, but this is not stuff most users are particularly "savvy" about.~~
Special thanks to @fire and @KoBeWi for helping me using the custom iterator.
~~See Sort code autocompletion options by similarity based on input #65655 (comment)~~

Bugsquad edit: This closes #63706.

Mickeon · 2022-09-11T14:03:11Z

This compiles locally but it doesn't in the checks...

RedMser · 2022-09-11T14:27:33Z

This compiles locally but it doesn't in the checks...

I assume because you're including an editor header file from outside of the editor folder. So when making a tools=no build, the editor folder is not included in the compile and the build fails because of that.

Mickeon · 2022-09-11T14:32:02Z

I assume because you're including an editor header file from outside of the editor folder. So when making a tools=no build, the editor folder is not included in the compile and the build fails because of that.

Ah, oh boy. This one is going to be... interesting to solve. Oh no...

Mickeon · 2022-09-11T14:48:13Z

Hastily attempting to address it so that this can be tested as soon as possible.

RedMser · 2022-09-11T15:29:55Z

I'm very glad you're tackling this long standing issue. But this current solution still does not seem ideal and should probably combine multiple methods of assigning score.

For example, in a script extending Node, autocompletion for tree looks like this:

It's great that it finds matches that use the same letters in the same order. But the algorithm should definitely prioritize a full substring match (tree is found as-is in get_tree), as well as further prioritizing begins_with matches (so the tree_* signals should be first).

Zireael07 · 2022-09-11T15:35:02Z

@RedMser how autocompletion should work has been bikeshedded to death or close to it. Some favor 'full matches' as you do, some favor what @Mickeon did here and personally I think it needs a follow-up PR and an option switch

Mickeon · 2022-09-11T15:38:49Z

It's great that it finds matches that use the same letters in the same order. But the algorithm should definitely prioritize a full substring match (tree is found as-is in get_tree), as well as further prioritizing begins_with matches (so the tree_* signals should be first).

It would somewhat do as you say, if the implementation of #59633 were to be completely replaced with this method. But instead, I attempted combining the two.

Currently, and before, the order is kind of like this.
[Node2D, Node properties], then [Node2D, Node methods], then [Node2D, Node constants] ...
So each option is sorted by score only in between the same kind.

What I could experiment with, is something along the lines of... "If the score is high enough, it ignores the kind of option and brings it all the way up the list", But honestly, right now I'm very fearful because some checks just do not compile! Honestly the screenshot you showed me is personally odd already, I think I broke something while attempting to make this compile.

RedMser · 2022-09-11T15:52:36Z

and personally I think it needs a follow-up PR and an option switch

So each option is sorted by score only in between the same kind and the same class

This indeed is probably the kind of thing that should introduce an editor setting or two (group by inheritance level, group by field/method/constant, etc.). Tbh I had no idea how it currently worked, so maybe my suggestion does not work well and would need a greater rewrite which is a lot to ask for. 😅
Your changes are already a large improvement since the list shows more plausible matches.

right now I'm very fearful because some checks just do not compile

@Mickeon I'm not really well-versed with C++ but you could try removing the _FORCE_INLINE_ for the comparator function and/or putting the function body directly into the header file. Seems like every comparator in the code base is implemented a bit differently in that regard.

scene/gui/code_edit.cpp

core/string/ustring.cpp

Mickeon · 2022-09-12T06:59:59Z

@RedMser I have updated the calculation.
There's a large penalty the further away the base's characters are from each other:

The score actually goes below 0.0, which shouldn't be the case (I don't want it to be the case), but it does improve the order of results considerably.
At some point later I will experiment with outright discarding any option whose score is below a certain threshold (basically humanly impossible to discern how the base is inside).

EricEzaM · 2023-01-09T15:03:03Z

Ok I have revisited this again. Found more stuff that I may have overlooked in my initial PR, or has been changed since which messed some things up. diff on this PR: https://github.com/Mickeon/godot/compare/editor-autocompletion-sort-by-score...EricEzaM:65655?expand=1

If you want to test, try the build artefact from here when it finishes: https://github.com/EricEzaM/godot/actions/runs/3874969199
See below for my testing results using peoples responses in this thread. It looks ok to me but there may be some edge cases.

Test results for every example in this thread: [toggle - long image]

Also fixes #71059

P.S. 'Container Sizing' should not be there in 2nd capture. That is a property group. Issue for another PR though.

ajreckof · 2023-01-16T06:39:52Z

in this screenshot, fourth results seems inappropriate. I don't know why this one was ranked so high. Is it because it ends with te? if so i don't think ending with a match should be ranked as high as starting with a match

EricEzaM · 2023-01-16T06:45:56Z

@ajreckof it's because it is a local function on the class.

It's true that it might be better for things which are local but are bad matches to be pushed lower.

ajreckof · 2023-01-16T06:56:32Z

yeah it feels conter productive for them to show when they are not that much pertinent. I think results should first be sorted for match and on same level of match being sorted by scope.

Kakiroi · 2023-01-16T10:05:40Z

Is there a way to remedy this? I don't think it's that useful for the first match to come up for such common abbreviation. IF I wanted the first match, I would search 'Visual' first. Maybe give more points for entry that has matched letters from the start? (or early?)

Funny because, v3 actually gives Vector3 as first entry. Feels like shortening words give more accurate result.

EricEzaM · 2023-01-16T10:30:10Z

~~That is what this work attempts to resolve.~~

Oh I see, that is with this PR? I'll take a look.

ajreckof · 2023-01-16T23:55:44Z

maybe there should be a penalty by how many times it is separated. The more parts there is to find the word the farther it should be ? this would be a more general way to differentiate than just it is in one part it is in multiple parts (don't know how hard it would be to implement sorry)

EricEzaM · 2023-01-18T13:06:45Z

I have tried to resolve the feedback above. I ended up adding a levenshtein distance implementation that I pinched from here (wikibooks - cc0). I more or less understand how it works but there is a nice implementation already documented so we're standing on the shoulders of giants etc etc

I also got rid of the sorting based on 'kind' (constant, function, signal, etc) it probably added additional complexity without measurable benefit.

I kind of hate how there are some 'magic numbers' in the implementation though....

Test results for every example in this thread: [toggle - long image]

Note one annoying thing is that the text that is highlighted is not actually the sections of text that were 'matched' by the sorting algo. The thing that the highlights uses is just a subsequence search so it sometimes highlights the wrong thing (e.g. get_tree above - the search algo actually sorted it based on tree exactly matching.

download (once action is done running)
diff with the current state of this PR
main sorting algo location in code

ps @Kakiroi that long thing was being ranked higher since it had vec3 as an exact match where nothing else did. So I added some stuff to disregard exact match if it is very far away from the start.

Kakiroi · 2023-01-19T19:22:41Z

Phenomenal. Already feels so much better. Also good call on removing sorting by type.

I'm assuming class is ranked lower? Or does it ignores capitalization? Or maybe this is intended?
Also, it is making this error whenever I type "ti".

Good luck, chief.

To make it really brief, uses a combination `String.similiary`, the category system introduced in a previous PR, and some filtering to yield more predictable results, instead of scattering every completion option at seemingly random. It also gives much higher priority to strings that contain the base in full, closer to the beginning or are perfect matches. Also moves CodeCompletionOptionCompare to code_edit.cpp

… to improve as many different use cases as possible

Mickeon · 2023-02-12T10:09:05Z

@EricEzaM Could you guide me to bring this PR to the way it is done in your branch? A git patch, perhaps?

EricEzaM · 2023-02-12T11:24:15Z

I think my branch is based on this branch so if you pull it down it should work. I have done that before with other people branches (non-pr).

Anyway, @ajreckof expressed interest in investing a solution further since while I was making progress on it, I was still not satisfied and perhaps others would have better ideas. Their solution is looking pretty good, but not polished up yet. @ajreckof Maybe this is a decent time to share your work?

ajreckof · 2023-02-12T14:57:36Z

So here is my work https://github.com/ajreckof/godot/tree/editor-autocompletion-sort-by-rules. It is based on your branch too so if you feel like adding it you can always just pull from it.

What it does is mostly make the match shown the correct ones. Which subsequence is kept is based on the ordering so that if later on we modify the way matches are ranked the right match will still show the best one.
Then it improves and simplifies the ordering by relying only on the shown match.

akien-mga · 2023-06-08T16:13:16Z

Superseded by #75746. Thanks for the contribution nevertheless!

Mickeon requested review from a team as code owners September 11, 2022 13:38

Mickeon force-pushed the editor-autocompletion-sort-by-score branch 2 times, most recently from be5e503 to d79fda7 Compare September 11, 2022 13:56

Mickeon force-pushed the editor-autocompletion-sort-by-score branch from d79fda7 to 8bcebc6 Compare September 11, 2022 14:35

Calinou added enhancement topic:editor usability labels Sep 11, 2022

Calinou added this to the 4.0 milestone Sep 11, 2022

Calinou added the needs testing label Sep 11, 2022

KoBeWi reviewed Sep 11, 2022

View reviewed changes

scene/gui/code_edit.cpp Outdated Show resolved Hide resolved

Mickeon marked this pull request as draft September 11, 2022 19:21

bruvzg reviewed Sep 11, 2022

View reviewed changes

core/string/ustring.cpp Outdated Show resolved Hide resolved

Mickeon force-pushed the editor-autocompletion-sort-by-score branch from 8bcebc6 to d308483 Compare September 12, 2022 06:55

Mickeon force-pushed the editor-autocompletion-sort-by-score branch from d308483 to a3c87ab Compare September 12, 2022 08:07

Mickeon requested review from bruvzg and KoBeWi and removed request for bruvzg and KoBeWi September 12, 2022 08:12

akien-mga mentioned this pull request Jan 9, 2023

Autocomplete works only from the beginning of the property/method name #71059

Closed

Mickeon and others added 4 commits January 29, 2023 22:00

Trying again to improve code completion

f9fbc06

Add levenshtein distance for comparisons, remove kind sort order, try…

82c0946

… to improve as many different use cases as possible

Fixups

30cc0bb

YuriSizov modified the milestones: 4.0, 4.1 Feb 10, 2023

YuriSizov added the needs work label Feb 10, 2023

Mickeon force-pushed the editor-autocompletion-sort-by-score branch from c8ea22d to 30cc0bb Compare February 12, 2023 12:01

akien-mga mentioned this pull request Feb 15, 2023

Unrelated Completion Suggestions #73352

Closed

Zireael07 mentioned this pull request Apr 6, 2023

script Suggestions SHOULD order by match-degree godotengine/godot-proposals#6645

Closed

ajreckof mentioned this pull request Apr 6, 2023

Sort code autocompletion with rules #75746

Merged

akien-mga closed this Jun 8, 2023

akien-mga added the archived label Jun 8, 2023

Mickeon deleted the editor-autocompletion-sort-by-score branch December 30, 2023 11:50

AThousandShips removed this from the 4.1 milestone Dec 30, 2023

AThousandShips removed the needs testing label Dec 30, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sort code autocompletion options by similarity based on input #65655

Sort code autocompletion options by similarity based on input #65655

Mickeon commented Sep 11, 2022 •

edited

Loading

Mickeon commented Sep 11, 2022

RedMser commented Sep 11, 2022

Mickeon commented Sep 11, 2022

Mickeon commented Sep 11, 2022 •

edited

Loading

RedMser commented Sep 11, 2022

Zireael07 commented Sep 11, 2022

Mickeon commented Sep 11, 2022 •

edited

Loading

RedMser commented Sep 11, 2022

Mickeon commented Sep 12, 2022 •

edited

Loading

EricEzaM commented Jan 9, 2023 •

edited

Loading

ajreckof commented Jan 16, 2023 •

edited

Loading

EricEzaM commented Jan 16, 2023

ajreckof commented Jan 16, 2023

Kakiroi commented Jan 16, 2023

EricEzaM commented Jan 16, 2023 •

edited

Loading

ajreckof commented Jan 16, 2023 •

edited

Loading

EricEzaM commented Jan 18, 2023 •

edited

Loading

Kakiroi commented Jan 19, 2023

Mickeon commented Feb 12, 2023

EricEzaM commented Feb 12, 2023

ajreckof commented Feb 12, 2023 •

edited

Loading

akien-mga commented Jun 8, 2023

Sort code autocompletion options by similarity based on input #65655

Sort code autocompletion options by similarity based on input #65655

Conversation

Mickeon commented Sep 11, 2022 • edited Loading

Showcase

I encourage people to please try this out by pulling this branch!

Mickeon commented Sep 11, 2022

RedMser commented Sep 11, 2022

Mickeon commented Sep 11, 2022

Mickeon commented Sep 11, 2022 • edited Loading

RedMser commented Sep 11, 2022

Zireael07 commented Sep 11, 2022

Mickeon commented Sep 11, 2022 • edited Loading

RedMser commented Sep 11, 2022

Mickeon commented Sep 12, 2022 • edited Loading

EricEzaM commented Jan 9, 2023 • edited Loading

ajreckof commented Jan 16, 2023 • edited Loading

EricEzaM commented Jan 16, 2023

ajreckof commented Jan 16, 2023

Kakiroi commented Jan 16, 2023

EricEzaM commented Jan 16, 2023 • edited Loading

ajreckof commented Jan 16, 2023 • edited Loading

EricEzaM commented Jan 18, 2023 • edited Loading

Kakiroi commented Jan 19, 2023

Mickeon commented Feb 12, 2023

EricEzaM commented Feb 12, 2023

ajreckof commented Feb 12, 2023 • edited Loading

akien-mga commented Jun 8, 2023

Mickeon commented Sep 11, 2022 •

edited

Loading

Mickeon commented Sep 11, 2022 •

edited

Loading

Mickeon commented Sep 11, 2022 •

edited

Loading

Mickeon commented Sep 12, 2022 •

edited

Loading

EricEzaM commented Jan 9, 2023 •

edited

Loading

ajreckof commented Jan 16, 2023 •

edited

Loading

EricEzaM commented Jan 16, 2023 •

edited

Loading

ajreckof commented Jan 16, 2023 •

edited

Loading

EricEzaM commented Jan 18, 2023 •

edited

Loading

ajreckof commented Feb 12, 2023 •

edited

Loading