fix detection of suffix/prefix changes for name-changes #4421

MoKob · 2017-08-17T13:42:24Z

Issue

Resolves #4420

Tasklist

add regression / cucumber cases (see docs/testing.md)
review
adjust for comments

oxidase

@MoKob please could you tell what the status of the PR, i left some comments about utf-8, but if it is a US-ASCII-only solution then just ignore comments

oxidase · 2017-10-05T06:29:05Z

include/util/guidance/name_announcements.hpp

+        }
+    }
+    // the best position marks the end of the string
+    return lhs.substr(best_pos - best, best);


Should it be UTF-8 friendly? Cyrillic 'а' (#xD0 #xB0) and 'б' (#xD0 #xB0) have common prefix xD0 but the returned prefix has no encoding.

http://coliru.stacked-crooked.com/a/1bdfc10033256f67

ab аб�

I don't think this is an issue, as the substring still needs to match our pre-set database of suffixes/prefixes. Unless incomplete suffixes are stored within that set, we cannot match against what is left.

oxidase · 2017-10-05T06:29:56Z

include/util/guidance/name_announcements.hpp

+        return "";
+
+    // array for dynamic programming
+    std::vector<std::vector<std::uint32_t>> dp(lhs.size(),


Only two single vectors can be used.

oxidase · 2017-10-05T06:50:01Z

include/util/guidance/name_announcements.hpp

+
+    // trim spaces, transform to lower
+    const auto trim = [](auto str) {
+        boost::to_lower(str);


utf-8 problem here http://coliru.stacked-crooked.com/a/7da4a8be589f8c36

oxidase · 2017-10-05T07:02:48Z

include/util/guidance/name_announcements.hpp

-        const auto first_prefix_and_suffixes = getPrefixAndSuffix(first);
-        const auto second_prefix_and_suffixes = getPrefixAndSuffix(second);
+            const auto checkTable = [&](const std::string &str) {
+                // workaround for cucumber tests:


well... it's a bad side-effect of having all the different roads that are just two letters and include N/E/S/W in their lettering :(. In the end, this check prevents matching suffixes that are just 1 of two letters. This could be relevant in 1N 1S, if that should exist anywhere. In general, it is more helpful on cucumber tests, where we would need to ensure that no name is actually just two letters (which is mostly the case right now)

MoKob added Guidance Review labels Aug 17, 2017

MoKob force-pushed the fix/suffix-detection branch from fc98077 to 8cdfd2a Compare August 18, 2017 12:51

MoKob added this to the 5.12.0 milestone Aug 18, 2017

MoKob force-pushed the fix/suffix-detection branch 3 times, most recently from 3e53c12 to 2a52bed Compare August 18, 2017 14:36

TheMarex modified the milestones: 5.13.0, 5.12.0 Aug 31, 2017

MoKob requested a review from oxidase October 2, 2017 11:46

oxidase requested changes Oct 5, 2017

View reviewed changes

Moritz Kobitzsch added 2 commits October 10, 2017 13:27

fix detection of suffix/prefix changes for name-changes

01e1beb

fix pedantic warning about additional ;

bea5778

MoKob force-pushed the fix/suffix-detection branch from 2a52bed to bea5778 Compare October 10, 2017 11:27

MoKob added Review - In feedback and removed Review labels Oct 11, 2017

remove workaround, reduce memory consumption in lcs computation

93281d5

MoKob force-pushed the fix/suffix-detection branch from b2a7c9c to 93281d5 Compare October 11, 2017 11:11

oxidase approved these changes Oct 11, 2017

View reviewed changes

oxidase added Ready To Merge and removed Review - In feedback labels Oct 11, 2017

MoKob merged commit fd52c80 into master Oct 11, 2017

MoKob deleted the fix/suffix-detection branch October 11, 2017 12:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix detection of suffix/prefix changes for name-changes #4421

fix detection of suffix/prefix changes for name-changes #4421

MoKob commented Aug 17, 2017 •

edited

Loading

oxidase left a comment

oxidase Oct 5, 2017

MoKob Oct 11, 2017

oxidase Oct 5, 2017

oxidase Oct 5, 2017

oxidase Oct 5, 2017

MoKob Oct 11, 2017

fix detection of suffix/prefix changes for name-changes #4421

fix detection of suffix/prefix changes for name-changes #4421

Conversation

MoKob commented Aug 17, 2017 • edited Loading

Issue

Tasklist

oxidase left a comment

Choose a reason for hiding this comment

oxidase Oct 5, 2017

Choose a reason for hiding this comment

MoKob Oct 11, 2017

Choose a reason for hiding this comment

oxidase Oct 5, 2017

Choose a reason for hiding this comment

oxidase Oct 5, 2017

Choose a reason for hiding this comment

oxidase Oct 5, 2017

Choose a reason for hiding this comment

MoKob Oct 11, 2017

Choose a reason for hiding this comment

MoKob commented Aug 17, 2017 •

edited

Loading