Editorial: refactor `IsStringPrefix` and `String.prototype.split` to use `StringIndexOf`; remove `SplitMatch` #2144

shvaikalesh · 2020-08-19T16:08:38Z

This PR merges the IsStringPrefix and SplitMatch abstract operations into a single well-defined StringIncludesAt helper, addressing:

- the unobvious parameter order of IsStringPrefix;
- the overly specific name and polymorphic return type of SplitMatch.

We can't use StringIndexOf instead, since it searches until the end of the string rather than matching the substring only at the given index.
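To make the distinction concrete, here is a minimal TypeScript sketch. Both `stringIncludesAt` and `stringIndexOf` are hypothetical models written for this illustration, not spec text, and the bounds handling in `stringIncludesAt` is an assumption.

```typescript
// Hypothetical model of the proposed StringIncludesAt(string, searchValue, index):
// true only when searchValue occurs in str starting exactly at index.
// The bounds check is an assumption made for this sketch.
function stringIncludesAt(str: string, searchValue: string, index: number): boolean {
  if (index < 0 || index + searchValue.length > str.length) return false;
  for (let k = 0; k < searchValue.length; k++) {
    // Compare code unit by code unit.
    if (str.charCodeAt(index + k) !== searchValue.charCodeAt(k)) return false;
  }
  return true;
}

// Hypothetical model of StringIndexOf(string, searchValue, fromIndex):
// keeps scanning forward and reports the first match at or after fromIndex.
function stringIndexOf(str: string, searchValue: string, fromIndex: number): number {
  // Mirrors the spec note: always -1 when fromIndex is past the end of the string.
  if (fromIndex > str.length) return -1;
  return str.indexOf(searchValue, fromIndex);
}

const sample = "abcabc";
console.log(stringIncludesAt(sample, "bc", 1)); // true: "bc" starts exactly at index 1
console.log(stringIncludesAt(sample, "bc", 2)); // false: nothing starts at index 2
console.log(stringIndexOf(sample, "bc", 2));    // 4: the search continues past index 2
```

The last two calls show the difference described above: the at-index check fails immediately, while the index search goes on to find a later occurrence.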

michaelficarra · 2020-08-19T18:52:08Z

spec.html

@@ -5624,19 +5646,6 @@ <h1>IsRegExp ( _argument_ )</h1>
      </emu-alg>
    </emu-clause>

-    <emu-clause id="sec-isstringprefix" aoid="IsStringPrefix">
-      <h1>IsStringPrefix ( _p_, _q_ )</h1>


For readability, we could leave this with an implementation that just passes 0 as the third parameter to StringIncludesAt. Not sure if it's worth it.

Maybe it's just me, but the parameter order of IsStringPrefix isn't immediately clear: is the first parameter the string or the prefix?

I feel the same way about StringIncludesAt.

It would be great if we came up with a better name. StringMatch?

I don't really think the name will ever be sufficient. I will always need to take a quick look at the parameter names or the summary. Or infer it from context at the usage site.

ljharb · 2020-08-19T21:22:41Z

spec.html

+          1. If ! StringIncludesAt(_px_, _py_, 0) is *true*, return *false*.
+          1. If ! StringIncludesAt(_py_, _px_, 0) is *true*, return *true*.


i don't find this change clearer :-/

ljharb · 2020-08-19T21:23:07Z

spec.html

+            1. If ! StringIncludesAt(_S_, _R_, _q_) is *false*, then set _q_ to _q_ + 1.
            1. Else,


since the "else" is multiline, i think the "if" needs to be too.

The spec has about 37 examples of an 'inline' If followed by a 'multiline' Else. And about 18 examples of a multiline If followed by an inline Else.

Compared to how many examples are both multiline or both inline, those seem like cases we should make consistent.

ljharb · 2020-08-19T22:37:00Z

Discussed this on the editor call; let's please leave IsStringPrefix and its callsites untouched (but update its implementation to call into StringIncludesAt). I'm not personally convinced that StringIncludesAt is the best name but don't have a better suggestion at this time.

gibson042

I'm in favor of removing redundant operations, but not clear on why it's necessary to introduce a new one rather than using StringIndexOf. `IsStringPrefix(_possiblePrefix_, _string_)` is equivalent to `StringIndexOf(_string_, _possiblePrefix_, 0) is 0`, and String.prototype.split can be refactored as well:

1. Let _s_ be the length of _S_.
1. Let _separatorLength_ be the length of _R_.
1. If _separatorLength_ is 0, then
  1. Let _head_ be the substring of _S_ from 0 to _lim_.
  1. Let _codeUnits_ be a List consisting of the sequence of code units that are the elements of _head_.
  1. Return ! CreateArrayFromList(_codeUnits_).
1. If _s_ is 0, return ! CreateArrayFromList(&laquo; _S_ &raquo;).
1. Let _substrings_ be a new empty List.
1. Let _i_ be 0.
1. Let _j_ be StringIndexOf(_S_, _R_, 0).
1. Repeat, while _j_ is not -1,
  1. Let _T_ be the substring of _S_ from _i_ to _j_.
  1. Append _T_ to _substrings_.
  1. Set _lengthA_ to _lengthA_ + 1.
  1. If _lengthA_ = _lim_, return ! CreateArrayFromList(_substrings_).
  1. Set _i_ to _j_ + _separatorLength_.
  1. Set _j_ to StringIndexOf(_S_, _R_, _i_).
1. Let _T_ be the substring of _S_ from _i_.
1. Append _T_ to _substrings_.
1. Return ! CreateArrayFromList(_substrings_).
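The steps above, together with the IsStringPrefix equivalence, can be sketched in TypeScript as follows. This is an illustration under stated assumptions, not the spec algorithm: `isStringPrefix` and `splitByString` are hypothetical names, the built-in `indexOf` stands in for StringIndexOf, the `lim === 0` early return is assumed to come from an earlier step that the quoted fragment does not show, and `substrings.length` plays the role of `_lengthA_`.

```typescript
// Hypothetical model: IsStringPrefix(p, q) expressed through an index search from 0.
function isStringPrefix(possiblePrefix: string, str: string): boolean {
  return str.indexOf(possiblePrefix, 0) === 0;
}

// Hypothetical model of the string-separator path of String.prototype.split.
// S is the receiver, R the separator, lim the maximum number of results.
function splitByString(S: string, R: string, lim: number = 4294967295): string[] {
  // Assumption: an earlier step of the full algorithm returns an empty array for lim 0.
  if (lim === 0) return [];
  const separatorLength = R.length;
  if (separatorLength === 0) {
    // Empty separator: one element per code unit, capped at lim.
    const head = S.substring(0, lim);
    const codeUnits: string[] = [];
    for (let k = 0; k < head.length; k++) codeUnits.push(head.charAt(k));
    return codeUnits;
  }
  if (S.length === 0) return [S];
  const substrings: string[] = [];
  let i = 0;                // start of the current piece
  let j = S.indexOf(R, 0);  // position of the next separator, or -1
  while (j !== -1) {
    substrings.push(S.substring(i, j));
    if (substrings.length === lim) return substrings;
    i = j + separatorLength;
    j = S.indexOf(R, i);
  }
  substrings.push(S.substring(i)); // trailing piece after the last separator
  return substrings;
}

console.log(splitByString("a,b,,c", ","));    // [ 'a', 'b', '', 'c' ]
console.log(splitByString("a,b,,c", ",", 2)); // [ 'a', 'b' ]
console.log(isStringPrefix("ab", "abc"));     // true
```

The loop needs only two cursors, `i` and `j`, which matches the shape of the proposed steps.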

gibson042 · 2020-08-19T22:47:56Z

spec.html

@@ -1038,6 +1038,28 @@ <h1>Runtime Semantics: StringIndexOf ( _string_, _searchValue_, _fromIndex_ )</h
          <p>This algorithm always returns -1 if _fromIndex_ &gt; the length of _string_.</p>
        </emu-note>
      </emu-clause>
+
+      <emu-clause id="sec-stringincludesat" aoid="StringIncludesAt">


Suggested change

-      <emu-clause id="sec-stringincludesat" aoid="StringIncludesAt">
+      <emu-clause id="sec-stringincludesat" aoid="StringIncludesAt" oldids="IsStringPrefix,SplitMatch">

ljharb · 2021-10-07T04:51:48Z

I've rebased this; the first commit implements #2144 (comment); the second commit reverts that and implements #2144 (review), per one editor's preference on Matrix.

Happy to drop the second, or squash the two, as preferred.

The only unaccounted-for piece in the second commit is adding an oldid somewhere for SplitMatch - I'm not sure where it would go.

updated

bakkot

LGTM. The new version of split is much clearer. The old one is opaque enough that it's hard for me to be 100% certain they're identical, but I'm pretty confident they are.

…use `StringIndexOf`; remove `SplitMatch` (tc39#2144) Co-authored-by: Alexey Shvayka <[email protected]> Co-authored-by: Jordan Harband <[email protected]>

Renamed:
- `ExecuteModule` → `ExecuteAsyncModule`
- `Abstract Equality Comparison` → `IsLooselyEqual`
- `Abstract Relational Comparison` → `IsLessThan`
- `Strict Equality Comparison` → `IsStrictlyEqual`

Added:
- `CreateNonEnumerableDataPropertyOrThrow`
- `DefineField`
- `DefineMethodProperty`
- `GatherAvailableAncestors`
- `IfAbruptCloseIterator`
- `InitializeInstanceElements`
- `InstallErrorCause`
- `IsPrivateReference`
- `IsStringWellFormedUnicode`
- `MakePrivateReference`
- `NewPrivateEnvironment`
- `PrivateElementFind`
- `PrivateFieldAdd`
- `PrivateGet`
- `PrivateMethodOrAccessorAdd`
- `PrivateSet`
- `RegExpHasFlag`
- `ResolvePrivateIdentifier`
- `RoundMVResult`
- `SortIndexedProperties`
- `StringToNumber`
- `TypedArrayElementSize`
- `TypedArrayElementType`

Removed:
- `InitializeEnvironment`
- `SplitMatch` removed; `IsStringPrefix` refactored: tc39/ecma262#2144

michaelficarra reviewed Aug 19, 2020

michaelficarra reviewed Aug 19, 2020

michaelficarra approved these changes Aug 19, 2020

shvaikalesh force-pushed the string-includes-at branch 2 times, most recently from dfdd25c to 8a80eee August 19, 2020 20:22

ljharb previously requested changes Aug 19, 2020

ljharb added the `editor call` and `editorial change` labels Aug 19, 2020

ljharb removed the `editor call` label Aug 19, 2020

gibson042 reviewed Aug 19, 2020

ljharb force-pushed the master branch 3 times, most recently from 3d0c24c to 7a79833 June 29, 2021 02:21

jmdyck mentioned this pull request Jul 16, 2021

Editorial: Set [[Construct]] for default constructor in ClassDefiniti… #2459

Merged

ljharb added a commit to shvaikalesh/ecma262 that referenced this pull request Oct 7, 2021

squash: Editorial: refactor IsStringPrefix and `String.prototype.sp…

8aec5d4

…lit` to use `StringIndexOf`; remove `SplitMatch` (tc39#2144)

ljharb force-pushed the string-includes-at branch from 8a80eee to 8aec5d4 October 7, 2021 04:50

ljharb requested review from syg, bakkot and a team October 7, 2021 04:52

ljharb added a commit to shvaikalesh/ecma262 that referenced this pull request Oct 7, 2021

squash: Editorial: refactor IsStringPrefix and `String.prototype.sp…

166ef10

…lit` to use `StringIndexOf`; remove `SplitMatch` (tc39#2144)

ljharb force-pushed the string-includes-at branch from 8aec5d4 to 166ef10 October 7, 2021 04:53

bakkot reviewed Oct 15, 2021

ljharb added a commit to shvaikalesh/ecma262 that referenced this pull request Oct 15, 2021

squash: Editorial: refactor IsStringPrefix and `String.prototype.sp…

e7f942b

…lit` to use `StringIndexOf`; remove `SplitMatch` (tc39#2144)

ljharb force-pushed the string-includes-at branch from 166ef10 to 5d7fa09 October 15, 2021 06:42

bakkot reviewed Oct 15, 2021

bakkot reviewed Oct 15, 2021

bakkot approved these changes Oct 15, 2021

bakkot reviewed Oct 15, 2021

michaelficarra approved these changes Oct 15, 2021

michaelficarra added the `ready to merge` label Oct 15, 2021

Editorial: refactor IsStringPrefix and String.prototype.split to …

82438c3

…use `StringIndexOf`; remove `SplitMatch` (tc39#2144) Co-authored-by: Alexey Shvayka <[email protected]> Co-authored-by: Jordan Harband <[email protected]>

ljharb force-pushed the string-includes-at branch from a0aa602 to 82438c3 October 15, 2021 17:48

ljharb changed the title ~~Editorial: Merge IsStringPrefix and SplitMatch abstract ops~~ Editorial: refactor IsStringPrefix and String.prototype.split to use StringIndexOf; remove SplitMatch Oct 15, 2021

ljharb merged commit 82438c3 into tc39:master Oct 15, 2021

shvaikalesh commented Aug 19, 2020

michaelficarra Aug 19, 2020

shvaikalesh Aug 19, 2020

michaelficarra Aug 19, 2020

shvaikalesh Aug 19, 2020

michaelficarra Aug 19, 2020

ljharb Aug 19, 2020

ljharb Aug 19, 2020

jmdyck Aug 20, 2020 (edited)

ljharb Aug 20, 2020

ljharb commented Aug 19, 2020

gibson042 left a comment

gibson042 Aug 19, 2020

ljharb commented Oct 7, 2021

bakkot left a comment
