Functions/DynamicCalls: various bug fixes and improvements #592

jrfnl · 2020-10-23T20:03:07Z

This PR is the result of a review of the WordPressVIPMinimum.Functions.DynamicCalls sniff.

It will be easiest to understand the different changes and their impact by reviewing this PR commit by commit.

Functions/DynamicCalls: annotate the unit tests

... to show which ones should fail and which should pass.

Functions/DynamicCalls: remove redundant conditions [1]

This sniff only listens to `T_VARIABLE` tokens, so checking that what was received is a `T_VARIABLE` is redundant.

Functions/DynamicCalls: remove redundant conditions [2]

The condition above checks if a token is `T_EQUAL` and bows out if it is not.
The only code matching on `T_EQUAL` is the `=` operator, which will always have a `length` of `1`.

Functions/DynamicCalls: improve code readability

* Cutting code lines off at 50 chars is maybe taking it a little too far, especially as it makes assignments hard to read.
* Use proper tags in docblocks.
* Improve (fix) documentation (and move it to the right place).

Functions/DynamicCalls: minor simplification

Join two conditions which both return anyway.

Functions/DynamicCalls: bug fix - ignore comments [1]

Skip over both whitespace and comments. This reduces false negatives.

Includes unit test.

Functions/DynamicCalls: bug fix - don't blindly use the next text string

A variable value may be built up from multiple tokens.

As it was, the sniff would look for the first text string token after the equal sign within the variable assignment statement, but this disregards that:

1. The text string token found may not be the only token in the statement.
2. A statement can end on a PHP close tag (possibly a bug in PHPCS itself, but that's another matter), which would lead the sniff to look at the next statement for text strings.

Fixed now.

Includes unit tests, the first four of which resulted in false positives previously.
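
To illustrate the stricter check, here is a minimal sketch. It uses PHP's native `token_get_all()` rather than the PHPCS token stream the sniff actually works with, and the helper name `plain_string_assignment_value()` is hypothetical. It only accepts an assignment when a single plain text string sits between the `=` and the end of the statement, and it treats a PHP close tag as the end of the statement.

```php
<?php
/**
 * Hypothetical helper: return the raw string token when the statement is
 * `$var = 'text';` with exactly one plain text string as its value,
 * or null for anything else.
 */
function plain_string_assignment_value( string $source ): ?string {
	$tokens     = token_get_all( '<?php ' . $source );
	$empty      = [ T_OPEN_TAG, T_WHITESPACE, T_COMMENT, T_DOC_COMMENT ];
	$seen_equal = false;
	$value      = [];

	foreach ( $tokens as $token ) {
		$type = is_array( $token ) ? $token[0] : $token;

		if ( in_array( $type, $empty, true ) ) {
			continue; // Whitespace and comments never count as part of the value.
		}

		if ( ! $seen_equal ) {
			// Ignore everything up to and including the assignment operator.
			$seen_equal = ( '=' === $type );
			continue;
		}

		if ( ';' === $type || T_CLOSE_TAG === $type ) {
			break; // End of statement: do not look into the next statement.
		}

		$value[] = $token;
	}

	// Only a lone plain text string qualifies.
	if ( 1 === count( $value )
		&& is_array( $value[0] )
		&& T_CONSTANT_ENCAPSED_STRING === $value[0][0]
	) {
		return $value[0][1];
	}

	return null;
}

var_dump( plain_string_assignment_value( "\$f = 'extract';" ) );           // Accepted.
var_dump( plain_string_assignment_value( "\$f = 'extract' . \$suffix;" ) ); // Rejected: more than one token.
var_dump( plain_string_assignment_value( "\$f = \$other ?><?php 'extract';" ) ); // Rejected: string is in the next statement.
```

The second and third calls correspond to the two cases listed above: a value built from multiple tokens, and a statement ended by a close tag.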

Functions/DynamicCalls: bug fix - allow for double quotes

Text strings can use either single quotes or double quotes.

When a text string contains an interpolated variable, it is tokenized as `T_DOUBLE_QUOTED_STRING`. A plain double-quoted text string, however, is tokenized as `T_CONSTANT_ENCAPSED_STRING`, the same as a single-quoted text string.

The sniff did not take this into account, leading to false negatives.

The sniff would also strip quotes from within a text string, such as `'my\'text'`. This did not cause a problem for this sniff, as function names cannot have backslashes in them, but it was still wrong.

Fixed now by using the WPCS `strip_quotes()` method.

Includes unit test which would fail previously.
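
The difference between the two ways of stripping quotes can be sketched as follows. Both helper functions are hypothetical: `strip_outer_quotes()` only approximates what a method like WPCS' `strip_quotes()` is described as doing here, and `strip_all_quotes()` is an assumed stand-in for the old behaviour, not the sniff's actual former code. The token check uses PHP's native `token_get_all()`; `T_DOUBLE_QUOTED_STRING` is a PHPCS token and is not shown.

```php
<?php
/**
 * Hypothetical helper: remove one pair of matching outer quotes only.
 */
function strip_outer_quotes( string $text ): string {
	return preg_replace( '/^([\'"])(.*)\1$/s', '$2', $text );
}

/**
 * Assumed naive approach, for contrast: remove every quote character.
 */
function strip_all_quotes( string $text ): string {
	return str_replace( [ "'", '"' ], '', $text );
}

// The raw token content for the PHP source text 'my\'text'.
$raw = "'my\\'text'";

var_dump( strip_outer_quotes( $raw ) ); // Inner escaped quote is kept.
var_dump( strip_all_quotes( $raw ) );   // Inner quote is lost as well.

// A plain double-quoted string gets the same token as a single-quoted one.
$tokens = token_get_all( '<?php "parse_str";' );
var_dump( token_name( $tokens[1][0] ) );
var_dump( strip_outer_quotes( $tokens[1][1] ) );
```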

Functions/DynamicCalls: bug fix - fix memory + performance issue

The sniff maintains a cache of all the variables it has seen and their assigned value.

When a variable was encountered, the sniff would:

1. Check if it was a (plain text) assignment and if so, register the variable name + value to the cache.
2. Next, call the `find_dynamic_calls()` method, which first checks if any variables have been registered to the cache before doing anything.
3. And then check for dynamic function calls and, if one is found, check if the variable used is one registered in the cache with a value we are looking for.

This is highly inefficient as text string variable assignments are common and, as it was, every single one would be added to the cache.

With a large code base, that means that the cache could grow pretty large.

It also means that the logic to determine if something is a dynamic function call would be executed even when there would be no text strings registered in the cache which could match any of the ones we're looking for.

By changing the order of the logic, the memory leak and the performance inefficiency are removed.

With the updated logic, the sniff will:

1. Check if it was a (plain text) assignment **and if the text string matches one we're looking for** and if so, register the variable name + value to the cache.
2. Next, call the `find_dynamic_calls()` method, which first checks if any variables have been registered to the cache before doing anything.
3. And then check for dynamic function calls.

This means that if none of the previous assignments encountered matches any of the target text strings (~ 99% of the time), this sniff will bow out at step 2 before executing the logic that checks whether a variable is used in a dynamic function call.
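
The reordered logic can be modelled with a small stand-alone class. This is a simplified sketch, not the sniff itself: the class and method names are hypothetical, it takes variable names and values directly instead of walking PHPCS tokens, and the three function names are only the ones visible in this PR's diff, so the real list may be longer.

```php
<?php
final class DynamicCallCacheSketch {

	/** Function names of interest (partial list, taken from the diff in this PR). */
	private $function_names = [ 'get_defined_vars', 'mb_parse_str', 'parse_str' ];

	/** Cache of variable name => assigned function name. */
	private $variables = [];

	/** Step 1: only cache assignments whose value is a function name we care about. */
	public function collect_variable( string $name, string $value ): void {
		if ( in_array( $value, $this->function_names, true ) ) {
			$this->variables[ $name ] = $value;
		}
	}

	/** Steps 2 and 3: bow out early when the cache is empty, then check the call. */
	public function is_flagged_call( string $name ): bool {
		if ( empty( $this->variables ) ) {
			return false;
		}

		return isset( $this->variables[ $name ] );
	}

	public function cache_size(): int {
		return count( $this->variables );
	}
}

$sketch = new DynamicCallCacheSketch();
$sketch->collect_variable( '$title', 'Hello world' ); // Common case: not cached.
$sketch->collect_variable( '$fn', 'parse_str' );      // Target function name: cached.

var_dump( $sketch->cache_size() );
var_dump( $sketch->is_flagged_call( '$fn' ) );
```

Because unrelated text strings never enter the cache, its size is bounded by the number of assignments that actually name a target function.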

Functions/DynamicCalls: rename private property

Rename the `private` `$blacklisted_functions` property to `$function_names` to get rid of the use of a non-inclusive term.

Loosely related to #492

Functions/DynamicCalls: bug fix - ignore comments [2]

Skip over both whitespace and comments, and take live coding into account. This reduces false negatives and fixes issue #590.

Includes unit tests.

Fixes #590
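
As a rough illustration of skipping comments while guarding against unfinished code, the sketch below uses PHP's native `token_get_all()`. The helper names `next_non_empty()` and `is_dynamic_call()` are hypothetical and the check is much simpler than the sniff's own. The point is that the lookup returns `null` when the file ends before a meaningful token is found, so the token array is never read at an index that does not exist.

```php
<?php
/**
 * Hypothetical helper: index of the next token that is neither whitespace
 * nor a comment, or null when the end of the file is reached first.
 */
function next_non_empty( array $tokens, int $start ): ?int {
	$empty = [ T_WHITESPACE, T_COMMENT, T_DOC_COMMENT ];

	for ( $i = $start, $count = count( $tokens ); $i < $count; $i++ ) {
		if ( is_array( $tokens[ $i ] ) && in_array( $tokens[ $i ][0], $empty, true ) ) {
			continue;
		}

		return $i;
	}

	return null; // Live coding: nothing follows yet.
}

/**
 * Hypothetical check: does the source start with `$variable (`,
 * ignoring whitespace and comments in between?
 */
function is_dynamic_call( string $source ): bool {
	$tokens = token_get_all( '<?php ' . $source );

	// Token 0 is the open tag; token 1 is expected to be the variable.
	if ( ! isset( $tokens[1] ) || ! is_array( $tokens[1] ) || T_VARIABLE !== $tokens[1][0] ) {
		return false;
	}

	$next = next_non_empty( $tokens, 2 );

	return null !== $next && '(' === $tokens[ $next ];
}

var_dump( is_dynamic_call( '$func /* why */ ( $args );' ) ); // Comment is skipped.
var_dump( is_dynamic_call( '$func // unfinished' ) );        // File ends: no error, no match.
```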

Functions/DynamicCalls: error message tweak

GaryJones · 2020-10-25T09:51:59Z

WordPressVIPMinimum/Sniffs/Functions/DynamicCallsSniff.php

@@ -120,10 +120,6 @@ private function collect_variables() {
 			return;
 		}

-		if ( $this->tokens[ $t_item_key ]['length'] !== 1 ) {


Just curious - what tokens are == and ===?

== => T_IS_EQUAL

=== => T_IS_IDENTICAL

Ref: https://www.php.net/manual/en/tokens.php
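
For anyone wanting to verify this locally, a quick check with PHP's native tokenizer could look like the sketch below. Note that `T_EQUAL` is a PHPCS token; the native tokenizer returns a lone `=` as a plain one-character string, so only the comparison operators are looked up here.

```php
<?php
$tokens = token_get_all( '<?php $a == $b; $a === $b;' );
$names  = [];

foreach ( $tokens as $token ) {
	// Native tokens are arrays of [ id, text, line ]; single characters are plain strings.
	if ( is_array( $token ) && in_array( $token[0], [ T_IS_EQUAL, T_IS_IDENTICAL ], true ) ) {
		$names[ $token[1] ] = token_name( $token[0] );
	}
}

print_r( $names );
```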

GaryJones

Love the performance fix amongst the general fixes!

jrfnl · 2020-10-25T10:19:45Z

Please also see my general review comment about this sniff which I've left in the review ticket: #517 (comment)

rebeccahum · 2020-11-23T19:32:15Z

WordPressVIPMinimum/Sniffs/Functions/DynamicCallsSniff.php

-		'get_defined_vars',
-		'mb_parse_str',
-		'parse_str',
+	private $function_names = [


Thank you for re-naming this!

jrfnl added 12 commits October 23, 2020 20:26

Functions/DynamicCalls: annotate the unit tests (af82f36)

Functions/DynamicCalls: remove redundant conditions [1] (d8f958d)

Functions/DynamicCalls: remove redundant conditions [2] (0f55b11)

Functions/DynamicCalls: improve code readability (db13572)

Functions/DynamicCalls: minor simplification (b6ebf86)

Functions/DynamicCalls: bug fix - ignore comments [1] (df8c596)

Functions/DynamicCalls: rename private property (6b4a43d)

Functions/DynamicCalls: bug fix - ignore comments [2] (3118358)

Functions/DynamicCalls: error message tweak (1d1a977)

jrfnl added Type: Bug Type: Enhancement Standard: VIPMinimum labels Oct 23, 2020

jrfnl added this to the 2.3.0 milestone Oct 23, 2020

jrfnl requested a review from a team as a code owner October 23, 2020 20:03

This was referenced Oct 23, 2020

Review the WordPressVIPMinimum.Functions.DynamicCalls sniff #517

Open

Undefined offset: 42509 in /opt/composer/vendor/automattic/vipwpcs/WordPressVIPMinimum/Sniffs/Functions/DynamicCallsSniff.php on line 218 #590

Closed

GaryJones reviewed Oct 25, 2020

GaryJones approved these changes Oct 25, 2020

rebeccahum reviewed Nov 23, 2020

rebeccahum merged commit 387d448 into develop Nov 23, 2020

rebeccahum deleted the fix/590-prevent-undefined-offset-notice branch November 23, 2020 19:33

rebeccahum mentioned this pull request Nov 23, 2020

Functions/DynamicCalls: Remove the word "blacklisted" and use "disallowed" #596

Merged

jrfnl mentioned this pull request Apr 16, 2021

Update changelog to reflect 2.3.0 #662

Merged
