FURB109 explanation is technically incorrect #44

henryiii · 2022-10-04T22:28:56Z

This:

Since tuples cannot change value over time, it is more performant to use
them in `for` loops, generators, etc.:

Is wrong. Besides the fact tuples are not usually more performant than lists, the expression in question actually produces the same byte code:

>>> import dist
>>> def f(x):
...     return x in [1, 2, 3]
>>> dis.dis(f)
  2           0 LOAD_FAST                0 (x)
              2 LOAD_CONST               1 ((1, 2, 3))
              4 CONTAINS_OP              0
              6 RETURN_VALUE
>>> def f(x):
...     return x in (1, 2, 3)
>>> dis.dis(f)
  2           0 LOAD_FAST                0 (x)
              2 LOAD_CONST               1 ((1, 2, 3))
              4 CONTAINS_OP              0
              6 RETURN_VALUE

In general, lists tend to be homogenous variable length "lists" and tuples heterogeneous data.
This actually is potentially faster and is logically the correct thing:

>>> def f(x):
...     return x in {1,2,3}
...
>>> dis.dis(f)
  2           0 LOAD_FAST                0 (x)
              2 LOAD_CONST               1 (frozenset({1, 2, 3}))
              4 CONTAINS_OP              0
              6 RETURN_VALUE

However, it's not equivalent - the items need to be hashable, and the comparison is not quite the same. Sets are also much slower to construct, but as long as they are inline they are directly loaded in byte code. This could be much faster to check, but that depends on the size, and inline is not likely to be that large.

The text was updated successfully, but these errors were encountered:

dosisod · 2022-10-05T02:58:14Z

I created this Godbolt link in case you or anyone else wants to play with this more.

Perhaps there are instances where the bytecode would differ, but from the few examples I tried the bytecode was the same. I think propping up this check as a "performance" boost is misleading, and should be changed. Perhaps making this a "consistency" check would be more beneficial.

If that is the case, users should be able to specify whether they would prefer () over [], though there currently is no ability to pass data to a check.

dosisod · 2022-10-05T03:03:48Z

And yes, sets/frozensets are potentially faster if you have many elements, but in my opinion they look "gross", and I seldom see any codebases using sets literals for in operations. Also, quantifying when a set should be used over a tuple/list will be hard, since you will probably not be able to determine the size of the set.

henryiii · 2022-10-05T03:45:55Z

I didn't know you could use godbolt on Python.

I'm not sure why they look gross, containership is a natural question for a set or a dict, while it's "hacked on" to a list / tuple as a convenience - it's just a looping check. I was strongly in favor of x in {...} until I realized they were not fully identical. Sometimes you want one over the other - there was a case (in awkward) where they give different results. I think it might be if you have custom __eq__? But don't remember exactly what was different.

Agree on consistency (and I'm not against the check - though picking one over the other is stylistic rather than performance).

dosisod · 2022-10-05T04:44:35Z

I think gross is too strong of a word. Perhaps my dislike comes from the fact that {"key", "value"} looks very similar to {"key": "value"}, or maybe I just haven't seen it enough in practice. I am totally not opposed to using sets with in operations, just the usage of the set literal itself feels unnatural to me.

dopplershift · 2022-11-09T03:53:43Z

I found this discussion fascinating and learned something from it. In fairness to refurb (and to everyone who wasn't actually aware of this behavior), it wasn't until Python 3.8 that this optimization was made. (And I can't even find it in the "What's New".)

So when running against code still supporting <=3.7, the tool is technically still correct. 😉

dosisod self-assigned this Oct 5, 2022

dosisod added the documentation Improvements or additions to documentation label Oct 5, 2022

dosisod mentioned this issue Oct 5, 2022

Cleanup the "in tuple" check #49

Merged

dosisod closed this as completed in #49 Oct 5, 2022

dosisod mentioned this issue Nov 8, 2022

Configuration option for FURB109 (in_tuple.py)? #100

Open

jamesbraza mentioned this issue Jul 25, 2023

Enhancement: Performance Rules #28

Open

dosisod mentioned this issue Aug 3, 2023

Improved docs #273

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FURB109 explanation is technically incorrect #44

FURB109 explanation is technically incorrect #44

henryiii commented Oct 4, 2022

dosisod commented Oct 5, 2022

dosisod commented Oct 5, 2022 •

edited

Loading

henryiii commented Oct 5, 2022 •

edited

Loading

dosisod commented Oct 5, 2022

dopplershift commented Nov 9, 2022

FURB109 explanation is technically incorrect #44

FURB109 explanation is technically incorrect #44

Comments

henryiii commented Oct 4, 2022

dosisod commented Oct 5, 2022

dosisod commented Oct 5, 2022 • edited Loading

henryiii commented Oct 5, 2022 • edited Loading

dosisod commented Oct 5, 2022

dopplershift commented Nov 9, 2022

dosisod commented Oct 5, 2022 •

edited

Loading

henryiii commented Oct 5, 2022 •

edited

Loading