[red-knot] Resolve symbols from `builtins.pyi` in the stdlib if they cannot be found in other scopes #12390

AlexWaygood · 2024-07-18T19:23:52Z

Summary

This PR means that red-knot is now able to understand builtin symbols -- resolving them to symbols in a builtins.pyi stub file (either in a custom typeshed directory, if one was supplied, or to the vendored stubs we ship as part of the binary).

The first commit here moves some code around in the module resolver and adds a new public function exported by the module resolver, resolve_builtins. This is a thin wrapper around a new Salsa query, resolve_builtins_query. The query short-circuits most of the module resolution logic we do for other Python modules, because this is what Python does at runtime: builtin symbols are (nearly) always resolved to the builtins module shipped as part of the interpreter, even if a builtins.py file exists in the first-party workspace.
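The runtime behaviour described above can be observed directly in CPython. The following is a minimal sketch, not code from this PR: the temporary `builtins.py` file and the helper name `builtins_survives_shadowing` are made up for illustration. Note that `importlib.import_module` is served from the `sys.modules` cache here, since `builtins` is loaded at interpreter startup; the `BuiltinImporter` lookup shows which finder owns the name.

```python
import builtins
import importlib
import importlib.machinery
import os
import sys
import tempfile


def builtins_survives_shadowing() -> bool:
    """Return True if `builtins` still resolves to the interpreter's module
    even when a first-party builtins.py sits first on sys.path."""
    with tempfile.TemporaryDirectory() as workspace:
        # Hypothetical first-party file that tries to shadow the module.
        with open(os.path.join(workspace, "builtins.py"), "w") as f:
            f.write("len = 'shadowed'\n")
        sys.path.insert(0, workspace)
        try:
            resolved = importlib.import_module("builtins")
        finally:
            sys.path.remove(workspace)
    return resolved is builtins and resolved.len is len


# The built-in importer recognises the name; in CPython's default
# sys.meta_path it is consulted before the path-based finder.
spec = importlib.machinery.BuiltinImporter.find_spec("builtins")
```

Calling `builtins_survives_shadowing()` should return `True`, and `spec.origin` should be `"built-in"`, which is the behaviour the new query mirrors by skipping the first-party search paths.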

The second commit uses this new query exposed by the module resolver to obtain the builtins scope, and uses the builtins scope to resolve builtin symbols and infer the types of those symbols.
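As a rough illustration of the fallback described above, here is a minimal sketch of a scope-chain lookup that consults the builtins scope only when no enclosing scope defines the name. The `Scope` class and `resolve` function are hypothetical and simplified (real Python scoping, for example, skips class scopes when resolving from nested functions); they are not the types used in red-knot.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Scope:
    """Hypothetical scope: a symbol table plus an optional enclosing scope."""
    symbols: dict[str, str] = field(default_factory=dict)
    parent: Optional["Scope"] = None


def resolve(name: str, scope: Scope, builtins_scope: Scope) -> Optional[str]:
    """Walk the enclosing scopes; fall back to builtins only if nothing matched."""
    current: Optional[Scope] = scope
    while current is not None:
        if name in current.symbols:
            return current.symbols[name]
        current = current.parent
    return builtins_scope.symbols.get(name)


# Illustrative scopes: a module that redefines `len`, and a function inside it.
builtins_scope = Scope({"str": "builtins.str", "len": "builtins.len"})
module_scope = Scope({"len": "mymodule.len"})
function_scope = Scope({"x": "local x"}, parent=module_scope)
```

With these scopes, `resolve("str", function_scope, builtins_scope)` reaches the builtins fallback, while `resolve("len", function_scope, builtins_scope)` stops at the module-level definition.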

Test Plan

New tests have been added to red_knot_module_resolver and red_knot_python_semantic.

Co-authored-by: Carl Meyer [email protected]

AlexWaygood · 2024-07-18T19:34:11Z

Haha, that's fun. This makes the red-knot benchmarks crash. I believe that's because the benchmarks just use the vendored typeshed stubs, and the benchmark code uses str as a return annotation in one function, and the definition of str in typeshed's builtins.pyi file is

class str(Sequence[str]):
    ...

which we obviously can't cope with right now, so we panic 😆
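The thread does not say exactly where the panic comes from. One plausible reading is that the definition is self-referential: the base-class expression of `str` mentions `str` itself. The toy model below is an assumption-laden sketch of that problem, not red-knot's inference code; `DEFINITIONS`, `infer_naive` and `infer_with_cycle_guard` are invented names.

```python
# Toy model: each class name maps to the names its base-class expression mentions.
DEFINITIONS = {
    "object": [],
    "Sequence": ["object"],
    "str": ["Sequence", "str"],  # class str(Sequence[str]): ...
}


def infer_naive(name: str) -> str:
    """Infer a class by first inferring everything its bases mention."""
    for dep in DEFINITIONS[name]:
        infer_naive(dep)
    return f"class {name}"


def infer_with_cycle_guard(name: str, in_progress: frozenset = frozenset()) -> str:
    """Same walk, but stop descending when a name is already being inferred."""
    if name in in_progress:
        return f"<cycle: {name}>"
    for dep in DEFINITIONS[name]:
        infer_with_cycle_guard(dep, in_progress | {name})
    return f"class {name}"
```

In this sketch `infer_naive("str")` recurses until Python raises `RecursionError`, whereas `infer_naive("object")` and `infer_with_cycle_guard("str")` both terminate. That matches the observation later in the thread that `object` is simple enough to resolve today.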

github-actions · 2024-07-18T19:37:04Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

crates/ruff_benchmark/benches/red_knot.rs

codspeed-hq · 2024-07-18T19:52:01Z

CodSpeed Performance Report

Merging #12390 will degrade performances by 97.48%

Comparing builtins-resolution (0f0f5b2) with main (1c7b840)

Summary

❌ 3 (👁 3) regressions
✅ 30 untouched benchmarks

Benchmarks breakdown

| | Benchmark | `main` | `builtins-resolution` | Change |
|---|---|---|---|---|
| 👁 | `red_knot_check_file[cold]` | 347.1 µs | 13,754.9 µs | -97.48% |
| 👁 | `red_knot_check_file[incremental]` | 211.3 µs | 423.7 µs | -50.12% |
| 👁 | `red_knot_check_file[without_parse]` | 260.2 µs | 6,359.1 µs | -95.91% |
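The "Change" column appears to be computed as `main / branch - 1`. That formula is inferred from the reported numbers, not taken from CodSpeed's documentation. A small sketch that recomputes the column from the rounded timings (the function name `codspeed_change` is made up):

```python
def codspeed_change(main_us: float, branch_us: float) -> float:
    """Relative change in percent, assuming change = main / branch - 1."""
    return (main_us / branch_us - 1.0) * 100.0


# Timings in microseconds, copied from the report: (main, builtins-resolution).
rows = {
    "cold": (347.1, 13_754.9),
    "incremental": (211.3, 423.7),
    "without_parse": (260.2, 6_359.1),
}
changes = {name: codspeed_change(*pair) for name, pair in rows.items()}
```

The recomputed values agree with the report to within about 0.01 percentage points; the small gap on the incremental row is presumably because the report works from unrounded timings. Read as a slowdown factor, the cold benchmark runs roughly 40 times slower on the branch.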

carljm

Looks great!

crates/red_knot_python_semantic/src/types.rs

crates/ruff_benchmark/benches/red_knot.rs

AlexWaygood · 2024-07-18T19:57:55Z

> Merging #12390 will degrade performances by 98.09%

Sort-of expected, I think... not sure there's any way around that; this is just what we have to do, I think!

carljm · 2024-07-18T19:58:13Z

Wait, are those super-high regression numbers on the benchmarks because it was failing, or are those from after the benchmark fix?

crates/red_knot_python_semantic/src/types/infer.rs

MichaReiser

It would be helpful for reviews if the summary could explain some of the newly introduced concepts.

We should also analyze the performance regression. The increase seems too big for the few builtins that we resolve.

crates/red_knot_module_resolver/src/resolver.rs

crates/red_knot_python_semantic/src/builtins.rs

MichaReiser · 2024-07-18T20:08:23Z

I think we have to update our benchmarks first, for example by pre-parsing builtins

MichaReiser · 2024-07-18T20:18:01Z

This is neat! And awesome how few changes were required

#12390 adds support for resolving types to classes in typeshed's `builtins.pyi` stub file. This causes red-knot to crash when attempting to execute this benchmark, as the `str` definition in typeshed is too complex for us to handle right now. `object` is a simpler type definition which we can resolve symbols to without crashing.

crates/red_knot_python_semantic/src/builtins.rs

carljm · 2024-07-18T20:30:07Z

> It would be helpful for reviews if the summary could explain some of the newly introduced concepts.

Definitely agree, but it's not obvious to me which concepts in the PR were not well explained in the summary?

MichaReiser · 2024-07-18T21:01:19Z

> > It would be helpful for reviews if the summary could explain some of the newly introduced concepts.
>
> Definitely agree, but it's not obvious to me which concepts in the PR were not well explained in the summary?

I'm mainly interested in understanding newly introduced salsa queries and how they relate

carljm · 2024-07-18T21:02:21Z

> I think we have to update our benchmarks first, for example by pre-parsing builtins

There's a cycle problem here, because we kind of need the builtins_module function added in this PR in order to add pre-parsing of builtins to the benchmarks. I guess I can split builtins_module out into its own PR, along with the change to the "without_parse" benchmark.

AlexWaygood · 2024-07-18T21:06:20Z

> I think we have to update our benchmarks first, for example by pre-parsing builtins

This would mean that our benchmarks wouldn't catch any performance regressions caused by upstream changes to typeshed's stubs for builtins. Mypy has had several of these in the past; I think it would be very useful if our benchmarks automatically flagged any performance degradations as part of the CI run for an automated PR syncing our vendored typeshed stubs.

MichaReiser · 2024-07-18T21:08:18Z

> > I think we have to update our benchmarks first, for example by pre-parsing builtins
>
> This would mean that our benchmarks wouldn't catch any performance regressions caused by upstream changes to typeshed's stubs for builtins. Mypy has had several of these in the past; I think it would be very useful if our benchmarks automatically flagged any performance degradations as part of the CI run for an automated PR syncing our vendored typeshed stubs.

I think we want more benchmarks. The ones we have today are intentionally narrow in scope so that they're very sensitive to overhead in the type inference machinery

AlexWaygood · 2024-07-18T21:11:29Z

> I think we want more benchmarks. The ones we have today are intentionally narrow in scope so that they're very sensitive to overhead in the type inference machinery

That makes sense. In that case, my instinct would be to update the benchmarks to use a custom typeshed directory with a minimal builtins stub, rather than using the vendored typeshed builtins stub.

carljm · 2024-07-18T21:15:18Z

I think the without_parse red-knot benchmark should exclude parsing builtins also, and the other benchmarks can be left as-is (the "incremental" one should automatically exclude parsing builtins, since builtins won't have changed, and the "cold" one should include parsing builtins.) I'm already working on a PR for this.

I don't think we should use a fake builtins for the benchmarks.

carljm · 2024-07-18T21:34:20Z

> There's a cycle problem here

Also this was wrong, it's easy enough to just create a VendoredPath and call vendored_path_to_file in the benchmark, we don't need to pull in anything from this PR.

carljm · 2024-07-18T21:35:16Z

#12395 adds builtins pre-parsing to the without-parse benchmark, and #12396 fixes what looks like reversed naming of benchmarks. This PR is rebased on both.

carljm · 2024-07-18T23:46:22Z

Hmm, it doesn't seem like the fixes I made to avoid redundant globals/builtins queries made a big dent in the perf regression here, so something I don't understand is still going on. Trying to dig into the CodSpeed data to understand what it could be.

carljm · 2024-07-19T00:29:30Z

Ok, after poring over the CodSpeed flame graphs for a while, my conclusion is that in the non-incremental benchmarks (cold and without_parse) the main difference is that we are now paying for semantic indexing of a very large file (builtins.pyi) which is many orders of magnitude larger than any file the benchmark previously touched, and in the incremental benchmark we pay for deep validation of that semantic index and the other ingredients depending on it.
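To make the cold versus incremental distinction concrete, here is a heavily simplified sketch of a memoized "semantic index" query. It is not Salsa and not red-knot: `ToyDb`, its fields, and the line-count stand-in for indexing cost are all assumptions chosen for illustration, and real validation walks a dependency graph rather than comparing a single revision number.

```python
class ToyDb:
    """Hypothetical, simplified stand-in for an incremental query database."""

    def __init__(self, files: dict[str, str]) -> None:
        self.files = dict(files)
        self.revision = {path: 0 for path in files}  # bumped on every edit
        self.cache: dict[str, tuple[int, int]] = {}  # path -> (revision, index)
        self.index_runs: list[str] = []              # files that were (re)indexed
        self.validations: list[str] = []             # cache entries that were re-checked

    def set_file(self, path: str, text: str) -> None:
        self.files[path] = text
        self.revision[path] = self.revision.get(path, 0) + 1

    def semantic_index(self, path: str) -> int:
        cached = self.cache.get(path)
        if cached is not None:
            self.validations.append(path)  # validation is paid even on a hit
            if cached[0] == self.revision[path]:
                return cached[1]
        self.index_runs.append(path)
        index = len(self.files[path].splitlines())  # stand-in: cost grows with file size
        self.cache[path] = (self.revision[path], index)
        return index

    def check_file(self, path: str) -> int:
        # Checking any file now also needs the index of builtins.pyi.
        return self.semantic_index(path) + self.semantic_index("builtins.pyi")


db = ToyDb({"app.py": "x: str = 'a'\n", "builtins.pyi": "class object: ...\n" * 5000})
db.check_file("app.py")                      # cold run: both files are indexed
cold_runs = list(db.index_runs)

db.set_file("app.py", "x: str = 'b'\n")      # edit only the small file
db.index_runs.clear()
db.validations.clear()
db.check_file("app.py")                      # incremental run
incremental_runs = list(db.index_runs)
incremental_validations = list(db.validations)
```

In the cold run the large stub is indexed in full even though only one name from it is used. In the incremental run only `app.py` is re-indexed, but the cached entry for `builtins.pyi` still has to be validated, which is the extra cost the incremental benchmark picks up.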

I also pored over the traces from locally linting the same files that the benchmark runs on, and I didn't see any issues in the traces: it looked to me like we are doing the work we expect to do.

One way we could potentially reduce this cost would be to semantic-index by scope instead of by file? But this might be over-indexing on the current example, where we use very little of a large file; in real-world large projects I expect the proportional cost of semantic indexing for stuff we don't use would be much, much lower.

At this point I am open to further exploration, but my inclination based on what I've seen is that this regression is accurate based on adding semantic indexing of a much larger file, and we should merge it and keep paying attention to the benchmarks as we go; once we are able to check a much larger real-world program, we should take a careful look at where the bottlenecks are.

crates/red_knot_python_semantic/src/builtins.rs

Co-authored-by: Carl Meyer <[email protected]>

crates/red_knot_module_resolver/src/resolver.rs

AlexWaygood · 2024-07-19T16:28:59Z

(It doesn't look like I have permissions to acknowledge the perf regression on CodSpeed. Somebody else might have to do that for me -- or give me permission to do so ;)

carljm

Looks good! I think we should also add a test that first-party builtins.py doesn't override the builtin one.

AlexWaygood · 2024-07-19T16:31:53Z

> Looks good! I think we should also add a test that first-party builtins.py doesn't override the builtin one.

I added that test in the latest push ;)

AlexWaygood added the red-knot Multi-file analysis & type inference label Jul 18, 2024

AlexWaygood requested review from carljm and MichaReiser as code owners July 18, 2024 19:23

AlexWaygood commented Jul 18, 2024

crates/ruff_benchmark/benches/red_knot.rs (outdated, resolved)

carljm approved these changes Jul 18, 2024

crates/red_knot_python_semantic/src/types.rs (outdated, resolved)

crates/ruff_benchmark/benches/red_knot.rs (outdated, resolved)

carljm reviewed Jul 18, 2024

crates/red_knot_python_semantic/src/types/infer.rs (outdated, resolved)

crates/red_knot_python_semantic/src/types/infer.rs (outdated, resolved)

MichaReiser reviewed Jul 18, 2024

crates/red_knot_module_resolver/src/resolver.rs (outdated, resolved)

crates/red_knot_python_semantic/src/builtins.rs (resolved)

AlexWaygood mentioned this pull request Jul 18, 2024

[red-knot] Remove use of str from benchmarks #12392

Closed

MichaReiser reviewed Jul 18, 2024

crates/red_knot_python_semantic/src/builtins.rs (resolved)

carljm force-pushed the builtins-resolution branch from 3ca68a0 to 0357af7 Compare July 18, 2024 20:24

carljm changed the base branch from main to cjm/simplify-benchmark July 18, 2024 20:25

Base automatically changed from cjm/simplify-benchmark to main July 18, 2024 21:04

carljm mentioned this pull request Jul 18, 2024

[red-knot] preparse builtins in without_parse benchmark #12395

Merged

carljm force-pushed the builtins-resolution branch from 68775e1 to bc8aa77 Compare July 18, 2024 21:34

carljm changed the base branch from main to cjm/fix-benchmark-naming July 18, 2024 21:40

carljm force-pushed the cjm/fix-benchmark-naming branch from 5600b49 to d901769 Compare July 18, 2024 21:45

carljm force-pushed the builtins-resolution branch from bc8aa77 to 4239eb2 Compare July 18, 2024 21:46

carljm changed the base branch from cjm/fix-benchmark-naming to cjm/fix-incremental-bench July 19, 2024 01:09

carljm force-pushed the builtins-resolution branch from 353c486 to 6df0ac3 Compare July 19, 2024 01:09

MichaReiser approved these changes Jul 19, 2024

crates/red_knot_python_semantic/src/builtins.rs (resolved)

carljm force-pushed the cjm/fix-incremental-bench branch from ac32227 to 5afe2a2 Compare July 19, 2024 14:59

Base automatically changed from cjm/fix-incremental-bench to main July 19, 2024 15:32

carljm force-pushed the builtins-resolution branch from 6df0ac3 to 3d58005 Compare July 19, 2024 15:41

AlexWaygood and others added 7 commits July 19, 2024 08:42

Add support for resolving the builtins file from typeshed directly

e34c06f

Support resolving symbols to the builtins scope as a fallback

6a6e01b

Update crates/red_knot_python_semantic/src/types.rs

159bf7c

Co-authored-by: Carl Meyer <[email protected]>

make builtins_module a regular function, not a query

f58f279

review comments

0ef0400

review comments

a5cd588

fix tracing

44977e3

carljm force-pushed the builtins-resolution branch from 3d58005 to 44977e3 Compare July 19, 2024 15:42

Fixes per review

2970faa

AlexWaygood commented Jul 19, 2024

crates/red_knot_module_resolver/src/resolver.rs (outdated, resolved)

MichaReiser approved these changes Jul 19, 2024

carljm reviewed Jul 19, 2024

Make it lazy static

0f0f5b2

AlexWaygood merged commit d8cf8ac into main Jul 19, 2024
20 checks passed

AlexWaygood deleted the builtins-resolution branch July 19, 2024 16:44
