Add optimization to avoid load of address #76683

simonvandel · 2020-09-13T20:50:58Z

Look for the sequence

_2 = &_1;
...
_5 = (*_2)

in which we can replace the last statement with _5 = _1 to avoid the load of _2

rust-highfive · 2020-09-13T20:51:01Z

r? @oli-obk

(rust_highfive has picked a reviewer for you, use r? to override)

compiler/rustc_mir/src/transform/instcombine.rs

simonvandel · 2020-09-14T20:55:14Z

Hi @jonas-schievink, thanks for the review.
In the newest commits I:

Rebased on master to fix conflict
Added a lookback of 6 (the commit has a table for the numbers I got)
Added a test and fixed the miscompilation you pointed out

@rustbot modify labels: +S-waiting-on-review -S-waiting-on-author

oli-obk

This optimization is very scary, it has lots of edge cases that we need to think about

compiler/rustc_mir/src/transform/instcombine.rs

oli-obk · 2020-09-17T15:43:05Z

compiler/rustc_mir/src/transform/instcombine.rs

+                            }
+                        }
+                    }
+                    _ => {}


if local_being_derefed is used on the rhs of an assignment to another local, your optimization may misfire I think.

let x = 42; let a = 99; let mut y = &x; let z = &mut y; *z = &a; println!("{}", *y);

will not print 99 after your optimization but 42

I added a test for this. It does not seem like it will misfire. I also added more clarifying comments on the matches. I have persuaded myself that since we are only applying this optimization on immutable references, we can't have a mutable reference at the same time, so nothing (besides asm) can break this optimization. My reasoning may be wrong though..

You are correct. Setting the lookback higher does indeed cause a miscompilation. I'll look into fixing it.

The general way to look for mutation is to define a visitor and visit the construct you want to analyze (Statement in your case) and implement https://doc.rust-lang.org/nightly/nightly-rustc/rustc_middle/mir/visit/trait.Visitor.html#method.visit_local to and check the PlaceContext if the local matches the local you want to check.

Ah, I saw your comment a tad too late. I already pushed a fix, but i'll see if it is cleaner/less adhoc to use the visitor. Thanks for the hint.

The latest commits now check that local_being_derefed is not mutated using a visitor. I have verified that setting the lookback to 10000 causes a misoptimization without the fix, and fixes it with the fix. I have set it back to to avoid bad runtime complexity. I'm not sure how best to represent that the fix is actually working. For now I separated it into two commits.

Thanks! Splitting it into individual commits made this great to review.

I also don't know how to test it. A bogus idea (Don't do this 😆): instead of 6 use 6 + mir_opt_level, then we can do -Zmir-opt-level=99999 on that test.

An alternative to having this magic number would be to implement the optimization as a feed forward (am I using words correctly here?) optimization. What I mean is that we don't go through every deref and look back, but we walk forward through each block and keep a set of "deref-optimizable" locals that we update as we go.

That said, I'm fine with merging this optimization as long as there's a tracking issue for exploring such a change to the optimization. I'm not sure whether that change is feasible in practice, but we should explore it

Cool. Yeah it should be possible to do this optimization without the lookback, such that a single statement is only visited once. I'll open a new pr for that.

Can we do a perf run on the current pr? I'm curious if this has any impact

compiler/rustc_mir/src/transform/instcombine.rs

oli-obk · 2020-09-20T08:46:01Z

@bors try @rust-timer queue

rust-timer · 2020-09-20T08:46:03Z

Awaiting bors try build completion

bors · 2020-09-20T08:46:13Z

⌛ Trying commit 116283b910a7f8809260ba91cb9b9b647a8900a3 with merge 7d835e12834e0230779883b269523b01a940c2df...

bors · 2020-09-20T09:33:56Z

☀️ Try build successful - checks-actions, checks-azure
Build commit: 7d835e12834e0230779883b269523b01a940c2df (7d835e12834e0230779883b269523b01a940c2df)

rust-timer · 2020-09-20T09:33:57Z

Queued 7d835e12834e0230779883b269523b01a940c2df with parent 10b3595, future comparison URL.

rust-timer · 2020-09-20T12:15:39Z

Finished benchmarking try commit (7d835e12834e0230779883b269523b01a940c2df): comparison url.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. Please note that if the perf results are neutral, you should likely undo the rollup=never given below by specifying rollup- to bors.

Importantly, though, if the results of this run are non-neutral do not roll this PR up -- it will mask other regressions or improvements in the roll up.

@bors rollup=never

oli-obk · 2020-09-20T12:18:17Z

perf looks ok. Please squash so we get all the back and forth out of the timeline

simonvandel · 2020-09-20T13:04:31Z

Squashed and rebased on master

oli-obk · 2020-09-20T13:21:19Z

@bors r+

bors · 2020-09-20T13:21:20Z

📌 Commit 4dedb76 has been approved by oli-obk

bors · 2020-09-21T00:13:21Z

⌛ Testing commit 4dedb76 with merge 40fe3db7186781f895de3f6d860dbd5d570ec21c...

bors · 2020-09-21T00:43:18Z

💔 Test failed - checks-actions

oli-obk · 2020-09-21T07:18:20Z

Looks like you need to rebase and rebless again?

…heckedAdd This makes the test run deterministic regardless of noopt testruns

simonvandel · 2020-09-21T20:16:40Z

Rebased on master and added a new commit such that Add is always generated instead of CheckedAdd, which broke the test diff on noopt testruns.

oli-obk · 2020-09-21T20:55:47Z

@bors r+

bors · 2020-09-21T20:55:49Z

📌 Commit dfc469d has been approved by oli-obk

bors · 2020-09-21T22:04:21Z

⌛ Testing commit dfc469d with merge f47df31...

bors · 2020-09-22T00:22:11Z

☀️ Test successful - checks-actions, checks-azure
Approved by: oli-obk
Pushing f47df31 to master...

eddyb · 2020-09-22T01:56:51Z

@rust-lang/wg-mir-opt I think this would be better suited to constant-folding with a notion of "symbolic" values.
(not necessarily symbolic miri evaluation, but propagatable abstract values that represent runtime values which are effectively "constant" within one execution of the function)

But also I'm a bit worried this kind of optimization may break stacked borrows type assumptions if the indirection was "weakening" the semantics of the access, and making it direct "strengthens" it too much - though this may only be a problem with raw pointers.

rust-highfive assigned oli-obk Sep 13, 2020

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Sep 13, 2020

jonas-schievink reviewed Sep 13, 2020

View reviewed changes

compiler/rustc_mir/src/transform/instcombine.rs Outdated Show resolved Hide resolved

compiler/rustc_mir/src/transform/instcombine.rs Outdated Show resolved Hide resolved

This comment has been minimized.

Sign in to view

simonvandel force-pushed the inst-combine-deref branch from 62de698 to 71219e7 Compare September 14, 2020 20:52

jyn514 added A-mir-opt Area: MIR optimizations T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Sep 16, 2020

oli-obk requested changes Sep 17, 2020

View reviewed changes

oli-obk requested changes Sep 19, 2020

View reviewed changes

simonvandel force-pushed the inst-combine-deref branch from 116283b to 4dedb76 Compare September 20, 2020 13:03

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Sep 20, 2020

bors added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. labels Sep 21, 2020

simonvandel added 2 commits September 21, 2020 22:08

Add optimization to avoid load of address

2bb3844

Run the test with explicit -O such that Add is generated instead of C…

dfc469d

…heckedAdd This makes the test run deterministic regardless of noopt testruns

simonvandel force-pushed the inst-combine-deref branch from 4dedb76 to dfc469d Compare September 21, 2020 20:15

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Sep 21, 2020

bors added the merged-by-bors This PR was explicitly merged by bors. label Sep 22, 2020

bors merged commit f47df31 into rust-lang:master Sep 22, 2020

rustbot added this to the 1.48.0 milestone Sep 22, 2020

bors mentioned this pull request Sep 22, 2020

[MIR-OPT]: Optimization that turns Eq-Not pair into Ne #77031

Closed

tmiasko mentioned this pull request Oct 21, 2020

InstCombine introduces an incorrect use of a local after its storage has ended #78192

Closed

simonvandel mentioned this pull request Oct 25, 2020

Avoid backtracking in "deref_of_address" MIR optimization #78368

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add optimization to avoid load of address #76683

Add optimization to avoid load of address #76683

simonvandel commented Sep 13, 2020

rust-highfive commented Sep 13, 2020

This comment has been minimized.

simonvandel commented Sep 14, 2020

oli-obk left a comment

oli-obk Sep 17, 2020

simonvandel Sep 19, 2020

simonvandel Sep 19, 2020

oli-obk Sep 19, 2020

simonvandel Sep 19, 2020

simonvandel Sep 19, 2020

oli-obk Sep 20, 2020

oli-obk Sep 20, 2020

simonvandel Sep 20, 2020

oli-obk commented Sep 20, 2020

rust-timer commented Sep 20, 2020

bors commented Sep 20, 2020

bors commented Sep 20, 2020

rust-timer commented Sep 20, 2020

rust-timer commented Sep 20, 2020

oli-obk commented Sep 20, 2020

simonvandel commented Sep 20, 2020

oli-obk commented Sep 20, 2020

bors commented Sep 20, 2020

bors commented Sep 21, 2020

bors commented Sep 21, 2020

oli-obk commented Sep 21, 2020

simonvandel commented Sep 21, 2020

oli-obk commented Sep 21, 2020

bors commented Sep 21, 2020

bors commented Sep 21, 2020

bors commented Sep 22, 2020

eddyb commented Sep 22, 2020

Add optimization to avoid load of address #76683

Add optimization to avoid load of address #76683

Conversation

simonvandel commented Sep 13, 2020

rust-highfive commented Sep 13, 2020

This comment has been minimized.

simonvandel commented Sep 14, 2020

oli-obk left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

oli-obk commented Sep 20, 2020

rust-timer commented Sep 20, 2020

bors commented Sep 20, 2020

bors commented Sep 20, 2020

rust-timer commented Sep 20, 2020

rust-timer commented Sep 20, 2020

oli-obk commented Sep 20, 2020

simonvandel commented Sep 20, 2020

oli-obk commented Sep 20, 2020

bors commented Sep 20, 2020

bors commented Sep 21, 2020

bors commented Sep 21, 2020

oli-obk commented Sep 21, 2020

simonvandel commented Sep 21, 2020

oli-obk commented Sep 21, 2020

bors commented Sep 21, 2020

bors commented Sep 21, 2020

bors commented Sep 22, 2020

eddyb commented Sep 22, 2020