[Experiment] Box `diagnostic_metadata` field #98120

TaKO8Ki · 2022-06-15T02:34:51Z

closes #97954

r? @estebank

TaKO8Ki · 2022-06-15T02:45:59Z

@bors try @rust-timer queue

rust-timer · 2022-06-15T02:46:01Z

Insufficient permissions to issue commands to rust-timer.

bors · 2022-06-15T02:46:01Z

@TaKO8Ki: 🔑 Insufficient privileges: not in try users

TaKO8Ki · 2022-06-15T02:47:10Z

I don't have permission to run perf.

eggyal · 2022-06-15T03:15:32Z

This is exactly what I had thought the issue was saying, yet it surely only adds the cost of additional heap allocs? I'm sure I must be missing something, as I can't see how there could be any benefit from it?

joshtriplett · 2022-06-15T03:15:42Z

@bors try @rust-timer queue

rust-timer · 2022-06-15T03:15:44Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2022-06-15T03:15:51Z

⌛ Trying commit 5ece481 with merge e5e645fd772153e58de155a806285c5cef5d3428...

bors · 2022-06-15T04:46:47Z

☀️ Try build successful - checks-actions
Build commit: e5e645fd772153e58de155a806285c5cef5d3428

rust-timer · 2022-06-15T04:46:48Z

Queued e5e645fd772153e58de155a806285c5cef5d3428 with parent 2d1e075, future comparison URL.

rust-timer · 2022-06-15T06:04:44Z

Finished benchmarking commit (e5e645fd772153e58de155a806285c5cef5d3428): comparison url.

Instruction count

This benchmark run did not return any relevant results for this metric.

Max RSS (memory usage)

Results

Primary benchmarks: 🎉 relevant improvement found
Secondary benchmarks: 🎉 relevant improvements found

| | mean¹ | max | count² |
|---|---|---|---|
| Regressions 😿 (primary) | N/A | N/A | 0 |
| Regressions 😿 (secondary) | N/A | N/A | 0 |
| Improvements 🎉 (primary) | -5.4% | -5.4% | 1 |
| Improvements 🎉 (secondary) | -3.4% | -4.5% | 2 |
| All 😿🎉 (primary) | -5.4% | -5.4% | 1 |

Cycles

This benchmark run did not return any relevant results for this metric.

If you disagree with this performance assessment, please file an issue in rust-lang/rustc-perf.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

@bors rollup=never
@rustbot label: +S-waiting-on-review -S-waiting-on-perf -perf-regression

¹ the arithmetic mean of the percent change
² number of relevant changes

joshtriplett · 2022-06-15T06:21:21Z

@TaKO8Ki Very nice!

eggyal · 2022-06-15T07:47:46Z

Really surprised by this... can someone explain it?

Kobzol · 2022-06-15T10:40:59Z

In general, boxing a field moves it to the heap, so you pay the cost of allocating (both in executed instructions and extra memory usage). But on the other hand, it can reduce the size of the structure that contains the field (e.g. if a field is 64 bytes and you box it, it shrinks to 8 bytes), which in turn can improve performance (less data loaded from memory, better cache utilization etc.).

That being said, I'm not really seeing any great improvements in the perf results here. Instruction counts and cycles are a wash, and Max RSS improved in only three cases, with a quite low significance factor (on average it's just a -0.24% improvement). So I'm not sure if this is worth it.
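The size trade-off described above can be sketched with `std::mem::size_of`. This is only an illustration: `Metadata`, `InlineVisitor` and `BoxedVisitor` are hypothetical stand-ins, not the real `DiagnosticMetadata` or `LateResolutionVisitor`, and the 64-byte payload is simply the figure used in the comment.

```rust
use std::mem::size_of;

/// Hypothetical stand-in for a diagnostics-only payload (64 bytes).
#[allow(dead_code)]
struct Metadata {
    slots: [u64; 8],
}

/// Visitor-like struct that stores the payload inline.
#[allow(dead_code)]
struct InlineVisitor {
    depth: u64,
    metadata: Metadata,
}

/// Same struct, but the payload lives behind a heap pointer.
#[allow(dead_code)]
struct BoxedVisitor {
    depth: u64,
    metadata: Box<Metadata>,
}

/// Returns (inline size, boxed size) in bytes.
fn visitor_sizes() -> (usize, usize) {
    (size_of::<InlineVisitor>(), size_of::<BoxedVisitor>())
}

fn main() {
    let (inline, boxed) = visitor_sizes();
    // Boxing replaces the 64-byte field with a single pointer.
    println!("inline: {inline} bytes, boxed: {boxed} bytes");
}
```

On a typical 64-bit target this should report 72 bytes for the inline layout and 16 for the boxed one. Whether the smaller struct is worth the extra allocation depends on how often the struct is moved or copied, which is what the perf run is meant to measure.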

eggyal · 2022-06-15T10:50:11Z

> But on the other hand, it can reduce the size of the structure that contains the attribute (e.g. if a field has 64 bytes and you box it, the size gets reduced to 8 bytes), which in turn can improve performance (less data loaded from memory, better cache utilization etc.).

Indeed, but this particular structure is quite large already so the reduction in its stack size shouldn't have a material impact. Moreover, the identified change is to total memory consumption: yet the heap allocs are made on object creation and maintained until object destruction so total memory should in fact have increased (by the size of the pointer).

Anyway, thank you for your thoughts! I'll leave it at that; I just found the idea of this experiment, and its outcome, rather surprising/unintuitive.
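The accounting behind the "total memory should in fact have increased" argument can be sketched as follows. The types are hypothetical stand-ins with an assumed 64-byte payload, and the footprint deliberately ignores allocator bookkeeping, which would only add to the boxed total.

```rust
use std::mem::size_of;

/// Hypothetical 64-byte payload standing in for the boxed field.
#[allow(dead_code)]
struct Metadata {
    slots: [u64; 8],
}

#[allow(dead_code)]
struct InlineVisitor {
    depth: u64,
    metadata: Metadata,
}

#[allow(dead_code)]
struct BoxedVisitor {
    depth: u64,
    metadata: Box<Metadata>,
}

/// Bytes held by one live inline visitor.
fn inline_footprint() -> usize {
    size_of::<InlineVisitor>()
}

/// Bytes held by one live boxed visitor: the struct itself plus its
/// heap payload (allocator overhead not counted).
fn boxed_footprint() -> usize {
    size_of::<BoxedVisitor>() + size_of::<Metadata>()
}

fn main() {
    let extra = boxed_footprint() - inline_footprint();
    println!("boxing costs {extra} extra bytes per live visitor");
}
```

Under these assumptions the boxed variant always holds more bytes in total while a visitor is alive, so a drop in Max RSS would have to come from somewhere else, such as smaller stack frames or copies, or from measurement noise.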

estebank · 2022-06-15T17:02:48Z

> this particular structure is quite large already

Is it? The only things that stand out to me as "potentially big" are ParentScope and PerNS, all other fields are in the heap, in one way or another (Vec, &, HashMap).

rust/compiler/rustc_resolve/src/late.rs

Lines 521 to 552 in 5ece481

    
    struct LateResolutionVisitor<'a, 'b, 'ast> {
        r: &'b mut Resolver<'a>,

        /// The module that represents the current item scope.
        parent_scope: ParentScope<'a>,

        /// The current set of local scopes for types and values.
        /// FIXME #4948: Reuse ribs to avoid allocation.
        ribs: PerNS<Vec<Rib<'a>>>,

        /// The current set of local scopes, for labels.
        label_ribs: Vec<Rib<'a, NodeId>>,

        /// The current set of local scopes for lifetimes.
        lifetime_ribs: Vec<LifetimeRib>,

        /// The trait that the current context can refer to.
        current_trait_ref: Option<(Module<'a>, TraitRef)>,

        /// Fields used to add information to diagnostic errors.
        diagnostic_metadata: Box<DiagnosticMetadata<'ast>>,

        /// State used to know whether to ignore resolution errors for function bodies.
        ///
        /// In particular, rustdoc uses this to avoid giving errors for `cfg()` items.
        /// In most cases this will be `None`, in which case errors will always be reported.
        /// If it is `true`, then it will be updated when entering a nested function or trait body.
        in_func_body: bool,

        /// Count the number of places a lifetime is used.
        lifetime_uses: FxHashMap<LocalDefId, LifetimeUseSet>,
    }

That being said, given how resolution behaves, without needing to clone or move the visitor much, I guess it's not surprising that this didn't have a significant perf impact beyond the margins. I can go either way on whether we should merge this or not, particularly given the patch size :)
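The observation that `Vec`, `&` and `Box` fields contribute little to the struct's own size can be checked with a small sketch. The element type `[u64; 32]` is an arbitrary choice for illustration, and `vec_handle_size` is a hypothetical helper, not something from rustc.

```rust
use std::mem::size_of;

/// Inline size of a `Vec<T>` handle in bytes; the elements live on the heap.
fn vec_handle_size<T>() -> usize {
    size_of::<Vec<T>>()
}

fn main() {
    let word = size_of::<usize>();

    // A Vec is a (pointer, capacity, length) triple, whatever it stores.
    assert_eq!(vec_handle_size::<u8>(), 3 * word);
    assert_eq!(vec_handle_size::<[u64; 32]>(), 3 * word);

    // References and boxes to sized types are a single pointer.
    assert_eq!(size_of::<&[u64; 32]>(), word);
    assert_eq!(size_of::<&mut [u64; 32]>(), word);
    assert_eq!(size_of::<Box<[u64; 32]>>(), word);

    println!("Vec handle: {} bytes, pointer: {} bytes", 3 * word, word);
}
```

Only fields stored by value, such as `ParentScope` and `PerNS` in the struct above, grow the visitor in proportion to their contents.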

estebank · 2022-07-18T18:41:32Z

@bors r+

bors · 2022-07-18T18:41:33Z

📌 Commit 5ece481 has been approved by estebank

It is now in the queue for this repository.

bors · 2022-07-19T03:02:32Z

⌛ Testing commit 5ece481 with merge 96c2df8...

bors · 2022-07-19T05:46:38Z

☀️ Test successful - checks-actions
Approved by: estebank
Pushing 96c2df8 to master...

rust-timer · 2022-07-19T08:18:57Z

Finished benchmarking commit (96c2df8): comparison url.

Instruction count

Primary benchmarks: no relevant changes found
Secondary benchmarks: 😿 relevant regression found

| | mean¹ | max | count² |
|---|---|---|---|
| Regressions 😿 (primary) | N/A | N/A | 0 |
| Regressions 😿 (secondary) | 0.4% | 0.4% | 1 |
| Improvements 🎉 (primary) | N/A | N/A | 0 |
| Improvements 🎉 (secondary) | N/A | N/A | 0 |
| All 😿🎉 (primary) | N/A | N/A | 0 |

Max RSS (memory usage)

Results

Primary benchmarks: mixed results
Secondary benchmarks: mixed results

| | mean¹ | max | count² |
|---|---|---|---|
| Regressions 😿 (primary) | 3.0% | 3.0% | 1 |
| Regressions 😿 (secondary) | 3.0% | 3.5% | 3 |
| Improvements 🎉 (primary) | -2.5% | -2.5% | 1 |
| Improvements 🎉 (secondary) | -3.3% | -3.6% | 2 |
| All 😿🎉 (primary) | 0.2% | 3.0% | 2 |

Cycles

Results

Primary benchmarks: mixed results
Secondary benchmarks: mixed results

| | mean¹ | max | count² |
|---|---|---|---|
| Regressions 😿 (primary) | 2.9% | 3.5% | 2 |
| Regressions 😿 (secondary) | 2.9% | 3.0% | 2 |
| Improvements 🎉 (primary) | -2.1% | -2.1% | 1 |
| Improvements 🎉 (secondary) | -4.2% | -4.2% | 1 |
| All 😿🎉 (primary) | 1.2% | 3.5% | 3 |

If you disagree with this performance assessment, please file an issue in rust-lang/rustc-perf.

@rustbot label: -perf-regression

¹ the arithmetic mean of the percent change
² number of relevant changes

box diagnostic_metadata field

5ece481

rust-highfive assigned estebank Jun 15, 2022

rustbot added the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label Jun 15, 2022

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Jun 15, 2022

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Jun 15, 2022

Dylan-DPC added the S-experimental Status: Ongoing experiment that does not require reviewing and won't be merged in its current state. label Jun 15, 2022

rustbot removed the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Jun 15, 2022

JohnCSimon added S-experimental Status: Ongoing experiment that does not require reviewing and won't be merged in its current state. and removed S-experimental Status: Ongoing experiment that does not require reviewing and won't be merged in its current state. labels Jul 3, 2022

Dylan-DPC removed the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Jul 4, 2022

bors added the S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. label Jul 18, 2022

bors added the merged-by-bors This PR was explicitly merged by bors. label Jul 19, 2022

bors merged commit 96c2df8 into rust-lang:master Jul 19, 2022

rustbot added this to the 1.64.0 milestone Jul 19, 2022

TaKO8Ki deleted the box-diagnostic-metadata-field branch July 19, 2022 08:34
