Missed enum layout optimization depending on enum variant field reordering #125630

cmrschwarz · 2024-05-27T19:49:27Z

rustc fails to optimize the enum layout of V1 and V2 of the (real-world) example type LazyRwLockGuard down to 24 bytes.

In the case of V1, this might be due to #101567.

But the case of V2 seems to be a separate issue, as it is identical to V3 (which is optimized correctly), except for the ordering of the fields inside the Read enum variant.

(For clarity, RwLockReadGuard and RwLockWriteGuard each use 16 bytes and contain a niche).

use std::sync::*;

// could use 24 bytes, uses 32 (probably known issue, #101567)
pub enum LazyRwLockGuardV1<'a, T> {
    Unlocked(&'a RwLock<T>),
    Read {
        lock: &'a RwLock<T>,
        guard: RwLockReadGuard<'a, T>,
    },    
    Write(RwLockWriteGuard<'a, T>),
}

// helper subtype (16 bytes) so the main type figures out its niche correctly
pub enum LazyRwLockWriteGuard<'a, T> {
    Unlocked(&'a RwLock<T>),
    Write(RwLockWriteGuard<'a, T>),
}

// this type should now be 24 bytes, but unfortunately still uses 32
pub enum LazyRwLockGuardV2<'a, T> {
    Read {
        lock: &'a RwLock<T>,
        guard: RwLockReadGuard<'a, T>,
    },    
    NonRead(LazyRwLockWriteGuard<'a, T>),
}

// this type correctly uses 24 bytes
pub enum LazyRwLockGuardV3<'a, T> {
    Read {
        guard: RwLockReadGuard<'a, T>,
        lock: &'a RwLock<T>, // fields reordered
    },    
    NonRead(LazyRwLockWriteGuard<'a, T>),
}

godbolt repro
stackoverflow question


the8472 · 2024-05-27T22:28:17Z

There are some field ordering heuristics that get applied to regular structs (#102750, #108106) but not to enum variants. If you extract the Read variant into a struct and make that struct into a variant payload it does work as desired.

Enums are under different constraints so the optimizations can't be ported 1:1, but with some tweaks it should be possible.
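A minimal sketch of the workaround described above, using the types from the original report. The names `ReadState` and `LazyRwLockGuardExtracted` are hypothetical and only serve the illustration. The exact sizes depend on the compiler version and target, so the program prints them instead of assuming specific values.

```rust
use std::mem::size_of;
use std::sync::{RwLock, RwLockReadGuard, RwLockWriteGuard};

// Helper subtype from the original report.
pub enum LazyRwLockWriteGuard<'a, T> {
    Unlocked(&'a RwLock<T>),
    Write(RwLockWriteGuard<'a, T>),
}

// V2 from the original report, kept here for comparison.
pub enum LazyRwLockGuardV2<'a, T> {
    Read {
        lock: &'a RwLock<T>,
        guard: RwLockReadGuard<'a, T>,
    },
    NonRead(LazyRwLockWriteGuard<'a, T>),
}

// Hypothetical struct holding the fields of the `Read` variant.
// As a regular struct it goes through the struct field ordering heuristics.
pub struct ReadState<'a, T> {
    pub lock: &'a RwLock<T>,
    pub guard: RwLockReadGuard<'a, T>,
}

// Same shape as V2, but the `Read` payload is the extracted struct.
pub enum LazyRwLockGuardExtracted<'a, T> {
    Read(ReadState<'a, T>),
    NonRead(LazyRwLockWriteGuard<'a, T>),
}

fn main() {
    // Sizes are printed rather than hard-coded because they vary by toolchain.
    println!("V2:        {} bytes", size_of::<LazyRwLockGuardV2<'static, u32>>());
    println!("Extracted: {} bytes", size_of::<LazyRwLockGuardExtracted<'static, u32>>());
}
```

The field order inside `ReadState` matches V2 on purpose: if the struct heuristics apply, the declaration order should not matter for the resulting size.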

cmrschwarz · 2024-05-28T02:12:46Z

In case my practical example is a bit overcomplicated, here's a reduced version:

use std::num::NonZeroU64;

// uses 32 bytes, swapping x and y brings it down to 24
pub enum Foo {
    A {
        x: NonZeroU64,  
        y: [NonZeroU64; 2],
    },    
    B([u64; 2]),
}

godbolt repro
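To check the reduced example locally, a small sketch that prints the size of `Foo` next to a variant with `x` and `y` swapped may help. `FooSwapped` is a hypothetical name introduced for the comparison. The size of `Foo` depends on whether the toolchain already includes the fix for this issue, so no output is assumed here.

```rust
use std::mem::size_of;
use std::num::NonZeroU64;

// Reduced example from the report: 32 bytes on affected compilers.
pub enum Foo {
    A {
        x: NonZeroU64,
        y: [NonZeroU64; 2],
    },
    B([u64; 2]),
}

// Hypothetical comparison type: identical except that `x` and `y` are swapped.
pub enum FooSwapped {
    A {
        y: [NonZeroU64; 2],
        x: NonZeroU64,
    },
    B([u64; 2]),
}

fn main() {
    // Variant A needs 24 bytes of payload, so 24 is the lower bound for both enums.
    println!("Foo:        {} bytes", size_of::<Foo>());
    println!("FooSwapped: {} bytes", size_of::<FooSwapped>());
}
```

If both lines print the same number, the layout no longer depends on the declared field order for this type.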

…, r=the8472 Get rid of niche selection's dependence on fields's order Fixes rust-lang#125630. Use the optimal niche selection decided in `univariant()` rather than picking niche field manually. r? `@the8472`

rustbot added the needs-triage label May 27, 2024

adwinwhite mentioned this issue Sep 18, 2024

Get rid of niche selection's dependence on fields's order #130508

Merged

bors closed this as completed in 2b11f26 Sep 20, 2024
