Shrink our main uniform buffer by 32 bytes #16103

hrydgard · 2022-09-25T08:54:40Z

That's half a 4x4 matrix, we're down to 480 bytes now.

Ideally I'd like to be able to squeeze two VR eye matrices in here without exceeding 512 bytes... but starting to look impossible. Though if we'd merge the two proj matrices, which should be doable, we'd get closer.

There are a bunch of float4 colors that could be easily squeezed into 32 bits each (fog color etc) but not sure how those will affect performance on old hardware. I guess the rarer ones like blendFixA/B would be fine.. There's value too in keeping uniforms as similar to GL as possible.

There's a reason we want to stay below 512 bytes, because the next step up is 768 on a lot of hardware as they can only align uniform buffers on 256-byte boundaries. Whether it actually makes much of a performance difference in practice, probably not hugely...

Another way to go would be to dynamically generate uniform buffers with just the constants that each pipeline needs, but the complexity would be huge. Very likely not worth it.

unknownbrackets

A few things we could do:

texEnvColor is fairly uncommon, so could maybe be uint.
viewPos is only needed for fog.
- We could actually combine proj/proj_through/view into a single matrix (viewproj) + a single vec4 to calculate fog from worldpos.
- This would also get rid of fogCoef.
- We could still cache the matrices separately and only multiply if dirty before flush, so I don't think this would need to be that expensive.
- The fog vec4 would be cheap since it'd just be parts of the view matrix, though we'd scale by fogCoef.
- Even keeping proj_through, this would get rid of 40 bytes (remove 12 for view, remove 2 fog fogCoef, add 4 fog fogFromWorld.)
I agree about blendFix, especially blendFixB should be uncommon.

-[Unknown]

GPU/Vulkan/DrawEngineVulkan.cpp

GPU/Common/FragmentShaderGenerator.cpp

GPU/Common/ShaderUniforms.h

…d to 128 bits. Allows us to save 16 bytes from the main uniform buffer, since there's free 32-bit spaces here and there to use.

…bytes.

This is simpler and allows us to unify paths better.

GPU: Apply color test mask as a uint

hrydgard added the Vulkan label Sep 25, 2022

hrydgard added this to the v1.14.0 milestone Sep 25, 2022

unknownbrackets approved these changes Sep 25, 2022

View reviewed changes

GPU/Vulkan/DrawEngineVulkan.cpp Outdated Show resolved Hide resolved

GPU/Common/FragmentShaderGenerator.cpp Outdated Show resolved Hide resolved

GPU/Common/FragmentShaderGenerator.cpp Outdated Show resolved Hide resolved

GPU/Common/ShaderUniforms.h Outdated Show resolved Hide resolved

hrydgard added 4 commits September 26, 2022 13:04

Fragment shader uniforms: Pack color mask in 32 bits instead of expan…

f4b71e2

…d to 128 bits. Allows us to save 16 bytes from the main uniform buffer, since there's free 32-bit spaces here and there to use.

Shuffle constants around, squeezing them into gaps. Saves another 16 …

cfa427c

…bytes.

ShaderUniforms: cleanup, put every "4-float" on a line for clarity

fc30b04

ivec->uvec, comment fix

d9f74d2

hrydgard force-pushed the optimize-shader-constants branch from 39e9313 to d9f74d2 Compare September 26, 2022 11:10

unknownbrackets added 3 commits September 26, 2022 06:57

GPU: Consistently use uvec3 for colortest.

a19a057

GPU: Apply color test mask as a uint.

4329aaa

This is simpler and allows us to unify paths better.

Merge pull request #16109 from unknownbrackets/optimize-shader-constants

ce835d1

GPU: Apply color test mask as a uint

hrydgard merged commit 89e6b10 into master Sep 26, 2022

hrydgard deleted the optimize-shader-constants branch September 26, 2022 17:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shrink our main uniform buffer by 32 bytes #16103

Shrink our main uniform buffer by 32 bytes #16103

hrydgard commented Sep 25, 2022 •

edited

Loading

unknownbrackets left a comment

Shrink our main uniform buffer by 32 bytes #16103

Shrink our main uniform buffer by 32 bytes #16103

Conversation

hrydgard commented Sep 25, 2022 • edited Loading

unknownbrackets left a comment

Choose a reason for hiding this comment

hrydgard commented Sep 25, 2022 •

edited

Loading