Add reduction_size and broadcast_size attributes to XeTile.reduction and XeTile.broadcast #996

Jianhui-Li · 2025-01-09T22:08:06Z

Add broadcast_size to XeTile.reduction and XeTile.broadcast to support partial reduction.

We found two use case for partial reduction:


1. Support the staged work group reduction - first reduce within workgroup and then reduce across workgroup.
 %18 = math.exp %17#0 {map = #xetile.wg_map<sg_layout = [8, 4], sg_data = [32, 32]>} : vector<256x128xf32>
%19= vector.shape_cast %18 {map = #xetile.wg_map<sg_layout = [8, 4], sg_data = [32, 32]>} : vector<256x128xf32> to vector<8x32x128xf32>
%20 = vector.multi_reduction <add>, %19, %cst_0 {map = #xetile.wg_map<sg_layout = [8, 4], sg_data = [1, 32]>} [1] : vector<8x32x128xf32> to vector<8x128xf32>

        => 
 
 %18 = math.exp %17#0 {map = #xetile.wg_map<sg_layout = [8, 4], sg_data = [32, 32]>} : vector<256x128xf32>
 %20 = xetile.reduction <add>, %18, %cst_0 {map = #xetile.wg_map<sg_layout = [8, 4], sg_data = [1, 32]>} [1] {$reduction_size = [32]} : vector<256x128xf32> to vector<8x128xf32>

Support MXFP reduction - the reduction only reduce 32 elements to one element, so not reduce the whole dimension to one element.

This PR also fixes a few name inconsistency: tile_broadcast to broadcast, tile_reduce to reduction, tile_transpose to transpose, atomic_rmw_tile to atomic_rmw, tile_conv_layout to convert_layout.

Please review these guidelines to help with the review process:

Have you provided a meaningful PR description?
Have you added a test, a reproducer, or a reference to an issue with a reproducer?
Have you tested your changes locally for CPU and GPU devices?
Have you made sure that new changes do not introduce compiler warnings?
If this PR is a work in progress, are you filing the PR as a draft?
Have you organized your commits logically and ensured each can be built by itself?

Add broadcast_size to xeTile.broadcast operation to make name consistent: tile_broadcast to broadcast, tile_reduce to reduction, tile_transpose to transpose, atomic_rmw_tile to atomic_rmw.

nbpatel

LGTM. Can we also change xetile.tile_conv_layout to xetile.conv_layout

tile_conv_layout to convert_layout

Garra1980 · 2025-01-10T20:07:35Z

docs/rfcs/XeTile.md

 ```mlir
-   %vector_a = xetile.tile_reduce <add> %vector_b [1]: vector<64x32xfloat> into vector<64x1xfloat>
+   %vector_a = xetile.reduction <add> %vector_b [0] {$reduction_size=32}: vector<64x64xfloat>, vector<2x64xfloat> into vector<2x64xfloat>


do we need to mention vector<2x64xfloat> twice in this operation?

fixed. thanks

Jianhui-Li added 4 commits October 19, 2024 13:12

Update XeGPU.md

63ce038

Merge branch 'intel:main' into main

2a2697a

Update XeTile.md

1e7720e

Add broadcast_size to xeTile.broadcast operation to make name consistent: tile_broadcast to broadcast, tile_reduce to reduction, tile_transpose to transpose, atomic_rmw_tile to atomic_rmw.

Update XeTile.md

1398a9e

chencha3 approved these changes Jan 9, 2025

View reviewed changes

nbpatel approved these changes Jan 9, 2025

View reviewed changes

Jianhui-Li added 2 commits January 9, 2025 14:26

Update XeTile.md

6102923

tile_conv_layout to convert_layout

Update XeTile.md

455ce8f

Garra1980 reviewed Jan 10, 2025

View reviewed changes

Update XeTile.md

ebfc803

Garra1980 approved these changes Jan 10, 2025

View reviewed changes

silee2 merged commit 5dbeec7 into intel:main Jan 13, 2025
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add reduction_size and broadcast_size attributes to XeTile.reduction and XeTile.broadcast #996

Add reduction_size and broadcast_size attributes to XeTile.reduction and XeTile.broadcast #996

Jianhui-Li commented Jan 9, 2025 •

edited

Loading

nbpatel left a comment

Garra1980 Jan 10, 2025

Jianhui-Li Jan 10, 2025

Add reduction_size and broadcast_size attributes to XeTile.reduction and XeTile.broadcast #996

Add reduction_size and broadcast_size attributes to XeTile.reduction and XeTile.broadcast #996

Conversation

Jianhui-Li commented Jan 9, 2025 • edited Loading

nbpatel left a comment

Choose a reason for hiding this comment

Garra1980 Jan 10, 2025

Choose a reason for hiding this comment

Jianhui-Li Jan 10, 2025

Choose a reason for hiding this comment

Jianhui-Li commented Jan 9, 2025 •

edited

Loading