[metal] Performance Improvements for `bitmasked` #678

k-ye · 2020-03-29T02:26:27Z

Currently, Metal's listgen launches a group of threads whose size equals the total number of SNodes. This can be very inefficient. Try grid-stride loops to balance the load.
Also adopt the grid-stride loop pattern to implement struct_for kernels.
If the immediate parent of a leaf place is a `bitmasked` SNode, then instead of appending the active bitmasked elements to ListManager, we might want to just loop through all the possible coordinates and check directly which ones are active. (This is how LLVM backends implement it?) The reason is that appending to ListManager is expensive, as it uses atomic operations.
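
To make the two ideas concrete, here is a minimal CPU-side sketch in Python, not the actual Metal kernel. It simulates a grid-stride loop with a fixed number of threads and tests each cell's bit directly instead of appending to a list. The names `grid_stride_indices` and `scan_bitmask`, the 12-cell bitmask, and the thread count are all hypothetical, chosen only for illustration.

```python
def grid_stride_indices(thread_id, num_threads, total):
    """Indices one simulated thread visits: thread_id, thread_id + num_threads, ..."""
    return range(thread_id, total, num_threads)


def scan_bitmask(bitmask, num_cells, num_threads, body):
    """Run body(i) for every active cell by testing bits directly.

    The outer loop is a sequential stand-in for threads that would run in
    parallel on a GPU. No shared list is built, so no atomic append is needed.
    """
    for tid in range(num_threads):
        for i in grid_stride_indices(tid, num_threads, num_cells):
            if (bitmask >> i) & 1:  # is cell i active?
                body(i)


# Hypothetical 12-cell bitmasked node with cells 0, 5, 6, 9, 11 active.
bitmask = 0b101001100001
visited = []
scan_bitmask(bitmask, num_cells=12, num_threads=4, body=visited.append)
print(sorted(visited))
```

The thread count stays fixed regardless of how many cells exist, and each thread covers several cells, which is the load-balancing property the grid-stride pattern is meant to provide.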


yuanming-hu · 2020-03-29T02:35:48Z

> This is how LLVM backends implement it?

Right. In LLVM, ListManager only generates 8x8x8 leaf blocks (instead of every 1x1x1 leaf voxel). This allows the list generation overhead to be amortized over 8x8x8=512 voxels.
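
The amortization can be illustrated with a small counting sketch in Python. It does not model the LLVM backend itself; it only compares how many list entries are needed when listing per voxel versus per 8x8x8 block. The function `list_entries` and the 16x16x16 test region are assumptions made for this example.

```python
BLOCK = 8  # block edge length from the comment above (8x8x8 = 512 voxels)


def list_entries(active_voxels, per_block):
    """Count the list entries needed to cover the given active voxels."""
    if per_block:
        # One entry per distinct 8x8x8 block containing an active voxel.
        return len({(x // BLOCK, y // BLOCK, z // BLOCK) for x, y, z in active_voxels})
    # One entry per active voxel.
    return len(active_voxels)


# Hypothetical fully active 16x16x16 region.
voxels = [(x, y, z) for x in range(16) for y in range(16) for z in range(16)]
per_voxel = list_entries(voxels, per_block=False)
per_block = list_entries(voxels, per_block=True)
print(per_voxel, per_block, per_voxel // per_block)
```

For this fully active region the per-block list needs 8 entries instead of 4096, a factor of 512. Sparser regions would see a smaller reduction, since a block is listed even if only one of its voxels is active.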

k-ye added the `feature request` label Mar 29, 2020

k-ye self-assigned this Mar 29, 2020

k-ye added the `mac` label Mar 29, 2020

k-ye mentioned this issue Mar 29, 2020

Support basic sparsity SNode on Metal #593

Closed

k-ye added the `c++` and `enhancement` labels and removed the `feature request` label Mar 29, 2020

This was referenced Mar 30, 2020

[metal] Use grid-stride loop to implement listgen kernels #682

Merged

[metal] Skip listgen for leaf Snode #699

Merged

k-ye closed this as completed Apr 5, 2020
