feat: patches uses a map in some cases #1626

danking · 2024-12-09T20:37:24Z

See [this sheet for the data from take_patches.rs](https://docs.google.com/spreadsheets/d/1D9vBZ1QJ6mwcIvV5wIL0hjGgVchcEnAyhvitqWu2ugU). I'm on an M3 Max with 96 GiB of RAM running macOS 14.4. This threshold likely depends on the ISA.

Intuitively, repeated searching is `O(N_INDICES * lg N_PATCHES)` and repeated map lookups are `O(N_INDICES + N_PATCHES)`. It seems to me that the compiler & CPU would have trouble parallelizing the search (via SIMD or ILP) because of the branching, whereas map lookups are more obviously parallelizable (e.g. SIMD hash computation). I'm not entirely sure why the crossover point seems to be around `N_PATCHES / N_INDICES = 5`. I believe the M3 Max has 128-bit SIMD registers, so if the indices are 32 bits wide then index arithmetic could be 4-way parallel.
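To make the two strategies concrete, here is a minimal sketch of both lookups over a sorted list of patch indices. It is not the code in this PR: `take_search`, `take_map`, `take_patches`, and the `max_ratio` parameter are hypothetical names, and the direction of the switch (use the map when patches are few relative to the indices being taken) is an assumption read off the complexity estimates above rather than something the benchmark sheet is quoted as confirming.

```rust
use std::collections::HashMap;

/// Binary-search each requested index in the sorted patch indices.
/// Cost is roughly O(N_INDICES * lg N_PATCHES).
fn take_search(patch_indices: &[u32], patch_values: &[i64], take: &[u32]) -> Vec<Option<i64>> {
    take.iter()
        .map(|i| patch_indices.binary_search(i).ok().map(|pos| patch_values[pos]))
        .collect()
}

/// Build a map from patch index to value once, then probe it per requested index.
/// Cost is roughly O(N_INDICES + N_PATCHES).
fn take_map(patch_indices: &[u32], patch_values: &[i64], take: &[u32]) -> Vec<Option<i64>> {
    let map: HashMap<u32, i64> = patch_indices
        .iter()
        .copied()
        .zip(patch_values.iter().copied())
        .collect();
    take.iter().map(|i| map.get(i).copied()).collect()
}

/// Hypothetical dispatcher: use the map when N_PATCHES / N_INDICES <= max_ratio.
/// Both the direction and the value of the threshold are assumptions.
fn take_patches(
    patch_indices: &[u32],
    patch_values: &[i64],
    take: &[u32],
    max_ratio: usize,
) -> Vec<Option<i64>> {
    if patch_indices.len() <= max_ratio.saturating_mul(take.len()) {
        take_map(patch_indices, patch_values, take)
    } else {
        take_search(patch_indices, patch_values, take)
    }
}
```

Both functions return `None` for positions that have no patch, so a caller can swap one for the other and benchmark the crossover on its own hardware instead of hard-coding 5.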

A second attempt at #1626 with fixes from #1628 as well as the transition of ALPRD and SparseArray to use Patches.

danking requested a review from gatesn December 9, 2024 20:37

danking marked this pull request as ready for review December 9, 2024 20:37

lwwmanning approved these changes Dec 9, 2024

danking added 3 commits December 9, 2024 16:13

fix

116462b

clippy

500975f

more clippy

501d725

danking merged commit 0b93fe0 into develop Dec 9, 2024
16 checks passed

danking deleted the dk/restore-euro-2016-speed branch December 9, 2024 22:40

danking mentioned this pull request Dec 10, 2024

feat: faster Patches::take & use patches in alp-rd & sparse #1628

Merged

lwwmanning added a commit that referenced this pull request Dec 10, 2024

Revert "feat: patches uses a map in some cases (#1626)"

0f92b48

This reverts commit 0b93fe0.

lwwmanning mentioned this pull request Dec 10, 2024

Revert "feat: patches uses a map in some cases" #1629

Merged

lwwmanning added a commit that referenced this pull request Dec 10, 2024

Revert "feat: patches uses a map in some cases" (#1629)

3de4b29

Reverts #1626
