Optimisations for `LoadNextMsgMulti` #6448

neilalexander · 2025-02-04T13:23:56Z

This PR optimises LoadNextMsgMulti by doing two things:

Avoid large linear scans in firstMatchingMulti by using subject tree intersection when it looks like the FSS size is smaller than the scan range, this should be less expensive (although the threshold for choosing strategy may need further attention);
Check the dmap before calling mb.cacheLookup() in either the FSS search or the linear scan, such that we can avoid calling time.Now() a lot when we are tight-looped scanning over a large number of interior deletes.

This PR provides a 2x improvement on the 1000-filter LoadNextMsgMulti unit test and provides a 10x improvement when tested against a store that has extremely sparse messages and 8 consumer filters.

Signed-off-by: Neil Twigg [email protected]

derekcollison

In general LGTM, but when we hit a dmap entry, before we would update load ts. So maybe we should set a boolean when we match dmap and on exit update that just once?

derekcollison

LGTM

This avoids a potentially expensive linear walk if it is obvious that the block FSS contains less subjects than sequences that we would otherwise have to scan. Signed-off-by: Neil Twigg <[email protected]>

…alls Signed-off-by: Neil Twigg <[email protected]>

This updates `generatePerSubjectInfo`, `NumPending` and `NumPendingMulti` to avoid updating the last load timestamp in a tight-loop while skipping over a potentially large number of interior deletes, as this becomes noticeable in the CPU profile. Similar to one of the changes in #6448. Signed-off-by: Neil Twigg <[email protected]>

Includes the following: - #6406 - #6412 - #6408 - #6416 - #6425 - #6424 - #6438 - #6439 - #6446 - #6447 - #6448 - #6449 - #6450 - #6451 - #6452 - #6453 - #6456 - #6458 - #6457 - #6459 - #6460 - #6461 Signed-off-by: Neil Twigg <[email protected]>

neilalexander requested a review from a team as a code owner February 4, 2025 13:23

derekcollison reviewed Feb 4, 2025

View reviewed changes

neilalexander force-pushed the neil/firstmatchingmulti branch 2 times, most recently from 46d73c5 to 2a93e6c Compare February 4, 2025 14:01

derekcollison approved these changes Feb 4, 2025

View reviewed changes

neilalexander added 2 commits February 4, 2025 14:03

Optimise firstMatchingMulti using subject tree intersection

5282516

This avoids a potentially expensive linear walk if it is obvious that the block FSS contains less subjects than sequences that we would otherwise have to scan. Signed-off-by: Neil Twigg <[email protected]>

Check dmap before calling cacheLookup to avoid unnecessary time c…

5499017

…alls Signed-off-by: Neil Twigg <[email protected]>

neilalexander force-pushed the neil/firstmatchingmulti branch from 2a93e6c to 5499017 Compare February 4, 2025 14:03

derekcollison merged commit be97bc9 into main Feb 4, 2025
5 checks passed

derekcollison deleted the neil/firstmatchingmulti branch February 4, 2025 14:34

neilalexander mentioned this pull request Feb 4, 2025

Avoid last load timestamp update when ranging interior deletes #6450

Merged

neilalexander mentioned this pull request Feb 5, 2025

Cherry-picks for 2.10.26-RC.1 #6462

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimisations for `LoadNextMsgMulti` #6448

Optimisations for `LoadNextMsgMulti` #6448

neilalexander commented Feb 4, 2025 •

edited

Loading

derekcollison left a comment

derekcollison left a comment

Optimisations for LoadNextMsgMulti #6448

Optimisations for LoadNextMsgMulti #6448

Conversation

neilalexander commented Feb 4, 2025 • edited Loading

derekcollison left a comment

Choose a reason for hiding this comment

derekcollison left a comment

Choose a reason for hiding this comment

Optimisations for `LoadNextMsgMulti` #6448

Optimisations for `LoadNextMsgMulti` #6448

neilalexander commented Feb 4, 2025 •

edited

Loading