Use a priority queue for the sparse secondary iteration #49

LTLA · 2023-05-10T18:30:04Z

This can replace the current_indices store, and means that each secondary iteration doesn't have to scan across the entire extent. Only should be filled if an oracle is provided and the next element is greater than the current element.

Probably will need another field to specify the last element at which the queue was updated. Might even have to use make_heap directly to allow us to clear the priority queue quickly when a reverse order is used.

The text was updated successfully, but these errors were encountered:

LTLA · 2023-05-18T23:43:00Z

This is... much more complex than I thought, especially to handle both increments and decrements.

The real problem is that I'm not even sure it's going to be faster.

The heap takes Nz * log(Nz) time to depopulate and fill, given Nz non-zero elements in a row.
If we do any jumps, we need another Nz * log(Nz) to sort the non-zeros as they don't come out of the heap in order.
If we do a long jump, then the whole thing collapses to N * log(N) anyway, as the heap information is useless.

If Nz isn't much smaller than N (the number of columns), then we're going to see a performance degradation.

Besides, pure iteration is pretty fast if there's no action involved other than to press continue. This is especially true for the current_indices vector that can be iterated contiguously in memory, and even more so if the branch predictor guesses that action is generally not required for very sparse rows.

So it might be faster to just proceed as we are doing now, with one simple compromise; we can keep track of the next-closest index to the current position, allowing us to quickly skip all-zero rows.

LTLA · 2023-05-21T06:59:17Z

Closed by #51.

LTLA closed this as completed May 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use a priority queue for the sparse secondary iteration #49

Use a priority queue for the sparse secondary iteration #49

LTLA commented May 10, 2023 •

edited

Loading

LTLA commented May 18, 2023 •

edited

Loading

LTLA commented May 21, 2023

Use a priority queue for the sparse secondary iteration #49

Use a priority queue for the sparse secondary iteration #49

Comments

LTLA commented May 10, 2023 • edited Loading

LTLA commented May 18, 2023 • edited Loading

LTLA commented May 21, 2023

LTLA commented May 10, 2023 •

edited

Loading

LTLA commented May 18, 2023 •

edited

Loading