Make OffsetArrays a stdlib #45585

LilithHafner · 2022-06-04T23:50:04Z

I think that it would be nice to have OffsetArrays as a stdlib.

My particular use case is sorting. It is sometimes necessary, or at least helpful for performance, to have a workspace AbstractVector with the same indices as the input or as the region of the input that is to be sorted. For example, radix sort copies the whole vector back and forth, each pass leaving it slightly closer to sorted. A new, faster QuickSort would do the same. For the sorting code to have any chance of depending on OffsetArrays, OffsetArrays must be a stdlib.

It would be nice to have a unified way of reusing workspaces (e.g. a Vector{UInt8} that sorting methods resize! and reinterpret as needed). When the input is an AbstractVector with offset indices, we could simply construct a workspace with OffsetVector(reinterpret(eltype(input), workspace), firstindex(input)-1).
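The byte-buffer half of this idea can be sketched with Base alone. This is a minimal illustration, not the actual sorting code: the helper name `scratch_for` is hypothetical, and it assumes the element type is an isbits type so that `reinterpret` applies. Wrapping the result in an OffsetVector, as described above, would be the remaining step.

```julia
# Hypothetical helper (not part of Base): reuse one byte buffer as scratch
# space for `n` elements of type T.
function scratch_for(workspace::Vector{UInt8}, ::Type{T}, n::Integer) where {T}
    isbitstype(T) || throw(ArgumentError("T must be an isbits type to reinterpret"))
    resize!(workspace, n * sizeof(T))   # grow or shrink the shared buffer in place
    return reinterpret(T, workspace)    # lazy wrapper over the same bytes, eltype T
end

workspace = UInt8[]                     # one buffer, reusable across calls
input = [3.0, 1.0, 2.0]

scratch = scratch_for(workspace, eltype(input), length(input))
copyto!(scratch, input)                 # writes go through to `workspace`
```

The returned array is 1-based, which is why an offset wrapper (or a 1-based view of the input) is still needed when the input has other indices.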

Without OffsetArrays, this becomes less elegant. Consider, for example, the case when the input is an OffsetArray with negative firstindex. We would need to sort a view into the input that discards the offset indices. This is problematic for two reasons:

1. It is more elegant to support arbitrary indexing wherever possible rather than explicitly convert to one-based indexing.
2. Views have non-negligible overhead in some of these cases.

similar is not sufficient because the workspace must support resizing, should be reinterpretable from a Vector{UInt8}, and should support indices other than the indices of the input (e.g. MergeSort only needs half the size, sorting a Matrix needs much less).


LilithHafner · 2022-06-04T23:59:09Z

Pairs with #45584 to enable #45570 and #45222

N5N3 · 2022-06-05T02:24:45Z

Can't we just add a simple wrapper in sort.jl that transforms the input OffsetVector into a 1-based one? (We only need to cache the firstindex.)

LilithHafner · 2022-06-05T10:59:07Z

Yes. We can do that with view(input, firstindex(input):lastindex(input)). I'll pursue that and report back if I run into performance overhead from those views.
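A small sketch of that approach, assuming the external OffsetArrays package is installed (it is not a stdlib, which is the point of this issue). The variable names are illustrative only.

```julia
using OffsetArrays   # external package, assumed to be available

# A vector whose indices run from -1 to 1 instead of 1 to 3.
input = OffsetVector([3, 1, 2], -2)

# Indexing with a plain UnitRange yields a view with 1-based indices
# that still shares memory with `input`.
v = view(input, firstindex(input):lastindex(input))

sort!(v)             # sorts the underlying data in place through the view
```

After the call, `input` keeps its original offset indices and holds the sorted values; only the sorting routine saw the 1-based view.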

LilithHafner · 2023-09-18T18:05:34Z

Ick. I don't like stdlibs anymore.

LilithHafner added the stdlib (Julia's standard library) label Jun 4, 2022

LilithHafner closed this as not planned Sep 18, 2023
