Fix-ups for sorting workspace/buffer (#45330) #45570

LilithHafner · 2022-06-03T13:24:00Z

Fix radix_sort! for negative firstindex (Fixes Bug in sorting offset vectors #45568)
Actually use a workspace vector for sort!(A::AbstractArray; dims)
Preallocate generously right away for sort[!](A::AbstractArray; dims). Eliminating t = similar(A::AbstractArray; 0); ... resize!(t, len) is motivated by correctness concerns: similar returns an AbstractVector and resize! is defined for Vectors. I'd rather allocate a wee bit more than have this broken edge case. If radix sort is eventually used, this is the perfect size. If merge sort, it is twice as large as required, but half as many allocations, and it doesn't matter much because it is only a single slice of the array. If quicksort pre Stabilize, optimize, and increase robustness of QuickSort #45222 or insertion sort is used the allocation could be empty, but I have not observed a substantive performance cost associated with a slightly too large allocation in this case.
handle workspaces in an unconventional indexing aware manner
minor comments and style changes

LilithHafner · 2022-06-03T13:24:40Z

base/sort.jl

@@ -683,7 +683,7 @@ function radix_sort!(v::AbstractVector{U}, lo::Integer, hi::Integer, bits::Unsig
                     t::AbstractVector{U}, chunk_size=radix_chunk_size_heuristic(lo, hi, bits)) where U <: Unsigned
    # bits is unsigned for performance reasons.
    mask = UInt(1) << chunk_size - 1
-    counts = Vector{UInt}(undef, mask+2)
+    counts = Vector{Int}(undef, mask+2)


lo may be negative and counts is also used to store offsets.

LilithHafner · 2022-06-04T13:46:18Z

To do workspace management right requires access to OffsetArrays. Hold pending that.

LilithHafner · 2022-06-05T12:29:21Z

We can have only one-based indexed workspaces so long as we convert inputs to one-based indexing to match. Thanks, @N5N3 for encouraging me to pursue this approach. Unfortunately, there is a runtime penalty for this conversion, so we only do it where necessary. This results in a somewhat inelegant solution but does work reasonably well and avoids runtime penalties most of the time.

It would still be nice to be able to construct OffsetVectors when handling offset vectors as input, but they are not necessary.

LilithHafner · 2022-06-06T13:08:05Z

All tests passed!!! That's better than master (and totally unrelated to this PR)

…so minor style changes and fixups from JuliaLang#45596 and local review.

LilithHafner · 2022-06-12T15:24:33Z

base/sort.jl

+    if t !== nothing && checkbounds(Bool, t, lo:hi) # Fully preallocated and aligned workspace
+        u2 = radix_sort!(u, lo, hi, bits, reinterpret(U, t))
+        uint_unmap!(v, u2, lo, hi, o, u_min)
+    elseif t !== nothing && (applicable(resize!, t) || length(t) >= hi-lo+1) # Viable workspace


This branch is triggered in the case of sort(::OffsetMatrix; dims)

LilithHafner · 2022-06-12T15:28:40Z

test/sorting.jl

@@ -842,5 +841,6 @@ end
        end
    end
 end
+# The "searchsorted" testset is at the end of the file because it is slow.


I added these comments because by default I put new tests at the end of the file and these comments remind me not to. (see #45233, #45234)

oscardssmith · 2022-06-15T22:15:09Z

overall, this looks good to me.

LilithHafner · 2022-06-15T22:34:20Z

Thanks! Any next steps you'd like to see from me?

oscardssmith · 2022-06-16T04:30:55Z

I don't see any necessary changes (but you know this part of the code better than I do)

LilithHafner · 2022-06-16T15:12:52Z

It looks good to me too.

DilumAluthge · 2022-06-16T18:18:54Z

FYI this shouldn't have been merged with a failing whitespace CI check.

DilumAluthge · 2022-06-16T18:19:26Z

Whitespace check found 2 issues:
base/sort.jl:608 -- trailing whitespace
base/sort.jl:1563 -- trailing whitespace
make: *** [Makefile:104: check-whitespace] Error 1

DilumAluthge · 2022-06-16T18:22:30Z

#45713

LilithHafner · 2022-06-16T18:29:59Z

Sorry about that! Thanks for fixing it. I didn't check because I've been desensitized to one or two failed CI runs, but I'll make sure to check in the future.

Is Win32 the only top-level check that's allowed to fail now?

DilumAluthge · 2022-06-16T18:31:38Z

Win32 shouldn't be failing.

If a check fails, I recommend re-running it to make sure the failure isn't related to the PR.

I also recommend checking the logs of failing jobs. Sometimes the logs alone can tell you whether or not the failure is related to your PR.

In this cases, win32 failed due to an OOM in the Profile tests, so it probably wasn't related to the PR.

But it never hurts to rerun the failing job just to make sure.

LilithHafner · 2022-06-16T19:40:00Z

Thanks!

LilithHafner · 2022-06-17T01:42:21Z

Sorry to bother you again, but how do I rerun a failing job that I suspect is unrelated?

LilithHafner · 2022-06-17T21:51:49Z

Nevermind, I figured it out.

Fix and test sort!(OffsetArray(rand(200), -10))

f06107e

LilithHafner commented Jun 3, 2022

View reviewed changes

LilithHafner marked this pull request as draft June 4, 2022 13:45

LilithHafner mentioned this pull request Jun 4, 2022

Make OffsetArrays a stdlib #45585

Closed

Lilith Hafner added 2 commits June 5, 2022 07:21

Convert to 1-based indexing rather than generalize to arbitrary indexing

62e1fb7

avoid overhead of views where reasonable

07d60b3

LilithHafner force-pushed the fix-negative-offset-array branch from 817e81c to 07d60b3 Compare June 5, 2022 12:26

LilithHafner marked this pull request as ready for review June 5, 2022 12:29

Lilith Hafner added 2 commits June 7, 2022 07:07

style

8b12f5d

handle edge cases better, making the workspace function unhelpful. Al…

bbfce4d

…so minor style changes and fixups from JuliaLang#45596 and local review.

LilithHafner mentioned this pull request Jun 8, 2022

Revise workspace/buffer to be ::UInt8[] instead of ::AbstractVector{eltype(input)} #45596

Closed

LilithHafner changed the title ~~Fix sort! on long negative offset array~~ Fix-ups for #45330 Jun 8, 2022

LilithHafner changed the title ~~Fix-ups for #45330~~ Fix-ups for sorting workspace/buffer (#45330) Jun 8, 2022

LilithHafner requested a review from oscardssmith June 8, 2022 13:56

LilithHafner commented Jun 12, 2022

View reviewed changes

move comments in tests for discoverability

be522f9

LilithHafner commented Jun 12, 2022

View reviewed changes

oscardssmith merged commit 6e79796 into JuliaLang:master Jun 16, 2022

LilithHafner deleted the fix-negative-offset-array branch June 16, 2022 15:34

LilithHafner added the sorting Put things in order label Jul 19, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix-ups for sorting workspace/buffer (#45330) #45570

Fix-ups for sorting workspace/buffer (#45330) #45570

LilithHafner commented Jun 3, 2022 •

edited

Loading

LilithHafner Jun 3, 2022

LilithHafner commented Jun 4, 2022

LilithHafner commented Jun 5, 2022

LilithHafner commented Jun 6, 2022

LilithHafner Jun 12, 2022

LilithHafner Jun 12, 2022

oscardssmith commented Jun 15, 2022

LilithHafner commented Jun 15, 2022

oscardssmith commented Jun 16, 2022

LilithHafner commented Jun 16, 2022

DilumAluthge commented Jun 16, 2022 •

edited

Loading

DilumAluthge commented Jun 16, 2022 •

edited

Loading

DilumAluthge commented Jun 16, 2022

LilithHafner commented Jun 16, 2022

DilumAluthge commented Jun 16, 2022 •

edited

Loading

LilithHafner commented Jun 16, 2022

LilithHafner commented Jun 17, 2022

LilithHafner commented Jun 17, 2022

Fix-ups for sorting workspace/buffer (#45330) #45570

Fix-ups for sorting workspace/buffer (#45330) #45570

Conversation

LilithHafner commented Jun 3, 2022 • edited Loading

LilithHafner Jun 3, 2022

Choose a reason for hiding this comment

LilithHafner commented Jun 4, 2022

LilithHafner commented Jun 5, 2022

LilithHafner commented Jun 6, 2022

LilithHafner Jun 12, 2022

Choose a reason for hiding this comment

LilithHafner Jun 12, 2022

Choose a reason for hiding this comment

oscardssmith commented Jun 15, 2022

LilithHafner commented Jun 15, 2022

oscardssmith commented Jun 16, 2022

LilithHafner commented Jun 16, 2022

DilumAluthge commented Jun 16, 2022 • edited Loading

DilumAluthge commented Jun 16, 2022 • edited Loading

DilumAluthge commented Jun 16, 2022

LilithHafner commented Jun 16, 2022

DilumAluthge commented Jun 16, 2022 • edited Loading

LilithHafner commented Jun 16, 2022

LilithHafner commented Jun 17, 2022

LilithHafner commented Jun 17, 2022

LilithHafner commented Jun 3, 2022 •

edited

Loading

DilumAluthge commented Jun 16, 2022 •

edited

Loading

DilumAluthge commented Jun 16, 2022 •

edited

Loading

DilumAluthge commented Jun 16, 2022 •

edited

Loading