Remove permute!! and invpermute!! #44869

LilithHafner · 2022-04-06T00:11:41Z

I don't think that these algorithms provide an unambiguous performance improvement that justifies their inclusion in base.

Perhaps a more complex dispatch system that looks at length and eltype might be warranted, but I'm in favor of excision and putting fancy and high-efficiency permutations somewhere like Permutations.jl. Here are some benchmarks that shed light on the consequences of this PR, green is an improvement and red is a regression:

Roguhly, I'm benchmarking [inv]permute!(Vector{NTuple{width, Int}}(undef, length), shuffle!(collect(1:length)))

Benchmark code

using Random, Plots
_permute!(a, p::AbstractVector) = copyto!(a, a[p])
_invpermute!(a, p::AbstractVector) = copyto!(a, a[invperm(p)])

function t(::Type{T}, n, fs...) where T
    m = max(1, 10^6 ÷ max(10, n * sizeof(T)))
    A = [Vector{T}(undef, n) for _ in 1:m]
    P = [shuffle!(collect(1:n)) for _ in 1:m]
    [@elapsed(for (a,p) in zip(A,P) 
        f(a,p)
    end) for f in fs]
    [@elapsed(for (a,p) in zip(A,P) 
        f(a,p)
    end) for f in fs]
end

function compute()
    global fs = permute!,_permute!,invpermute!,_invpermute!
    global data = [t(NTuple{x,Int},2^y,fs...) 
        for x in 1:15, y in 1:20]
end
function plot()
    for (i,f) in collect(enumerate(fs))[[1,3]]
        display(heatmap(
            log10.(getindex.(data, i+1) ./ getindex.(data, i)),
            c=cgrad([:black, :green, :white, :red, :black]),
            xlabel="log2(length)", ylabel="width",
            title="log10(_$f / $f)",
            clims=(-2,2)))
    end
end

StefanKarpinski · 2022-04-06T14:21:33Z

Ah, I had marked this as a 2.0 change because I thought these were exported but they aren't so we could remove them in a minor release.

stevengj · 2022-04-09T19:24:03Z

Note that in practice, there are several packages using Base.permute!!: DataArrays, StructArrays, DataFrames, IndexedTables, PooledArrays, … so this change will be breaking.

LilithHafner · 2022-04-11T14:07:28Z

Good catch, @stevengj.

Going through all the packages that use Base.permute!! or Base.ipermute!!

DataArrays has been deprecated since julia 1.0
DataFrames no longer uses Base.permute!! in its master branch, it now uses a much faster approach.
PooledArrays only uses Base.permute!! to define Base.permute!! on the PooledArray type.

Some packages would need to switch from Base.permute!! to permute!. All of these packages use Base.permute!! instead of permute! only for performance reasons but may experience a speedup switching to permute! after this PR compared to the current Base.permute!!.

Clustering would almost certainly see a speedup as the eltype it's permuting is Int.
IndexdTables could be much (perhaps 2x) faster if it used a similar approach as DataFrames instead of Base.permute!!, whether it sees a speedup depends on user eltype size and length.
Qaintessent may see a minor regression.
StructArrays uses Base.permute!! on an eltype of references, so I suspect it would also be faster after this PR to simply call permute!.

If folks want to go through with this PR, I can make the necessary PRs to those packages.

oscardssmith · 2022-04-11T14:48:21Z

If these PRs are expected to be performance wins, it sounds like they should be made whether we want this PR or not.

LilithHafner · 2022-04-11T15:40:08Z

The performance wins are of the form "permute! after PR is better than Base.permute!! which is better than permute! before PR", so without this PR they would have to switch to copyto!(v, v[p]) rather than permute!(v, p), that aside, yes.

oscardssmith · 2022-04-11T15:51:39Z

In that case, I think the order should be

fix performance of permute! / invpermute!.
make PRs to packages.
decide whether we want to deprecate permute!!

LilithHafner · 2022-04-11T16:57:06Z

Step 1 is ready for review.

LilithHafner · 2022-07-06T00:25:46Z

Waiting for the release of 1.9 to begin step 2.

alyst · 2022-09-08T22:17:22Z

[inv]permute!!(v, p) doesn't allocate the new array, so while the algorithm itself might be slower than copyto!(v, v[p]), in certain contexts the GC overhead of permute!() may result in lower overall performance.

LilithHafner · 2023-05-14T18:26:54Z

The benchmarks in the OP do not exclude GC overhead.

alyst · 2023-05-14T19:17:58Z

The benchmarks in the OP do not exclude GC overhead.

They do not, but IIUC they don't test for cases where v and/or p are reused 1000x times.

LilithHafner · 2023-05-14T22:18:26Z

I'm not going to reproduce the above figure with 1000x iterations because that would take to long. Here's a single point from the figure in the OP, benchmarked while reusing both parameters 1000x times. This result is consistent with the figures in the OP.

julia> x2 = rand(1000); perm = Vector{Int}(undef, 1000); perm2 = Vector{Int}(undef, 1000); @benchmark Base.permute!!($x2, copyto!($perm2, perm)) setup=(rand!($x2); randperm!($perm)) gcsample=false gctrial=false evals=1000 samples=100
BenchmarkTools.Trial: 100 samples with 1000 evaluations.
 Range (min … max):  2.932 μs …   4.214 μs  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     3.284 μs               ┊ GC (median):    0.00%
 Time  (mean ± σ):   3.293 μs ± 215.969 ns  ┊ GC (mean ± σ):  0.00% ± 0.00%

       ▃▃ ▃  ▃▁ ▁ ▁▁▁  ▆█▆▆ ▃▆▃▆                               
  ▄▄▄▁▄██▄█▁▄██▁█▁███▇▁████▇████▇▇▁▄▄▄▇▄▁▁▄▄▁▁▁▁▁▁▄▁▄▄▁▁▁▁▇▁▇ ▄
  2.93 μs         Histogram: frequency by time        3.83 μs <

 Memory estimate: 0 bytes, allocs estimate: 0.

julia> x2 = rand(1000); perm = Vector{Int}(undef, 1000); perm2 = Vector{Int}(undef, 1000); @benchmark Base.permute!($x2, copyto!($perm2, perm)) setup=(rand!($x2); randperm!($perm)) gcsample=false gctrial=false evals=1000 samples=100
BenchmarkTools.Trial: 100 samples with 1000 evaluations.
 Range (min … max):  1.018 μs … 8.594 μs  ┊ GC (min … max):  0.00% … 79.15%
 Time  (median):     1.323 μs             ┊ GC (median):     0.00%
 Time  (mean ± σ):   1.730 μs ± 1.108 μs  ┊ GC (mean ± σ):  22.74% ± 23.38%

    █                                                        
  ▄▄██▄▃▂▂▃▃▁▁▁▁▁▃▁▄▃▂▁▂▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▂▁▁▁▁▁▁▁▁▂ ▂
  1.02 μs        Histogram: frequency by time       6.83 μs <

 Memory estimate: 7.94 KiB, allocs estimate: 1.

stevengj · 2023-05-14T23:39:58Z

Note that these follow-the-cycles algorithms become more of a win when the elements size is large, as in Base.permutecols!! where we are permuting the columns of a matrix (used for sorting eigenvectors).

PallHaraldsson · 2023-08-14T13:05:30Z

If you want to go ahead with this, then I agree it should be dropped ASAP in 1.11/master, early in 1.11 development, in case we need to revert it later.

We shouldn't backport this to 1.10

If you drop this in 1.11 I would argue doing it in 1.10 too before its release, in case it will be LTS. It would be easy to add it back in 1.10.1, but if it stays in then it will be a (practically) breaking change to drop in in a 1.10.x, seemingly bad for an LTS.

LilithHafner · 2023-08-14T15:56:25Z

888 packages failed tests only on the current version.

😢

LilithHafner · 2023-09-08T15:51:27Z

After Rerunning after JuliaRegistries/General#91067
@nanosoldier runtests()

nanosoldier · 2023-09-09T07:54:40Z

The package evaluation job you requested has completed - possible new issues were detected.
The full report is available.

LilithHafner · 2023-09-13T14:10:54Z

@nanosoldier runtests(["PooledArrays"], configuration=(registry="LilithHafner/General#permute-testing", ))

nanosoldier · 2023-09-13T14:32:12Z

The package evaluation job you requested has completed - possible new issues were detected.
The full report is available.

LilithHafner · 2023-09-13T14:42:36Z

@nanosoldier runtests(["PooledArrays"], configuration=(registry="LilithHafner/General#permute-testing", ), vs = ":master")

LilithHafner · 2023-09-13T14:51:43Z

Trying again after fixing a bug in my custom registry setup:
@nanosoldier @nanosoldier runtests(["PooledArrays"], configuration=(registry="LilithHafner/General#permute-testing", ), vs = ":master")

nanosoldier · 2023-09-13T15:05:29Z

The package evaluation job you requested has completed - possible new issues were detected.
The full report is available.

LilithHafner · 2023-09-13T20:08:43Z

tehe, lots of noise and trial and error, but not too much load on the nanosoldier machines as I experiment with this. I think my most recent invocation was ignored because I miscopied it.

@nanosoldier runtests(["PooledArrays"], configuration=(registry="LilithHafner/General#permute-testing", ), vs = ":master")

nanosoldier · 2023-09-14T06:31:25Z

The package evaluation job you requested has completed - possible new issues were detected.
The full report is available.

LilithHafner · 2023-09-17T20:42:35Z

@nanosoldier runtests(["PooledArrays"])

nanosoldier · 2023-09-18T05:36:46Z

The package evaluation job you requested has completed - no new issues were detected.
The full report is available.

LilithHafner · 2023-09-18T13:43:24Z

@nanosoldier runtests()

nanosoldier · 2023-09-18T21:23:01Z

The package evaluation job you requested has completed - possible new issues were detected.
The full report is available.

LilithHafner · 2023-09-18T22:22:12Z

Failures due to folks using old versions of PooledArrays and StructArrays. I'll try again in a while.

LilithHafner · 2023-12-07T14:58:59Z

Eh, whatever. This is good enough. The implementations are gone and the methods are deprecated.

remove permute!! and invpermute!!

aba8cce

LilithHafner added speculative Whether the change will be implemented is speculative performance Must go faster excision Removal of code from Base or the repository labels Apr 6, 2022

Update docstring + alpha-renaming

aae3c3c

StefanKarpinski modified the milestone: 2.0 Apr 6, 2022

This was referenced Apr 11, 2022

Stop using unexported Base.permute!! JuliaStats/Clustering.jl#229

Merged

Stop using permute!! and invpermute!! #44941

Merged

LilithHafner added this to the 1.10 milestone Jul 6, 2022

LilithHafner marked this pull request as draft July 6, 2022 00:24

petvana mentioned this pull request Feb 26, 2023

base.combinatorics: rm dead code: permute!! and permute!! #48797

Closed

LilithHafner removed the speculative Whether the change will be implemented is speculative label May 27, 2023

Merge branch 'master' into permute

90a97c5

LilithHafner marked this pull request as ready for review May 31, 2023 13:51

LilithHafner removed the performance Must go faster label May 31, 2023

oscardssmith added the needs pkgeval Tests for all registered packages should be run with this change label May 31, 2023

LilithHafner modified the milestones: 1.11, 1.12 Aug 30, 2023

LilithHafner mentioned this pull request Sep 5, 2023

Bump version to 0.6.16 JuliaArrays/StructArrays.jl#281

Merged

nalimilan mentioned this pull request Sep 11, 2023

Remove usage of Base internals (permute!!) JuliaData/PooledArrays.jl#87

Merged

LilithHafner mentioned this pull request Sep 13, 2023

Test result uploaded wrong results JuliaCI/Nanosoldier.jl#179

Closed

LilithHafner modified the milestones: 1.11, 1.12 Sep 15, 2023

LilithHafner mentioned this pull request Sep 15, 2023

Deprecate permute!! and invpermute!! #51337

Merged

LilithHafner added 2 commits September 17, 2023 15:21

Merge branch 'master' into permute

e81ce50

Update deprecated.jl

c73c649

LilithHafner closed this Dec 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove permute!! and invpermute!! #44869

Remove permute!! and invpermute!! #44869

LilithHafner commented Apr 6, 2022

StefanKarpinski commented Apr 6, 2022

stevengj commented Apr 9, 2022

LilithHafner commented Apr 11, 2022 •

edited

Loading

oscardssmith commented Apr 11, 2022

LilithHafner commented Apr 11, 2022 •

edited

Loading

oscardssmith commented Apr 11, 2022

LilithHafner commented Apr 11, 2022 •

edited

Loading

LilithHafner commented Jul 6, 2022

alyst commented Sep 8, 2022

LilithHafner commented May 14, 2023

alyst commented May 14, 2023

LilithHafner commented May 14, 2023

stevengj commented May 14, 2023

PallHaraldsson commented Aug 14, 2023

LilithHafner commented Aug 14, 2023

LilithHafner commented Sep 8, 2023

nanosoldier commented Sep 9, 2023

LilithHafner commented Sep 13, 2023

nanosoldier commented Sep 13, 2023

LilithHafner commented Sep 13, 2023

LilithHafner commented Sep 13, 2023

nanosoldier commented Sep 13, 2023

LilithHafner commented Sep 13, 2023

nanosoldier commented Sep 14, 2023

LilithHafner commented Sep 17, 2023

nanosoldier commented Sep 18, 2023

LilithHafner commented Sep 18, 2023

nanosoldier commented Sep 18, 2023

LilithHafner commented Sep 18, 2023

LilithHafner commented Dec 7, 2023

Remove permute!! and invpermute!! #44869

Remove permute!! and invpermute!! #44869

Conversation

LilithHafner commented Apr 6, 2022

StefanKarpinski commented Apr 6, 2022

stevengj commented Apr 9, 2022

LilithHafner commented Apr 11, 2022 • edited Loading

oscardssmith commented Apr 11, 2022

LilithHafner commented Apr 11, 2022 • edited Loading

oscardssmith commented Apr 11, 2022

LilithHafner commented Apr 11, 2022 • edited Loading

LilithHafner commented Jul 6, 2022

alyst commented Sep 8, 2022

LilithHafner commented May 14, 2023

alyst commented May 14, 2023

LilithHafner commented May 14, 2023

stevengj commented May 14, 2023

PallHaraldsson commented Aug 14, 2023

LilithHafner commented Aug 14, 2023

LilithHafner commented Sep 8, 2023

nanosoldier commented Sep 9, 2023

LilithHafner commented Sep 13, 2023

nanosoldier commented Sep 13, 2023

LilithHafner commented Sep 13, 2023

LilithHafner commented Sep 13, 2023

nanosoldier commented Sep 13, 2023

LilithHafner commented Sep 13, 2023

nanosoldier commented Sep 14, 2023

LilithHafner commented Sep 17, 2023

nanosoldier commented Sep 18, 2023

LilithHafner commented Sep 18, 2023

nanosoldier commented Sep 18, 2023

LilithHafner commented Sep 18, 2023

LilithHafner commented Dec 7, 2023

LilithHafner commented Apr 11, 2022 •

edited

Loading

LilithHafner commented Apr 11, 2022 •

edited

Loading

LilithHafner commented Apr 11, 2022 •

edited

Loading