[WIP] faster reductions #659

mateuszbaran · 2019-09-20T08:40:18Z

I think this is the simplest possible solution for #540 . This essentially avoids passing around keyword arguments in some common use cases. I'll change other reduce-like functions if you think it's a good solution.

coveralls · 2019-09-20T09:27:24Z

Coverage decreased (-0.05%) to 81.535% when pulling d3623b9 on mateuszbaran:mbaran/faster-reduction into 935dc85 on JuliaArrays:master.

c42f

I'll change other reduce-like functions if you think it's a good solution.

Please do, this looks like a clean solution 👍

c42f · 2019-09-21T04:15:06Z

src/mapreduce.jl

-@inline reduce(op, a::StaticArray; kw...) = mapreduce(identity, op, a; kw...)
+@inline reduce(op, a::StaticArray; dims=:, kw...) = _reduce(op, a, dims, kw.data)
+
+@inline _reduce(op, a::StaticArray, dims=:, kw::NamedTuple=NamedTuple()) = _mapreduce(identity, op, dims, kw, Size(a), a)


I guess this reduce doesn't need the dims=: default because it's internal?

mateuszbaran · 2019-09-23T10:27:27Z

Variants with default value of dims work nicely now but when its specified it's still hopelessly slow. I'll investigate it.

mateuszbaran · 2019-09-24T10:28:00Z

OK, the problem is that if you call for example any(m, dims=1) constant keyword argument dims is not properly propagated and serious type instability happens. It's still fast when using any(m, dims=Val(1)). AFAICT any(m, dims=1) can't easily be made fast in general but it's possible to hardcode a few dims like 1 and 2 in _mapreduce(f, op, D::Int, nt::NamedTuple, sz::Size{S}, a::StaticArray) to make them fast. Do you care about it?

c42f · 2019-09-25T05:17:57Z

it's possible to hardcode a few dims like 1 and 2 in _mapreduce

You mean a pattern something like

if D == 1
    _mapreduce(f, op, Val(1), nt, sz, a)
elseif D == 2
    _mapreduce(f, op, Val(2), nt, sz, a)
...

?

If that's what it takes to avoid a massive and surprising performance cliff I think it's worthwhile (just add a comment to explain the hack).

mateuszbaran · 2019-09-25T06:57:38Z

Yes, exactly that. I'll make the change then.

mateuszbaran · 2019-09-26T08:28:57Z

I've changed that part, although I'm not completely sure why it makes such a difference since that method of _mapreduce still isn't type stable.

julia> m = SArray{Tuple{3,3,3,3}}(rand(Bool, 3,3,3,3));

julia> using BenchmarkTools

julia> f1(x) = any(x, dims=1)
f1 (generic function with 1 method)

julia> f4(x) = any(x, dims=4)
f4 (generic function with 1 method)

julia> @benchmark f1($m)
BenchmarkTools.Trial: 
  memory estimate:  48 bytes
  allocs estimate:  1
  --------------
  minimum time:     39.845 ns (0.00% GC)
  median time:      40.340 ns (0.00% GC)
  mean time:        51.146 ns (17.98% GC)
  maximum time:     53.062 μs (99.91% GC)
  --------------
  samples:          10000
  evals/sample:     991

julia> @benchmark f4($m)
BenchmarkTools.Trial: 
  memory estimate:  160 bytes
  allocs estimate:  3
  --------------
  minimum time:     4.566 μs (0.00% GC)
  median time:      4.687 μs (0.00% GC)
  mean time:        4.720 μs (0.00% GC)
  maximum time:     12.804 μs (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     7

c42f · 2019-09-26T08:54:58Z

Yes that seems a bit mysterious.

It looks like (maybe) Core.kwfunc(Main.any) doesn't get the inline meta inherited from its body, so that would be something to fix in Base I guess.

KristofferC · 2019-09-26T08:57:49Z

Feels a bit like JuliaLang/julia#30411.

c42f · 2019-09-26T10:19:37Z

Yes, that seems likely to be the same underlying issue: lowering producing some extra method definitions which don't respect whatever Expr(:meta) are attached in the surface syntax.

[WIP] faster reductions

e8bdcca

c42f reviewed Sep 21, 2019

View reviewed changes

faster reductions (completed)

c7b81d6

c42f added bugfix performance runtime performance labels Sep 25, 2019

Faster reductions where dim is specified as a number instead of Val

d3623b9

c42f merged commit ae78e52 into JuliaArrays:master Sep 26, 2019

c42f mentioned this pull request Sep 26, 2019

MArray is slower than Array in reduction #540

Closed

mateuszbaran mentioned this pull request Sep 26, 2019

Regression of reduce (formerly reducedim) in Julia 0.7 #498

Open

c42f mentioned this pull request Sep 27, 2019

Functions with default arguments with @boundscheck can be confusing JuliaLang/julia#30411

Open

KristofferC mentioned this pull request Jan 26, 2020

kwfunc drops the nospecialize annotation JuliaLang/julia#34516

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] faster reductions #659

[WIP] faster reductions #659

mateuszbaran commented Sep 20, 2019

coveralls commented Sep 20, 2019 •

edited

Loading

c42f left a comment

c42f Sep 21, 2019

mateuszbaran commented Sep 23, 2019

mateuszbaran commented Sep 24, 2019

c42f commented Sep 25, 2019

mateuszbaran commented Sep 25, 2019

mateuszbaran commented Sep 26, 2019

c42f commented Sep 26, 2019

KristofferC commented Sep 26, 2019

c42f commented Sep 26, 2019

[WIP] faster reductions #659

[WIP] faster reductions #659

Conversation

mateuszbaran commented Sep 20, 2019

coveralls commented Sep 20, 2019 • edited Loading

c42f left a comment

Choose a reason for hiding this comment

c42f Sep 21, 2019

Choose a reason for hiding this comment

mateuszbaran commented Sep 23, 2019

mateuszbaran commented Sep 24, 2019

c42f commented Sep 25, 2019

mateuszbaran commented Sep 25, 2019

mateuszbaran commented Sep 26, 2019

c42f commented Sep 26, 2019

KristofferC commented Sep 26, 2019

c42f commented Sep 26, 2019

coveralls commented Sep 20, 2019 •

edited

Loading