RFC: Add `norm(A, p; dims)` #43459

mcabbott · 2021-12-18T04:24:20Z

Closes JuliaLang/LinearAlgebra.jl#697. RFC, I guess?

One motivation is that this can be faster than making slices:

julia> @btime map(norm, eachrow($(rand(100, 100))));
  12.625 μs (1 allocation: 896 bytes)

julia> @btime norm($(rand(100, 100)), 2, dims=2);  # with PR
  1.771 μs (5 allocations: 976 bytes)
  1.650 μs (1 allocation: 896 bytes)

Here, "5 allocations" is sometimes 1, as you'd expect. I don't know why; it seems to depend on load order? I think there was an issue about this, which I can't find again.

The 0,1,Inf norm implementations are trivial.
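As a rough illustration of why these cases are simple (a sketch using only public reductions, not the PR's code, and using `abs` where the PR applies `norm` elementwise), each of them reduces to a single `dims` reduction over a plain numeric matrix:

```julia
using LinearAlgebra

A = [1.0 -2.0 0.0;
     0.0  3.0 4.0]

# Row-wise norms: dims=2 reduces over columns, leaving one value per row.
n0   = count(!iszero, A; dims=2)   # "0-norm": number of nonzero entries
n1   = sum(abs, A; dims=2)         # 1-norm: sum of absolute values
ninf = maximum(abs, A; dims=2)     # Inf-norm: largest absolute value
nmin = minimum(abs, A; dims=2)     # -Inf-norm: smallest absolute value

# Each agrees with applying the existing `norm` slice by slice.
rows = collect(eachrow(A))
@assert vec(n0)   == map(r -> norm(r, 0), rows)
@assert vec(n1)   == map(r -> norm(r, 1), rows)
@assert vec(ninf) == map(r -> norm(r, Inf), rows)
@assert vec(nmin) == map(r -> norm(r, -Inf), rows)
```

The results keep the reduced dimension (here they are 2×1 matrices), which is what makes broadcasting against `A` convenient.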

For the 2-norm, I initially made it check whether values in A are in the goldilocks zone, as norm(A) does. But this check alone takes longer than the happy path of sum(abs2, A; dims). Instead, the present PR does that and then checks the answer, and re-does any slices which are dangerous. I hope this is correct. I presume the typical case would not have many such zero/Inf answers, in which case this will be fast. It's a bit awkward that eachslice doesn't do what is needed here, so instead I taped something together, trying to make the common cases fast. It is a bit ugly but is there a better way?
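The check-afterwards idea can be sketched roughly as follows. This is not the PR's implementation: `norm2_dims_sketch` is a hypothetical name, it handles only a `Matrix` of floats with `dims` equal to 1 or 2, and it re-does suspicious slices with the existing `norm` on a view.

```julia
using LinearAlgebra

# Hypothetical helper (an assumption for illustration, not the PR's code).
function norm2_dims_sketch(A::Matrix{<:AbstractFloat}; dims::Integer)
    dims in (1, 2) || throw(ArgumentError("this sketch only supports dims = 1 or 2"))
    # Happy path: one reduction over the whole array, no rescaling.
    out = sqrt.(sum(abs2, A; dims=dims))
    # Check afterwards: a zero or non-finite result may come from
    # underflow or overflow in abs2, so recompute just that slice.
    for i in eachindex(out)
        if iszero(out[i]) || !isfinite(out[i])
            slice = dims == 1 ? view(A, :, i) : view(A, i, :)
            out[i] = norm(slice)  # existing implementation, which rescales
        end
    end
    return out
end

A = [1e200  1e200;    # abs2 overflows to Inf
     3.0    4.0;      # ordinary values
     1e-200 1e-200;   # abs2 underflows to 0.0
     0.0    0.0]      # genuinely zero

naive = sqrt.(sum(abs2, A; dims=2))
@assert naive[1] == Inf && naive[3] == 0.0   # the happy path alone goes wrong here

fixed = norm2_dims_sketch(A; dims=2)
@assert all(vec(fixed) .≈ map(norm, collect(eachrow(A))))
```

Only rows 1, 3 and 4 take the slow path in this example; row 2 is never revisited, which is where the speed would come from when few slices are flagged.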

The p-norm is much slower, ~~and for now it just slices. It could probably be done the same way, though.~~ [Edit: now works like 2-norm]

oscardssmith · 2021-12-22T15:21:26Z

I don't like this (because I don't like dims arguments in general), but this seems reasonable to have.

mcabbott · 2021-12-22T18:15:37Z

Do you think that checking for 0 & Inf afterwards is sufficient to catch all the floating point problems that the existing implementation catches?

The existing one checks beforehand, but doing that on all N^2 entries of the matrix seems to take longer than the entire operation. Whereas checking the N results afterwards is usually quick. At least when not too many are 0/Inf.

oscardssmith · 2021-12-22T18:19:44Z

That is sufficient (and a much better idea).

oscardssmith · 2021-12-28T16:21:52Z

Can you separate out

> the present PR does that and then checks the answer, and re-does any slices which are dangerous.

into its own PR? I think that is an easy win, while the new method probably needs triage. Alternatively, would you mind if I rewrote #43256 to use this?

mcabbott · 2021-12-28T16:28:58Z

I wondered about using the same check-afterwards idea for the complete norm. I can have a go but if you beat me to it that's even better. (Did not see #43256.)

But I don't think such a change need alter this PR. They would share the idea but not share code for it, I think. Unless you are proposing that norm(::Matrix) should apply this idea chunk-wise?

mcabbott · 2021-12-28T16:32:31Z

stdlib/LinearAlgebra/src/generic.jl

+norm0_dims!(B, A) = count!(!iszero, B, A)
+norm1_dims!(B, A) = Base.mapreducedim!(norm, +, B, A)
+normInf_dims!(B, A) = Base.mapreducedim!(norm, max, B, A)
+normMinusInf_dims!(B, A) = Base.mapreducedim!(norm, min, B, A)


BTW the reason these are all mutating B is to work around #43461. With things like mapreduce(norm, max, A; dims) instead, some were not type-stable. There's a PR to fix that, though.

oscardssmith · 2022-01-06T19:58:38Z

Triage would rather not add new dims arguments, and would instead use #32310 (or similar) to make eachslice faster.

mcabbott · 2022-01-06T20:28:15Z

What's slow isn't eachslice, though. It's that performing the calculation slice-by-slice is slow, especially when the direction is cache-unfriendly. (The eachrow example above does not have eachslice's type-stability issues.)

oscardssmith · 2022-01-06T20:42:59Z

With the SlicedArray type, couldn't this PR be implemented for norm(::Slices, p) though?

mcabbott · 2022-01-06T20:57:59Z

No, norm regards nested arrays as a bag of numbers, norm([[[1,2],3],4]) ≈ norm([1,2,3,4]), so I don't think the meaning of norm(::Slices) could differ from norm(collect(::Slices)).
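A small check of this "bag of numbers" behaviour, as a sketch with made-up values. It uses a plain vector of row views in place of a `Slices` object, so it does not depend on the Julia version:

```julia
using LinearAlgebra

# A vector of vectors has the same norm as the flattened numbers.
nested = [[1, 2], [3, 4]]
@assert norm(nested) ≈ norm([1, 2, 3, 4])

# Likewise a collection of row slices has the same norm as the whole matrix,
# so `norm` of the slices cannot also mean "one norm per slice".
A = [1.0 2.0; 3.0 4.0]
rows = collect(eachrow(A))
@assert norm(rows) ≈ norm(A)

# Per-slice norms have to be requested explicitly, e.g. with map.
per_row = map(norm, rows)
@assert per_row ≈ [sqrt(5.0), 5.0]
```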

It would be possible to overload norm.(::Slices). The two tricky things there are that sum.(eachslice(rand(2,3); dims=1)) has the opposite convention for what dims means to sum's, and that you have to decide whether this should allocate the result immediately or fuse with further operations. That seems like a bigger design decision.

EDIT:

map(::typeof(norm), ::Slices) avoids the fusion question. But A ./ map(Fix2(norm, 1), eachslice(A; dims=2, drop=false)) is quite a mouthful.
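The difference in `dims` conventions mentioned above can be seen on a small matrix. This is a sketch with made-up values, and the column normalisation at the end uses `permutedims` instead of `drop=false` so that it does not depend on the newer `eachslice`:

```julia
using LinearAlgebra

A = [1 2 3;
     4 5 6]

# eachslice(A; dims=1) iterates along dims=1, giving one slice per row...
per_row = sum.(eachslice(A; dims=1))
@assert per_row == [6, 15]

# ...whereas sum needs dims=2 (the dimension being summed over) for the same numbers.
@assert per_row == vec(sum(A; dims=2))
@assert vec(sum(A; dims=1)) == [5, 7, 9]

# Normalising each column by its 1-norm, slice by slice versus with a dims reduction.
B = Float64.(A)
col_norms = map(c -> norm(c, 1), collect(eachcol(B)))
@assert B ./ permutedims(col_norms) ≈ B ./ sum(abs, B; dims=1)
```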

All of these have a bit of the problem that reduce(vcat, xs) has -- by magically upgrading a function which already works (but slowly) we are left guessing as to whether a given use is actually going to hit the magic fast path, or not. Whereas right now, the existence of a dims method is evidence of the existence of a special path.

There's also a problem of return types. map(norm, eachcol(A::CuArray)) isa Vector right now. If a magic fast path existed, it would want to make a CuArray, and this is what you want for uses like A ./ norm(A; dims=2). There are very few uses where you could equally accept an Array or (as a magic optimisation) a CuArray. That's an issue for all proposals like sum.(eachcol(A)) too.

mcabbott · 2022-11-22T01:57:48Z

Today I discovered that 1.9 has sortperm(randn(3,5), dims=1), and it was useful.

Rebasing this PR & timing it on 1.10-, which includes the new eachslice of #32310, the benefit is still pretty similar:

julia> @btime mapslices(norm, $(rand(100, 100)), dims=2);  # re-written for 1.9, #40996
  30.375 μs (15 allocations: 2.12 KiB)

julia> @btime map(norm, eachrow($(rand(100, 100))));  # with JuliaLang/julia#32310
  22.000 μs (1 allocation: 896 bytes)

julia> @btime norm($(rand(100, 100)), 2, dims=2);  # with PR
  2.949 μs (5 allocations: 976 bytes)

ararslan added the linear algebra label Dec 18, 2021

oscardssmith added the linalg triage, triage, and needs tests labels Dec 28, 2021


oscardssmith mentioned this pull request Dec 28, 2021

faster and simpler generic_norm2 #43256

Open

oscardssmith removed the triage and linalg triage labels Jan 6, 2022

mcabbott force-pushed the norm_dims branch from c6bc594 to 93348f0 May 31, 2022 16:05

mcabbott force-pushed the norm_dims branch from 93348f0 to 20cf533 November 22, 2022 01:45

mcabbott force-pushed the norm_dims branch from 20cf533 to d811e73 November 22, 2022 01:58

mcabbott force-pushed the norm_dims branch from dfa9d9d to 33d8b7f March 31, 2023 02:36

mcabbott added 7 commits February 6, 2024 21:51

norm(A, p; dims)

f623bad

normalize(A, p; dims)

8c3df94

typo, help

34a4d94

add p-norm

0185355

doc updates

9e0b671

doc updates to 1.10

e9e1356

add some tests, incomplete

4e061bb

mcabbott force-pushed the norm_dims branch from 33d8b7f to 4e061bb February 7, 2024 02:54
