Fix type instability in generic matmul #14722

simonster · 2016-01-19T03:01:07Z

For C::AbstractVecOrMat{R}, A::AbstractVecOrMat{T}, B::AbstractVecOrMat{S}, we were sometimes initializing the intermediate sum as zero(R), but it wouldn't stay that way if R was narrower than T and S.

This promotes the intermediate sums (s and Ctmp in the code) to the promotion of R and zero(T)*zero(S) + zero(T)*zero(S). Thus, for A_mul_B!(C::Matrix{Float32}, A::Matrix{Float64}, B::Matrix{Float64}, the intermediate sums are Float64. The other possibility would be to enforce that the intermediate sums are the same type as R (i.e. Float32 in this example) but it seemed more conservative to potentially use higher precision than lower precision.

Before:

julia> C = Array(Float32, 100, 100);
       A = rand(Float64, 100, 100);
       B = rand(Float64, 100, 100);

julia> @time A_mul_B!(C, A, B);
  0.031763 seconds (2.11 M allocations: 32.154 MB, 9.28% gc time)

After:

julia> @time A_mul_B!(C, A, B);
  0.000870 seconds (10 allocations: 448 bytes)

andreasnoack · 2016-01-19T03:09:03Z

LGTM

simonster · 2016-01-19T03:15:05Z

The RootInt type in one of the tests doesn't have zero defined, so that test is failing. In principle I guess we should be setting up the zero values for the blocked algorithm the same way as we currently do for the naive algorithm.

simonster · 2016-01-19T03:18:16Z

base/linalg/matmul.jl

@@ -524,10 +525,11 @@ function _generic_matmatmul!{T,S,R}(C::AbstractVecOrMat{R}, tA, tB, A::AbstractV
            if tB == 'N'
                for i = 1:mA, j = 1:nB
                    if isempty(A) || isempty(B)
-                        Ctmp = zero(R)
+                        z2 = zero(T)*zero(S) + zero(T)*zero(S)


If A or B is empty, maybe we could just skip the loop entirely and fill!(C, zero(R))?

Yes. I think this check can then be moved outside the transpose branches which will make the code a bit shorter and simpler.

andreasnoack · 2016-01-19T03:28:01Z

In principle I guess we should be setting up the zero values for the blocked algorithm the same way as we currently do for the naive algorithm.

I.e. check for non-emptiness and then use the elements of the arrays instead of the array element types?

tkelman · 2016-01-19T03:36:07Z

can this be tested against via @inferred or looking at the @code_warntype output to prevent it from regressing? cc @timholy on the zero issue

timholy · 2016-01-19T12:49:37Z

On the zero issue, there's no mathematical reason to prevent defining zero(RootInt). OTOH maybe it's a useful test case for such situations.

Also make sure that intermediate values are promoted to the output type

simonster · 2016-06-25T23:42:45Z

Updated to:

Bail out early if A or B is empty and just fill C with zeros, instead of putting a bunch of branches everywhere.
Use zero based on elements themselves rather than types for the blocked case. (I just use the first elements in both arrays, since if they're isbits the types will be the same. If you tried hard enough, I think you could construct a type where this is not mathematically correct because zero does different things for different elements, but then the current code wouldn't work either.)
Add a test that allocation doesn't scale with input size for A_mul_B!(::Matrix{Float32}, ::Matrix{Float64}, ::Matrix{Float64}). It's possibly more fragile than is ideal, but it was the easiest approach I could think of that would catch a regression here.

tkelman · 2016-06-27T23:05:43Z

test/linalg/matmul.jl

+A2 = rand(Float64, 6, 6)
+B2 = rand(Float64, 6, 6)
+A_mul_B!(C1, A1, B1)
+@test @allocated(A_mul_B!(C1, A1, B1)) == @allocated(A_mul_B!(C2, A2, B2))


this fails when inlining is off https://build.julialang.org/builders/coverage_ubuntu14.04-x64/builds/404/steps/Run%20non-inlined%20tests/logs/stdio

Bump! This is still blocking coverage. https://build.julialang.org/builders/coverage_ubuntu14.04-x64/builds/461/steps/Run%20non-inlined%20tests/logs/stdio

Sorry, I totally missed this the first time around. Filed #17327 to revert the test. Will look into adding a benchmark instead tomorrow.

This test apparently breaks with inlining disabled.

Remove test for #14722 (type instability in generic matmul)

In JuliaLang/julia#14722, I fixed an issue where the inner loop was type-unstable. This benchmark ensures that doesn't regress.

This test apparently breaks with inlining disabled.

simonster reviewed Jan 19, 2016
View reviewed changes

Fix type instability in generic matmul

10d4d2c

Also make sure that intermediate values are promoted to the output type

simonster force-pushed the sjk/matmatmul-type-instability branch from f870d93 to 10d4d2c Compare June 25, 2016 21:58

simonster merged commit 6c4429b into master Jun 27, 2016

simonster deleted the sjk/matmatmul-type-instability branch June 27, 2016 16:07

tkelman reviewed Jun 27, 2016
View reviewed changes

simonster added a commit that referenced this pull request Jul 8, 2016

Remove test for #14722 (type instability in generic matmul)

0153e7d

This test apparently breaks with inlining disabled.

simonster added a commit that referenced this pull request Jul 8, 2016

Merge pull request #17327 from JuliaLang/sjk/rm-matmul-test

0d1cee7

Remove test for #14722 (type instability in generic matmul)

simonster mentioned this pull request Jul 8, 2016

Add benchmark for A_mul_B! with different input and output types JuliaCI/BaseBenchmarks.jl#15

Merged

mfasi pushed a commit to mfasi/julia that referenced this pull request Sep 5, 2016

Remove test for JuliaLang#14722 (type instability in generic matmul)

6f50dd2

This test apparently breaks with inlining disabled.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix type instability in generic matmul #14722

Fix type instability in generic matmul #14722

simonster commented Jan 19, 2016

andreasnoack commented Jan 19, 2016

simonster commented Jan 19, 2016

simonster Jan 19, 2016

andreasnoack Jan 19, 2016

andreasnoack commented Jan 19, 2016

tkelman commented Jan 19, 2016

timholy commented Jan 19, 2016

simonster commented Jun 25, 2016

tkelman Jun 27, 2016

tkelman Jul 7, 2016

simonster Jul 8, 2016

Fix type instability in generic matmul #14722

Fix type instability in generic matmul #14722

Conversation

simonster commented Jan 19, 2016

andreasnoack commented Jan 19, 2016

simonster commented Jan 19, 2016

simonster Jan 19, 2016

Choose a reason for hiding this comment

andreasnoack Jan 19, 2016

Choose a reason for hiding this comment

andreasnoack commented Jan 19, 2016

tkelman commented Jan 19, 2016

timholy commented Jan 19, 2016

simonster commented Jun 25, 2016

tkelman Jun 27, 2016

Choose a reason for hiding this comment

tkelman Jul 7, 2016

Choose a reason for hiding this comment

simonster Jul 8, 2016

Choose a reason for hiding this comment