fix #6599 #6605

vtjnash · 2014-04-23T01:14:09Z

Jeff, can you review this? I don't know how to test it effectively.

vtjnash · 2014-04-23T01:15:46Z

base/inference.jl

                    for a in ea
+                        if first # first "arg" is the function name


This isn't strictly necessary, since the first arg is usually a TopNode, but this should be more correct.

This inlining threshold seems to translate to approximately 8 expressions of low complexity, and about the same number of LLVM instructions.

vtjnash · 2014-04-23T02:14:47Z

base/inference.jl

-        return true
+    symlim = div(8,occurences)
+    if length(body.args) < symlim
+        symlim *= 12


From @stevengj's analysis (e805fd6#commitcomment-6089161) and some of my own, I think we were significantly overestimating the conversion factor. I think we are roughly 1-to-1 between the symbols counted by occurs_more and the number of arguments to the LLVM intrinsics.

The estimate here is that 12 symbols in a line is relatively short. The limit on the number of lines is intended mostly as an optimization: it skips the symbol counting when the body is unlikely to be worth inlining.
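A minimal sketch of the two-stage check described above, written for current Julia rather than the 2014 code base. The names `count_symbols` and `inline_worthy_sketch` are hypothetical, and `count_symbols` is only an assumed stand-in for `occurs_more` (it counts every leaf of the expression, which may not match what `occurs_more` counts). Only the `div(8, ...)` line limit and the factor of 12 come from the diff.

```julia
# Hypothetical stand-in for occurs_more: counts the leaves of `e`,
# giving up early once `limit` has been reached.
function count_symbols(e, limit::Int)
    e isa Expr || return 1            # any non-Expr leaf counts as one symbol
    n = 0
    for a in e.args
        n += count_symbols(a, limit - n)
        n >= limit && return n        # stop counting, the budget is spent
    end
    return n
end

# Sketch of the heuristic: a cheap statement-count test first,
# then a symbol budget of roughly 12 symbols per allowed statement.
function inline_worthy_sketch(body::Expr, occurrences::Int=1)
    symlim = div(8, occurrences)                   # limit on top-level statements
    length(body.args) < symlim || return false     # too many lines, skip counting
    symlim *= 12                                   # symbol budget
    return count_symbols(body, symlim) < symlim
end

small = Expr(:block, :(x + y))
inline_worthy_sketch(small)       # one short statement, used once
inline_worthy_sketch(small, 8)    # same body, but used in 8 places
```

With `occurrences = 2`, for example, the sketch allows fewer than 4 statements and fewer than 48 symbols, so a body that is used more often has to be proportionally smaller to qualify.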

Ok. I'd like this part of the change to be a separate patch, to separate it from the new bugfix.

Changing this back will potentially limit inlining more than before the patch, since this test now also controls inlining of arguments into functions.

stevengj · 2014-04-23T03:06:39Z

I can confirm that my FFT performance seems fine with this patch.

timholy · 2014-04-23T10:16:31Z

With this patch, #6437 is 32x faster 😄 for two dimensions. But it still does not get inlined in 3 dimensions, and hence is 100x slower than a hand-written loop. It would be great to get it inlined at least up to 4 dimensions (although eventually it just needs to be inlined, period).

JeffBezanson · 2014-04-24T16:13:14Z

base/inference.jl

+                if first
+                    first = false
+                    isa(a,SymbolNode) || return false
+                    typ = (a::SymbolNode).typ


I believe this should use exprtype and then look for Type{T}, where T is an immutable type.

JeffBezanson · 2014-04-25T01:38:34Z

The new thing is strictly a bug fix, so it should really go in a separate commit. The inline_worthy change is fine too; it just needs to be separated.

vtjnash · 2014-04-25T01:49:46Z

They are in separate commits, just in the same pull request (updated again).

fix #6599

a5d54c2

better fix for #6566. also increase inlining

6905d1e

vtjnash mentioned this pull request Apr 24, 2014

New "access to undefined reference" (from optimization?) #6599

Closed

incorporate Jeff's comments for inlining patch

7033cc5

vtjnash merged commit 7033cc5 into master Apr 26, 2014

simonster mentioned this pull request May 27, 2014

Inlining-related performance regressions #6981

Closed

vtjnash deleted the jn/6599 branch August 11, 2014 22:40
