WIP: linear IR #24027

JeffBezanson · 2017-10-06T21:32:57Z

In short, after this a call expression can only appear as the right-hand side of an assignment to an SSAValue (or in statement position, but that might change too). Here's what I've done so far:

Turn on very-linear-mode (the easy part!)
In codevalidation.jl, implement much stricter new rules for where expressions can appear.
Make sure julia-syntax.scm follows those rules, including not allowing slot = Expr(:call, ...).
Remove the loathsome typ field from Expr 🎉 , instead using the type of a call's SSAValue.
Update optimization passes to preserve the linear structure.
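
For illustration only (this is not the code in codevalidation.jl), the central rule can be modelled in a few lines of Python. The `Expr`, `SSAValue`, and `Slot` classes and the `validate_linear` function are hypothetical stand-ins for the Julia IR nodes, and the sketch checks only two things: that a call appears either in statement position or as the right-hand side of an assignment to an SSAValue, and that call arguments are not themselves expressions.

```python
from dataclasses import dataclass
from typing import Any, List


@dataclass(frozen=True)
class SSAValue:  # hypothetical model of an SSA value
    id: int


@dataclass(frozen=True)
class Slot:  # hypothetical model of a local variable slot
    name: str


@dataclass
class Expr:  # hypothetical model of an expression node
    head: str
    args: List[Any]


def is_call(x):
    return isinstance(x, Expr) and x.head == "call"


def validate_linear(stmts):
    """Return (index, message) pairs for statements that break the rule."""
    errors = []
    for i, st in enumerate(stmts):
        call = None
        if is_call(st):
            call = st  # a call in statement position is allowed
        elif isinstance(st, Expr) and st.head == "=":
            lhs, rhs = st.args
            if is_call(rhs):
                call = rhs
                if not isinstance(lhs, SSAValue):
                    errors.append((i, "call assigned to a non-SSAValue"))
        # Linear form: the arguments of a call must not be expressions.
        if call is not None and any(isinstance(a, Expr) for a in call.args):
            errors.append((i, "nested expression inside call"))
    return errors
```

Under these assumptions, `SSAValue(8) = sizeof(s)` followed by `i = SSAValue(8)` passes, while `i = sizeof(s)` is reported.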

Still to do:

cglobal is the only thing that causes validation failures, due to needing to see its argument symbol/tuple. Need to decide how to handle this. We can either make it a special form, or have it look inside a constant jl_cgval_t argument.
Make sure all optimizations are still working.
Update code_warntype.

The new IR is of course larger (sysimg +20%), but undeniably beautiful. Inlining is already much simpler and other passes will benefit in the future as well. I think we'll be able to make up the difference with new optimizations and clever encoding. For example, we could avoid inserting source location push/pop when inlining trivial functions. Here's a typical excerpt:

        # meta: location strings/string.jl endof 202
        # meta: location strings/string.jl sizeof 62
        SSAValue(7) = Core.sizeof
        SSAValue(8) = (SSAValue(7))(s)
        # meta: pop location
        i = SSAValue(8)
        #= line 203 =#
        8: 
        # meta: location operators.jl > 249
        # meta: location int.jl < 39
        SSAValue(10) = (Base.slt_int)(0, i)
        # meta: pop location
        # meta: pop location

We could also add a special encoding for assignment to an SSAValue.

yuyichao · 2017-10-06T21:39:54Z

What's the new way of getting the rhs type of

slot = call

?

JeffBezanson · 2017-10-06T21:41:25Z

slot = call is not allowed.

yuyichao · 2017-10-06T21:52:30Z

Ah, I missed that. This level of SSAValue usage is going to hurt #23240 really badly... A lot of the optimizations there require looking at the expression that is assigned to the slot, and looking through multiple assignment is really hard with the current AST format.

From my experiment in #23240, as well as some additional thought recently, I feel like our final goal should be a purely SSA-based IR similar to what LLVM uses. As an incremental step, it seems that the form in #23240 (though preferably in frontend...) is relatively easy to analyse while not having to put everything in SSA. Getting any further with SSA values without removing slots altogether seems to make optimization much harder, so I would prefer to do this at the same time as introducing a phi node. It will still require looking through phi nodes, but they carry information about control flow, and their inputs are much easier to analyse, which is not the case for slots...

yuyichao · 2017-10-06T22:10:47Z

Also note that phi nodes can easily be lowered back to slots without losing any information, so that can be done after optimization, and other parts of the system that are not ready for phi nodes don't have to deal with them yet. My next target after #23240 was going to be a rewrite of it in a similar fashion, but with a transformation to BBs with phi nodes in order to explore control flow information.
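
As a rough sketch of what lowering phi nodes back to slots could look like (a simplified Python model, not anything from #23240 or this PR): each phi gets its own fresh slot, every predecessor block writes its incoming value to that slot before its terminator, and the phi itself becomes a read of the slot. The statement tuples, the `lower_phis` name, and the `phi_...` slot naming are all assumptions made for the example.

```python
def lower_phis(blocks):
    """Replace phi statements with slot writes in predecessors and a slot read.

    blocks maps a label to a list of statement tuples. A phi is written as
    ("phi", dest, {pred_label: value}); every other tuple is copied unchanged.
    """
    out = {label: [] for label in blocks}
    pending = {label: [] for label in blocks}  # slot writes owed by each block
    for label, stmts in blocks.items():
        for st in stmts:
            if st[0] == "phi":
                _, dest, incoming = st
                slot = ("slot", f"phi_{dest}")  # one fresh slot per phi
                for pred, value in incoming.items():
                    pending[pred].append(("assign", slot, value))
                out[label].append(("assign", dest, slot))
            else:
                out[label].append(st)
    for label, copies in pending.items():
        body = out[label]
        if body and body[-1][0] in ("goto", "gotoifnot"):
            body[-1:-1] = copies  # keep the terminator as the last statement
        else:
            body.extend(copies)
    return out


blocks = {
    "entry": [("gotoifnot", "c", "else")],
    "then": [("assign", "%1", 1), ("goto", "join")],
    "else": [("assign", "%2", 2)],
    "join": [("phi", "%3", {"then": "%1", "else": "%2"}), ("return", "%3")],
}
lowered = lower_phis(blocks)
```

Because each phi reads from a slot of its own, the slot writes in the predecessors cannot clobber one another in this model.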

JeffBezanson · 2017-10-07T00:22:46Z

looking through multiple assignment is really hard with the current AST format

Why? You can look up the definition of an SSAValue in an array.
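
A minimal sketch of that lookup, in Python and under assumed data shapes (statements as `(lhs, rhs)` pairs, SSA values as `("ssa", id)`, slots as `("slot", name)`, calls as `("call", f, args...)`); the function names are hypothetical:

```python
def ssa_definitions(stmts, n_ssa):
    """Build an array mapping SSA id -> defining right-hand side."""
    defs = [None] * n_ssa
    for lhs, rhs in stmts:
        if lhs[0] == "ssa":
            defs[lhs[1]] = rhs
    return defs


def resolve(value, defs):
    """Follow SSA references until reaching something that is not an SSA value."""
    while (isinstance(value, tuple) and value[0] == "ssa"
           and defs[value[1]] is not None):
        value = defs[value[1]]
    return value


stmts = [
    (("ssa", 0), ("call", "sizeof", ("slot", "s"))),
    (("ssa", 1), ("ssa", 0)),
    (("slot", "i"), ("ssa", 1)),
]
defs = ssa_definitions(stmts, 2)
rhs_of_i = resolve(stmts[2][1], defs)  # the call that ultimately feeds slot i
```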

the form in #23240 (though preferably in frontend...) is relatively easy to analyse while not having to put everything in SSA

What rules would you like?

If possible, it would be nice just to change the official IR format to that needed by #23240. I strongly suspect this PR can be made to implement that.

yuyichao · 2017-10-07T01:22:08Z

Why? You can look up the definition of an SSAValue in an array.

The direct assignment gives two important pieces of information:

There's a single use of the rhs
There's nothing in between the assignment and the evaluation of the rhs.
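
To make the extra checks concrete, here is a hedged Python sketch of recovering both properties when the slot is assigned from an SSAValue instead. It assumes statements are `(lhs, rhs)` pairs with SSA values as `("ssa", id)`, slots as `("slot", name)`, and calls as `("call", f, args...)`; the function names are hypothetical.

```python
def uses(value, ssa_id):
    """Count occurrences of ("ssa", ssa_id) in a value, looking inside calls."""
    if value == ("ssa", ssa_id):
        return 1
    if isinstance(value, tuple) and value and value[0] == "call":
        return sum(uses(a, ssa_id) for a in value[1:])
    return 0


def behaves_like_direct_assignment(stmts, idx):
    """True if stmts[idx] is `slot = ssa` and both properties still hold."""
    lhs, rhs = stmts[idx]
    if lhs[0] != "slot" or not (isinstance(rhs, tuple) and rhs[0] == "ssa"):
        return False
    ssa_id = rhs[1]
    # Property 1: the SSA value is used exactly once in the whole body.
    single_use = sum(uses(r, ssa_id) for _, r in stmts) == 1
    # Property 2: the definition is the statement immediately before.
    adjacent = idx > 0 and stmts[idx - 1][0] == ("ssa", ssa_id)
    return single_use and adjacent
```

With `slot = call` both facts follow from the syntax; here they need a scan over the body (or a maintained use list) plus a positional check.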

Of course these are all computable when it's put in an SSAValue first, but it adds a lot of checks. It interferes with some logic that's very specific to the solution used in #23240.

1. Currently the invalid uses are kept in the list and are only cleared out when I'm optimizing for that value, so that I don't need to constantly scan through the use/def list. This is why I want to avoid looking through more than just a single value every time.
2. The current code keeps track of which values need to be rescanned. Looking at multiple values also makes this harder.

I'd like to get rid of both of these pieces of logic in the next version, and I think using a linked list like LLVM should be able to handle 1 easily. Having 1 removed and being able to look at multiple values at the same time should also make 2 easier (so that I can scan deeper and see which value is affected). If the scan of use/def becomes much simpler than what I have right now, the whole rescan table might not even be needed anymore.

What rules would you like?

Allow the rhs of slot assignment to be any expression. (so hold off the second commit until a better optimization pass is ready). On a related note, I feel like a good representation for optimization/type inference would just have the ssavalue, the type and the rhs be stored together, since optimizations frequently need to go from one to another (ssa->type, ssa->rhs, rhs(instruction)->ssa). One can argue whether the type goes with the ssavalue or the rhs at that point, but it does seem like a representation that's a superset of what Expr has, so I don't think we have to get rid of the typ field in Expr that urgently.

JeffBezanson · 2017-10-07T02:09:28Z

Allow the rhs of slot assignment to be any expression.

Ok, I think I can allow that. I believe that currently, an assignment LHS is never a TypedSlot, so we could use one of those to store the type of the RHS.

have the ssavalue, the type and the rhs be stored together

That would be fine with me --- the current representation of

Expr
  head: Symbol =
  args: Array{Any}((2,))
    1: SSAValue
      id: Int64 0
    2: Expr
      head: Symbol call
      args: Array{Any}((2,))
        1: Symbol f
        2: Symbol x

is pretty inefficient anyway. It could be something like

SSADef
  id: Int64 0
  typ: Any
  rhs: Expr
    head: Symbol call
    args: Array{Any}((2,))
      1: Symbol f
      2: Symbol x
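
As a language-neutral illustration of that record (a Python sketch; the `Body` container and its method names are invented for the example, and only the `id`/`typ`/`rhs` fields come from the dump above), storing the three together makes the ssa->type, ssa->rhs, and rhs->ssa lookups direct:

```python
from dataclasses import dataclass
from typing import Any, List, Optional


@dataclass
class SSADef:
    id: int   # SSA value number
    typ: Any  # inferred type of the right-hand side
    rhs: Any  # the defining expression


class Body:
    """Hypothetical container holding one SSADef per SSA value."""

    def __init__(self) -> None:
        self.defs: List[SSADef] = []

    def emit(self, rhs: Any, typ: Any) -> SSADef:
        d = SSADef(len(self.defs), typ, rhs)
        self.defs.append(d)
        return d

    def type_of(self, ssa_id: int) -> Any:  # ssa -> type
        return self.defs[ssa_id].typ

    def rhs_of(self, ssa_id: int) -> Any:  # ssa -> rhs
        return self.defs[ssa_id].rhs

    def def_of(self, rhs: Any) -> Optional[SSADef]:  # rhs -> ssa
        # Linear scan by identity; a real design might store a back-pointer.
        for d in self.defs:
            if d.rhs is rhs:
                return d
        return None


body = Body()
call = ("call", "f", "x")
d0 = body.emit(call, int)
```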

yuyichao · 2017-10-07T02:18:20Z

It could be something like

Yes, exactly.

an assignment LHS is never a TypedSlot, so we could use one of those to store the type of the RHS.

Yeah. That works too.

JeffBezanson · 2017-10-12T22:08:27Z

Closing temporarily. I'll put up a more incremental change first, but please keep this branch.

JeffBezanson added the compiler:codegen (Generation of LLVM IR and native code) and compiler:inference (Type inference) labels Oct 6, 2017

JeffBezanson force-pushed the jb/linear-ir branch from bca2951 to 2bc4c38 October 10, 2017 14:11

JeffBezanson added 5 commits October 10, 2017 15:02

turn on linear IR

d3a031b

remove typ field from Expr

74ccb5c

update optimization passes to keep IR in linear form use typedslot LHS to store type of RHS

allow replacing ssavalue with globalref

0c4cdab

better effect_free support for :isdefined exprs

4aa0560

WIP

50e0b87

[ci skip]

JeffBezanson force-pushed the jb/linear-ir branch from 2bc4c38 to 50e0b87 October 10, 2017 20:22

JeffBezanson closed this Oct 12, 2017

JeffBezanson mentioned this pull request Oct 13, 2017

turn on linear IR #24113

Merged

JeffBezanson mentioned this pull request Jun 8, 2018

remove Expr typ field #27499

Merged

JeffBezanson deleted the jb/linear-ir branch June 13, 2018 18:03
