improve life with large tuples a little #16460

carnaval · 2016-05-19T20:12:14Z

I put the cutoff mentioned in #15702 in there; until now,
tuple _apply inlining was unlimited.
Otherwise it's mostly trying to avoid LLVM scalarizing loads/stores of large aggregates. Instead it uses the memcpy intrinsics, so the copy is lowered to a function call when the size exceeds some threshold.
It's not very systematic: I'm just going through examples I see with catastrophic IR explosion while throwing random large tuples around.
Without this patch, just passing a large tuple through to the constructor of a type like

immutable C
x :: NTuple{N,Int}
end

generates an amount of IR linear in the size of the tuple.

I'm hoping this won't prevent optimizations on small sizes (we may want to add our own threshold for that if it happens). Anyway, I trust LLVM more to break up a small memcpy than to merge a lot of loads.
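
As a rough analogy for the two strategies described above, here is a plain C++ sketch. It is not the codegen code from this PR: `Tuple`, `copy_scalarized`, and `copy_bulk` are hypothetical names made up for illustration, and an optimizing C++ compiler is free to turn either form into the other. The point is only the shape of what a frontend emits: one load/store pair per element versus a single bulk copy whose lowering is left to the backend.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical stand-in for an NTuple{N,Int}: N contiguous 64-bit integers.
template <std::size_t N>
struct Tuple {
    std::int64_t elts[N];
};

// Element-wise copy: one load and one store per element, so the amount of
// emitted work grows linearly with N (analogous to a scalarized aggregate copy).
template <std::size_t N>
void copy_scalarized(Tuple<N> &dst, const Tuple<N> &src) {
    for (std::size_t i = 0; i < N; ++i)
        dst.elts[i] = src.elts[i];
}

// Bulk copy: a single memcpy of the whole aggregate. The backend may expand a
// small copy inline or emit a call for a large one.
template <std::size_t N>
void copy_bulk(Tuple<N> &dst, const Tuple<N> &src) {
    assert(&dst != &src); // memcpy requires non-overlapping ranges
    std::memcpy(&dst, &src, sizeof(Tuple<N>));
}
```

Both functions produce the same result; the difference is who decides how the copy is broken up. With the bulk form that decision is deferred to the backend, which is the trade-off the last paragraph above is betting on.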

carnaval · 2016-05-19T20:30:02Z

Just for fun, before:

$ time (./julia -e 'code_native(i->i, (NTuple{2000,Int},))' | wc -l)
4010

real    0m25.366s
user    0m25.289s
sys     0m0.285s

After:

$ time (./julia -e 'code_native(i->i, (NTuple{2000,Int},))' | wc -l)
20

real    0m0.497s
user    0m0.497s
sys     0m0.210s

vtjnash · 2016-05-19T21:04:02Z

src/cgutils.cpp

+{
+    assert(!v.isboxed);
+    if (v.ispointer())
+        return tbaa_decorate(v.tbaa, build_load(builder.CreatePointerCast(v.V, t->getPointerTo()), v.typ));


should use data_pointer() instead of v.V (in case v is a constant)

this doesn't matter either then (since you already checked for isconstant)

vtjnash reviewed May 19, 2016
carnaval added 2 commits May 19, 2016 18:05

improve life with large tuples a little

71f75ce

fix align

04bba52

carnaval force-pushed the ob/cgtup branch from cb7f153 to 04bba52 Compare May 20, 2016 01:22

carnaval added 2 commits May 19, 2016 21:58

var slots too

64e38b9

fix fail test

b0d9687

vtjnash merged commit 5bef837 into master May 21, 2016

vtjnash deleted the ob/cgtup branch May 21, 2016 20:28

carnaval mentioned this pull request Jun 2, 2016

Regression in generated code #16709

Closed

shashi mentioned this pull request Jun 15, 2016

map on tuples is prohibitively slow #15695

Closed
