
0.5.0-rc2 Up to 2x performance regression loading precompiled packages #18030

Closed
ufechner7 opened this issue Aug 15, 2016 · 17 comments
Labels
compiler:precompilation (Precompilation of modules), performance (Must go faster)
Milestone
0.5.x
Comments

@ufechner7

Julia 0.4.6

tic(); using PyPlot; toc()
elapsed time: 2.569278583 seconds

Julia 0.5.0-rc2

tic(); using PyPlot; toc();
elapsed time: 5.360103935 seconds

This happens with precompiled packages, therefore (to my understanding) the root cause cannot be that the new LLVM version is slower at compiling.
It would be very nice if this regression could be fixed.
Computer: Linux 64 bit, Ubuntu 14.04, i7-3770 CPU @ 3.40GHz × 4
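
For anyone reproducing this, a minimal sketch of the measurement (the Base.compilecache call and the cache path are assumptions about a default 0.5 setup, not something from this report): force precompilation up front so that the tic()/toc() timing reflects loading the existing cache rather than generating it.

# Sketch only: regenerate the cache first, then time the load.
Base.compilecache("PyPlot")   # writes PyPlot.ji (assumed location: ~/.julia/lib/v0.5)
tic(); using PyPlot; toc()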

@yuyichao
Contributor

Precompilation doesn't remove codegen overhead. Although that can't explain all of the difference, it seems that at least 50% of the time is indeed spent in LLVM.

@ufechner7
Author

Some more results. They are not consistent: loading Gtk (master branch) is even faster on 0.5 compared to 0.4.6. The slowdown factor for loading JuMP is 1.5.

Julia 0.4.6
tic(); using PyPlot; toc()
elapsed time: 2.569278583 seconds

julia> tic(); using Gtk; toc();
elapsed time: 1.424508958 seconds

julia> tic(); using JuMP; toc();
elapsed time: 0.617230441 seconds


Julia 0.5.0-rc2

tic(); using PyPlot; toc();
elapsed time: 5.360103935 seconds

julia> tic(); using Gtk; toc();
elapsed time: 0.943486983 seconds

julia> tic(); using JuMP; toc();
elapsed time: 0.910472513 seconds

@ufechner7 ufechner7 changed the title 0.5.0-rc2 2x performance regression loading precompiled packages 0.5.0-rc2 Up to 2x performance regression loading precompiled packages Aug 15, 2016
@ufechner7
Author

Why is precompilation not removing the codegen overhead? I thought that this was exactly the purpose of precompilation?

@tkelman
Contributor

tkelman commented Aug 15, 2016

We don't currently save native code in the .ji files, so LLVM still has work to do.

@ufechner7
Author

ufechner7 commented Aug 15, 2016

Which data format is stored in the .ji files?

@yuyichao
Contributor

Serialized (inferred) AST.
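
For illustration (the cache directory below is an assumption about a default 0.5 install, not something stated in this thread): the precompile caches are the .ji files under ~/.julia/lib/v0.5, and since they hold the serialized AST rather than native code, `using` still pays for LLVM codegen when they are loaded.

# Sketch: list the precompile caches and their sizes.
cachedir = joinpath(homedir(), ".julia", "lib", "v0.5")   # assumed default location
for f in filter(x -> endswith(x, ".ji"), readdir(cachedir))
    println(rpad(f, 30), " ", filesize(joinpath(cachedir, f)), " bytes")
end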

@ViralBShah ViralBShah added performance Must go faster compiler:precompilation Precompilation of modules labels Aug 15, 2016
@ViralBShah ViralBShah added this to the 0.5.x milestone Aug 15, 2016
@vtjnash vtjnash removed their assignment Aug 15, 2016
@vtjnash
Member

vtjnash commented Aug 15, 2016

Half is due to LLVM. The other half is due to jl_recache_types. The worklist in that function is optimized for hundreds of items (as it had on v0.4); on v0.5, it now often has hundreds of thousands.
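
A rough Julia illustration of the scaling problem (jl_recache_types is C code and structured differently, so this is only an analogy): a linear scan per item is fine for hundreds of entries, but makes the whole pass roughly quadratic at hundreds of thousands, whereas a hashed lookup stays cheap.

# Toy comparison only, not the actual recache logic.
function count_hits(worklist, queries)
    hits = 0
    for q in queries
        hits += (q in worklist)   # linear scan for a Vector, hashed lookup for a Set
    end
    return hits
end

items = collect(1:100000)
queries = items[1:1000]
@time count_hits(items, queries)       # Vector: ~10^8 comparisons
@time count_hits(Set(items), queries)  # Set: ~10^3 lookups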

@ufechner7
Author

Why does it have so many items now?

@ViralBShah
Member

From a user perspective, it would be nice if the PyPlot loading time were the same as before or less. I hope there is something we can do here.

@stevengj
Member

@ufechner7, in 0.5, every function corresponds to a unique type.
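
A quick REPL illustration of that point (example code, not from the thread):

julia> f(x) = x + 1;

julia> typeof(f) === typeof(sin)   # every generic function has its own singleton type
false

julia> isa(f, Function)            # and all of those types are subtypes of Function
true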

@ufechner7
Author

Is there a way to determine the number of unique types that are defined?

@JeffBezanson
Member

Most of the types that exist in the system are not explicitly defined but derived, mostly tuples of various combinations of types. It's possible to get counts by poking into the system, but I'm not really sure how it would help to know.
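
For example (illustrative only), a tuple type like this is created on demand the moment some code needs it, without ever being declared anywhere:

julia> typeof((1, 2.0, "x"))   # a derived type; exact printed form varies slightly by version
Tuple{Int64,Float64,String}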

@ufechner7
Author

Well, if the performance of jl_recache_types for a large number of types is the bottleneck, it would be good to know the number in different scenarios. For example, is this number really so much higher after loading PyPlot than after loading Gtk?

@stevengj
Member

Maybe just add printf("flagref_list.len = %zd", flagref_list.len); in jl_recache_types?

@ufechner7
Author

Same results with 0.5-rc3.

@vtjnash
Member

vtjnash commented Aug 24, 2016

Fixed by #18191. Now there should only be dozens of types to recache.

@vtjnash vtjnash closed this as completed Aug 24, 2016
@vtjnash
Member

vtjnash commented Aug 24, 2016

fwiw, I blame @carnaval for this regression. :P
