LLVM generates bad code on newer architectures #19976

Closed
pabloferz opened this issue Jan 11, 2017 · 2 comments · Fixed by #22205
Labels: performance (Must go faster), upstream (The issue is with an upstream dependency, e.g. LLVM)

Comments

pabloferz (Contributor) commented Jan 11, 2017

I didn't find an existing issue for this. Cross-reference: #19259 (comment)

I believe some of the "noisy" performance regressions seen lately are caused by this.

On my machine (Haswell), the pisum() benchmark on master gives:

julia> @benchmark pisum()
BenchmarkTools.Trial: 
  memory estimate:  0.00 bytes
  allocs estimate:  0
  --------------
  minimum time:     33.362 ms (0.00% GC)
  median time:      33.445 ms (0.00% GC)
  mean time:        33.532 ms (0.00% GC)
  maximum time:     34.761 ms (0.00% GC)
  --------------
  samples:          150
  evals/sample:     1
  time tolerance:   5.00%
  memory tolerance: 1.00%

while on 0.5 I get

julia> @benchmark pisum()
BenchmarkTools.Trial: 
  memory estimate:  0.00 bytes
  allocs estimate:  0
  --------------
  minimum time:     7.548 ms (0.00% GC)
  median time:      7.571 ms (0.00% GC)
  mean time:        7.611 ms (0.00% GC)
  maximum time:     8.404 ms (0.00% GC)
  --------------
  samples:          657
  evals/sample:     1
  time tolerance:   5.00%
  memory tolerance: 1.00%

This is despite both versions showing practically the same LLVM code.
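
For reference, pisum here presumably refers to the classic microbenchmark from Julia's perf suite; the issue doesn't quote the definition, so the following is only a sketch of the assumed code:

```julia
# Sketch of the classic pisum microbenchmark (assumed definition; not quoted
# in this issue). The inner 1/k^2 reduction is the loop LLVM vectorizes.
function pisum()
    s = 0.0
    for j in 1:500
        s = 0.0
        for k in 1:10000
            s += 1.0 / (k * k)
        end
    end
    return s
end

# The LLVM IR / native code mentioned above can be inspected with, e.g.:
# @code_llvm pisum()
# @code_native pisum()
```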

cc @yuyichao

@pabloferz added the performance (Must go faster) and regression (Regression in behavior compared to a previous version) labels on Jan 11, 2017
vchuravy (Member) commented:
A direct way of seeing the effect on the same version of Julia is:

julia -C core2 --precompiled=no pisum.jl
@benchmark(pisum()) = BenchmarkTools.Trial: 
  memory estimate:  0.00 bytes
  allocs estimate:  0
  --------------
  minimum time:     19.512 ms (0.00% GC)
  median time:      19.795 ms (0.00% GC)
  mean time:        19.797 ms (0.00% GC)
  maximum time:     21.656 ms (0.00% GC)
  --------------
  samples:          253
  evals/sample:     1
  time tolerance:   5.00%
  memory tolerance: 1.00%

vs:

julia -C haswell --precompiled=no pisum.jl
@benchmark(pisum()) = BenchmarkTools.Trial: 
  memory estimate:  0.00 bytes
  allocs estimate:  0
  --------------
  minimum time:     37.291 ms (0.00% GC)
  median time:      37.888 ms (0.00% GC)
  mean time:        37.931 ms (0.00% GC)
  maximum time:     39.078 ms (0.00% GC)
  --------------
  samples:          132
  evals/sample:     1
  time tolerance:   5.00%
  memory tolerance: 1.00%
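
For completeness, a pisum.jl along these lines (a sketch only; it assumes BenchmarkTools and repeats the pisum definition from the first comment, since the actual script isn't attached) reproduces the comparison:

```julia
# pisum.jl -- sketch of a driver for the -C core2 vs -C haswell runs above
# (assumed contents; the actual script is not attached to this issue).
using BenchmarkTools

function pisum()
    s = 0.0
    for j in 1:500
        s = 0.0
        for k in 1:10000
            s += 1.0 / (k * k)
        end
    end
    return s
end

# `@show` prints "@benchmark(pisum()) = ..." as in the output above.
@show @benchmark(pisum())
```

Running the identical script under both CPU targets leaves the code-generation difference as the only variable.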

@vchuravy added the upstream (The issue is with an upstream dependency, e.g. LLVM) label on Jan 11, 2017
@yuyichao removed the regression (Regression in behavior compared to a previous version) label on Jan 11, 2017
yuyichao (Contributor) commented:
It's not a regression; I've checked that both LLVM 3.3 and 3.7.1 give the same bad code when compiling with AVX enabled.

This might have been fixed on LLVM trunk right after 3.9 was branched, and I'm trying to backport the patch.
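
As an aside, which LLVM a given Julia build links against (relevant to the 3.3 / 3.7.1 / 3.9 discussion above) can be checked with something like this sketch; Base.libllvm_version is assumed to be available in the builds in question:

```julia
# Sketch (not from the issue): report the LLVM version and build info, to tell
# whether a given build predates or includes the backported patch.
println("LLVM: ", Base.libllvm_version)
versioninfo()   # also prints the LLVM version, CPU, and word size
```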

yuyichao added a commit that referenced this issue Jun 3, 2017
This fixes a bug in the patch that fixes #19976, which caused an encoding error on 32-bit x86 and a segfault when AVX/AVX2 is enabled.

Ref LLVM bug report https://bugs.llvm.org//show_bug.cgi?id=29010
LLVM commit llvm-mirror/llvm@83260f2
Also ref where I saw this issue in #21849 (comment)
tkelman pushed a commit that referenced this issue Jun 6, 2017
This fixes a bug in the patch that fixes #19976, which caused an encoding error on 32-bit x86 and a segfault when AVX/AVX2 is enabled.

Ref LLVM bug report https://bugs.llvm.org//show_bug.cgi?id=29010
LLVM commit llvm-mirror/llvm@83260f2
Also ref where I saw this issue in #21849 (comment)

(cherry picked from commit 5a42474)
ref #22205