replace mfence with lock'ed inst on x86 #48123

d-netto · 2023-01-04T19:51:09Z

A dummy instruction with lock prefix should provide the same sequential consistency guarantees as an mfence on x86.

This had a large performance impact when benchmarking work-stealing queues for parallel marking and it would be interesting to see how/if it affects performance in general.

CC: @vchuravy

gbaraldi · 2023-01-04T20:05:24Z

This seems to be fine, linux changed to this a little while ago https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=450cbdd0125cfa5d7bbf9e2a6b6961cc48d29730
We might want to do the exact same instruction they do since apparently they did some effort on what's better

Keno

Though as the kernel people, I'm not sure why the compiler doesn't just do this and I agree with @gbaraldi that it makes sense to align with the Linux-kernel instructions.

Keno · 2023-01-04T23:30:22Z

Looks like LLVM has support to emit this since https://reviews.llvm.org/D58632 and even emits it in the fallback case when the processor has no mfence. I don't know why it still emits mfence for the llvm fence intrinsic other than that nobody bothered updating it.

Keno · 2023-01-04T23:37:03Z

@d-netto want to submit an upstream PR to remove the mfence case to see what they think?

d-netto · 2023-01-04T23:39:05Z

I believe @vchuravy already submitted a patch to LLVM to replace thefence intrinsic with the locked instruction.

Keno · 2023-01-04T23:40:00Z

Ah indeed: https://reviews.llvm.org/D129947

vtjnash · 2023-01-05T03:50:52Z

I may possibly say we should rather just wait for that to catch up with gcc and https://reviews.llvm.org/D129947, than worry about maintaining this if something changes again

vchuravy · 2023-01-05T08:49:57Z

It will take a long time for the compilers to catch up. It landed in GCC12 and it will be a second before we get it into Clang.

src/julia_atomics.h

d-netto requested a review from vtjnash January 4, 2023 19:52

Keno approved these changes Jan 4, 2023

View reviewed changes

vchuravy reviewed Jan 5, 2023

View reviewed changes

src/julia_atomics.h Outdated Show resolved Hide resolved

brenhinkeller added the compiler:codegen Generation of LLVM IR and native code label Jan 9, 2023

Diogo Netto and others added 2 commits February 10, 2023 11:04

replace mfence with lock'ed inst

704caa8

version check jl_fence

becf4cf

vchuravy force-pushed the dcn/mfence branch from 0f11cac to becf4cf Compare February 10, 2023 16:04

vchuravy mentioned this pull request Feb 10, 2023

Run GC on multiple threads #48600

Merged

vchuravy merged commit 7f4b78d into JuliaLang:master Feb 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

replace mfence with lock'ed inst on x86 #48123

replace mfence with lock'ed inst on x86 #48123

d-netto commented Jan 4, 2023 •

edited

Loading

gbaraldi commented Jan 4, 2023

Keno left a comment

Keno commented Jan 4, 2023

Keno commented Jan 4, 2023

d-netto commented Jan 4, 2023

Keno commented Jan 4, 2023

vtjnash commented Jan 5, 2023

vchuravy commented Jan 5, 2023

replace mfence with lock'ed inst on x86 #48123

replace mfence with lock'ed inst on x86 #48123

Conversation

d-netto commented Jan 4, 2023 • edited Loading

gbaraldi commented Jan 4, 2023

Keno left a comment

Choose a reason for hiding this comment

Keno commented Jan 4, 2023

Keno commented Jan 4, 2023

d-netto commented Jan 4, 2023

Keno commented Jan 4, 2023

vtjnash commented Jan 5, 2023

vchuravy commented Jan 5, 2023

d-netto commented Jan 4, 2023 •

edited

Loading