Implement custom memory manager for LLVM #16777

yuyichao · 2016-06-06T01:16:35Z

Use various ways to reuse the page that has the page protection set in order to avoid wasting a page of memory for each JIT event.

Fixes #14626

Tested on Linux with LLVM 3.7.1 and 3.8 with OrcJIT.
This reduces the memory used in the subarray test by ~100MB. (There are around 25k JIT events, each of which page-protects at least one page of memory, even though the total allocation is only around 10-20MB.)
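
The rough arithmetic behind that figure can be sketched as follows. The 4 KiB page size and the `pinned_bytes` helper are assumptions for illustration; the PR only gives the event count and the approximate saving.

```cpp
#include <cstddef>

// Hypothetical helper: lower bound on the memory pinned when every JIT
// event page-protects at least one whole page of its own.
constexpr std::size_t pinned_bytes(std::size_t jit_events, std::size_t page_size)
{
    return jit_events * page_size;
}

// ~25k events with an assumed 4 KiB page size.
constexpr std::size_t estimate = pinned_bytes(25000, 4096);
static_assert(estimate == 102400000, "25,000 pages of 4 KiB is roughly 100 MB");
```

With only 10-20MB of real allocations, most of that ~100MB is padding at the end of pages that could not be reused once protected.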

Should in principle work on OSX, using the ~~mkstemp~~ tmpfile fallback code path.
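
A minimal sketch of what a tmpfile-based path could look like, assuming the idea is to map the same file-backed pages twice: once writable and once with the final runtime protection. `DualMapping` and `map_dual` are hypothetical names, not the PR's code, and the runtime view here is only `PROT_READ` so the sketch also runs where the temporary directory is mounted `noexec`; a JIT would request `PROT_EXEC` as well.

```cpp
#include <cstdio>
#include <cstring>
#include <sys/mman.h>
#include <unistd.h>

// Two views of the same physical pages.
struct DualMapping {
    void *wr_addr = nullptr; // view the JIT writes through
    void *rt_addr = nullptr; // runtime view (a JIT would add PROT_EXEC)
    size_t size = 0;
};

// Hypothetical helper. Returns false on any failure.
inline bool map_dual(DualMapping &m, size_t size)
{
    FILE *f = tmpfile(); // anonymous temporary file, removed automatically
    if (!f)
        return false;
    int fd = fileno(f);
    if (ftruncate(fd, (off_t)size) != 0) {
        fclose(f);
        return false;
    }
    void *wr = mmap(nullptr, size, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
    void *rt = mmap(nullptr, size, PROT_READ, MAP_SHARED, fd, 0);
    fclose(f); // the mappings keep the underlying pages alive
    if (wr == MAP_FAILED || rt == MAP_FAILED) {
        if (wr != MAP_FAILED)
            munmap(wr, size);
        if (rt != MAP_FAILED)
            munmap(rt, size);
        return false;
    }
    m.wr_addr = wr;
    m.rt_addr = rt;
    m.size = size;
    return true;
}
```

Because the writable view stays writable, the rest of a page can still be filled after its runtime view has been protected, which is what lets a page be shared between JIT events.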

Does not work on Windows yet because of how we handle Windows unwinding: we patch the code section after the code is generated, which segfaults since the load address is never writable. It might be enough to modify that code to write to the local address instead of the load address. @Keno also mentioned some way to fix this properly. (This is what the request for help is mainly for....) Fixed

yuyichao · 2016-06-06T01:18:59Z

~~Another note is that LLVM 3.7.1 seems to generate corrupted eh_frame when using Keno's remote memory manager. Not sure why that happens but my current suspicion is the reuse of local address.~~ Fixed

vtjnash · 2016-06-06T01:38:08Z

> @Keno also mentioned some way to fix this properly. (This is what the request for help is mainly for....)

at present, this has been blocked on the need to write our own custom memory manager (specifically, one that supports the small memory model required by ntdll on win64 for all dynamic libraries), and then patch the LLVM runtime relocation code to add support for the ADDR32NB relocation type.

yuyichao · 2016-06-06T01:47:40Z

> specifically, one that supports the small memory model required by ntdll on win64 for all dynamic libraries

What's the requirement? That all sections be small and within 4G of each other? The current custom memory manager doesn't do that yet, but it's not that hard to add (especially if we can use the reserve function).

vtjnash · 2016-06-06T01:54:58Z

yes, and consecutively allocated (images must be non-overlapping and can't be interleaved)

yuyichao · 2016-06-06T01:56:15Z

> (images must be non-overlapping and can't be interleaved)

Doesn't this require the current allocation scheme? The pages where we have allocations basically can't be reused anymore. (edit: or no W^X)

yuyichao · 2016-06-06T01:58:32Z

Or maybe we can waste virtual memory but not physical memory?
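
One way to read that idea on a POSIX system, as a sketch rather than anything this PR implements: reserve a large range of address space up front with no access rights, and only make sub-ranges usable as they are needed. `reserve_region` and `commit_range` are hypothetical helpers, and `MAP_NORESERVE` is an assumption about the target platform.

```cpp
#include <cstddef>
#include <sys/mman.h>

// Reserve address space only; nothing in the range is accessible yet,
// so no physical memory is consumed by it.
inline void *reserve_region(std::size_t size)
{
    void *p = mmap(nullptr, size, PROT_NONE,
                   MAP_PRIVATE | MAP_ANONYMOUS | MAP_NORESERVE, -1, 0);
    return p == MAP_FAILED ? nullptr : p;
}

// Make a page-aligned sub-range usable. The kernel assigns physical
// pages lazily, when the range is first touched.
inline bool commit_range(void *base, std::size_t offset, std::size_t len)
{
    return mprotect(static_cast<char*>(base) + offset, len,
                    PROT_READ | PROT_WRITE) == 0;
}
```

Each image could then get its own non-overlapping slice of the reservation, at the cost of address space rather than resident memory.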

vtjnash · 2016-06-06T02:12:25Z

I'm not sure. It appears that the actual requirements are undocumented and considered to be an implementation detail. We may be able to get away with some lying, since we are JIT and not statically linked. But unfortunately, there's no way to check the source code of ntdll to learn its expectations.

Keno · 2016-06-06T02:16:24Z

Why would there be an expectation that the sections are non-interleaved?

vtjnash · 2016-06-06T02:38:04Z

Actually, maybe only dbghelp cares (https://msdn.microsoft.com/en-us/library/windows/desktop/ms681353(v=vs.85).aspx), so perhaps it would only cause problems for WinDbg, but not ntdll.

Keno · 2016-06-09T20:34:06Z

src/jitlayers.cpp

@@ -129,8 +131,17 @@ static void addOptimizationPasses(T *PM)
    //PM->add(createCFGSimplificationPass());     // Merge & remove BBs
 }

+#ifdef USE_MCJIT
+RTDyldMemoryManager* createRTDyldMemoryManager(void);


Move this to codegen_internal.h as well.

yuyichao · 2016-06-10T00:13:14Z

I think I addressed all the comments apart from the one about cache flushing. I was hoping that mprotect will flush the write to the memory and calling the LLVM helper will flush the instruction cache for the runtime address. Not sure what's still missing.
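
The sequence described here can be sketched as below, assuming a GCC or Clang toolchain. `publish_code` is a hypothetical helper, and `__builtin___clear_cache` stands in for the LLVM helper mentioned above; it is a no-op on x86 but matters on architectures with separate instruction and data caches.

```cpp
#include <cstddef>
#include <cstring>
#include <sys/mman.h>

// Copy generated code into a page-aligned, currently writable buffer,
// switch the buffer to its final protection, then invalidate the
// instruction cache for the range the code will be executed from.
// A JIT would pass PROT_READ | PROT_EXEC as final_prot.
inline bool publish_code(void *page, std::size_t page_len,
                         const void *code, std::size_t len, int final_prot)
{
    if (len > page_len)
        return false;
    std::memcpy(page, code, len);
    if (mprotect(page, page_len, final_prot) != 0)
        return false;
    char *begin = static_cast<char*>(page);
    __builtin___clear_cache(begin, begin + len);
    return true;
}
```

When the write address and the runtime address differ, the range passed to the cache flush has to be the runtime one, since that is the address instructions are fetched from.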


Keno · 2016-06-15T18:25:16Z

@yuyichao I think this should be merged. Will you do the honor?

yuyichao · 2016-06-15T18:26:46Z

Sure, I was just waiting for one of you to merge it ;-p

Keno · 2016-06-15T19:07:56Z

This may be triggering a compiler bug in gcc 4.8, complaining that the copy constructor of Block was deleted (even though the use in question should have been a move constructor). We may have to comment out that deletion for the GCC 4.8 series.

yuyichao · 2016-06-15T19:15:35Z

Where does it need a copy constructor for Block (whether move or not)? temp_buff.emplace_back()?

tkelman · 2016-06-15T19:16:14Z

See buildbot log if it's more useful - https://build.julialang.org/builders/build_ubuntu14.04-x64/builds/940/steps/shell_1/logs/stdio
also quick to reproduce / test things since it only takes a couple minutes to hit.

Keno · 2016-06-15T19:16:22Z

SmallVector.

yuyichao · 2016-06-15T19:22:36Z

That one is supposed to use a placement new with the default constructor... I guess the LLVM 3.7 SmallVector implementation might be constructing a Block and using placement new to move it..... GCC 6.1.1 didn't complain though......

Keno · 2016-06-15T19:40:52Z

What default constructor would it be using? How would it get the Block into its storage without using the move constructor?

yuyichao · 2016-06-15T19:41:48Z

new (ptr) Block()? The default constructor with no argument is provided.

Keno · 2016-06-15T19:49:23Z

So you were relying on that particular function not being instantiated because you're not using it?

yuyichao · 2016-06-15T19:59:14Z

And I want to make sure I don't copy/move it accidentally, since that can cause a memory leak or memory being freed at the wrong time.

Keno · 2016-06-15T20:05:19Z

Make the move constructor assert(false)?

yuyichao · 2016-06-15T20:08:19Z

Actually, move construction is always fine; move assignment is not and is deleted. I've updated #16950 to implement the move constructor properly.
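
A minimal sketch of that shape, assuming a type that owns a raw allocation. The fields and the `malloc`/`free` ownership are illustrative and not the actual `Block` from `cgmemmgr.cpp`.

```cpp
#include <cstddef>
#include <cstdlib>
#include <utility>
#include <vector>

// Illustrative stand-in: owns a buffer and must never be duplicated.
struct Block {
    char *ptr = nullptr;
    std::size_t total = 0;

    Block() = default;
    explicit Block(std::size_t n)
        : ptr(static_cast<char*>(std::malloc(n))), total(ptr ? n : 0) {}

    // Copying would lead to a double free, so it is forbidden.
    Block(const Block&) = delete;
    Block &operator=(const Block&) = delete;
    // Move assignment would have to release the current buffer first;
    // it is deleted here, matching the comment above.
    Block &operator=(Block&&) = delete;

    // Move construction is safe: the new object starts empty, so it only
    // has to steal the buffer and leave the source empty.
    Block(Block &&other) noexcept : ptr(other.ptr), total(other.total)
    {
        other.ptr = nullptr;
        other.total = 0;
    }

    ~Block() { std::free(ptr); }
};
```

A container that relocates its elements on growth, such as `std::vector` or LLVM's `SmallVector`, only needs the move constructor for `emplace_back`, so this compiles without reintroducing copies.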

tkelman · 2016-06-16T01:10:19Z

src/cgmemmgr.cpp

+
+static int init_self_mem()
+{
+    int fd = open("/proc/self/mem", O_RDWR | O_SYNC | O_CLOEXEC);


need to check before using O_CLOEXEC https://build.julialang.org/builders/package_tarball64/builds/475/steps/make%20binary-dist/logs/stdio

yuyichao · 2016-06-16

That kernel is actually too old for this method to work, although I guess we should still compile this method so that it is included in the generic binary.... Should be fixed in 3f4e1e7
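
A sketch of the kind of guard being asked for, assuming the goal is simply to compile where the headers do not define `O_CLOEXEC`. `open_cloexec` is a hypothetical helper and not necessarily what 3f4e1e7 does.

```cpp
#include <fcntl.h>
#include <unistd.h>

// Open `path` read-write with close-on-exec set. Where the headers lack
// O_CLOEXEC, fall back to setting FD_CLOEXEC after the fact (this leaves
// a small window in which another thread could fork and exec).
inline int open_cloexec(const char *path)
{
#ifdef O_CLOEXEC
    return open(path, O_RDWR | O_CLOEXEC);
#else
    int fd = open(path, O_RDWR);
    if (fd == -1)
        return -1;
    int flags = fcntl(fd, F_GETFD);
    if (flags == -1 || fcntl(fd, F_SETFD, flags | FD_CLOEXEC) == -1) {
        close(fd);
        return -1;
    }
    return fd;
#endif
}
```

The caller still has to handle a failed open, for example by falling back to another allocation method.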

JeffBezanson · 2016-06-16T21:12:02Z

BTW, the memory savings from this is absolutely great. Awesome work @yuyichao !

yuyichao added the compiler:codegen Generation of LLVM IR and native code label Jun 6, 2016

yuyichao force-pushed the yyc/codegen/memmgr branch 10 times, most recently from b9a83ae to c63e4ae Compare June 6, 2016 20:14

yuyichao mentioned this pull request Jun 6, 2016

Skip terminator record in processFDEs #16795

Merged

yuyichao force-pushed the yyc/codegen/memmgr branch 9 times, most recently from 7159938 to e855467 Compare June 7, 2016 00:29

Keno reviewed Jun 9, 2016

yuyichao force-pushed the yyc/codegen/memmgr branch 2 times, most recently from 73dbc4f to 07ea952 Compare June 10, 2016 00:10

yuyichao force-pushed the yyc/codegen/memmgr branch from 07ea952 to 6d79549 Compare June 10, 2016 18:54

yuyichao added 2 commits June 11, 2016 11:35

Move memory manager out of debuginfo.cpp

396f1ce

Implement custom memory manager for LLVM

8533a1c

Use various ways to reuse the page that has the page protection set in order to avoid wasting a page of memory for each JIT event. Fixes #14626

yuyichao force-pushed the yyc/codegen/memmgr branch from 6d79549 to 8533a1c Compare June 11, 2016 15:43

yuyichao merged commit b029f50 into master Jun 15, 2016

yuyichao deleted the yyc/codegen/memmgr branch June 15, 2016 18:26

tkelman mentioned this pull request Jun 15, 2016

Win32 std::bad_alloc during make testall1 #11083

Closed

JeffBezanson mentioned this pull request Jun 15, 2016

build failure: error: use of deleted function ‘{anonymous}::Block::Block(const {anonymous}::Block&)’ #16949

Closed

tkelman reviewed Jun 16, 2016

tkelman mentioned this pull request Jun 16, 2016

Don't use O_CLOEXEC if it is not defined #16956

Closed

yuyichao mentioned this pull request Sep 19, 2016

non-x86: ensure cgmemmgr caches are consistent #18516

Merged
