GC mark-loop rewrite #47292
Conversation
@nanosoldier
@nanosoldier
Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.
Your package evaluation job has completed - possible new issues were detected. A full report can be found here.
@nanosoldier
Your package evaluation job has completed - possible new issues were detected. A full report can be found here.
The IOError thing is a Pkg bug (which might hide some other real bug). Edit: actually, a rebase should probably update to a Pkg version where it is fixed.
Some minor comments from looking this over with you, but it looks good to me as a whole. Really solid work, Diogo.
```c
size_t elsize = ((jl_array_t *)ary8_parent)->elsize / sizeof(jl_value_t *);
#ifndef GC_VERIFY
// Decide whether need to chunk ary8
size_t nrefs = (ary8_end - ary8_begin) / elsize;
```
Should we multiply by (elem_end - elem_begin) for the number of pointers in each element?
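For context, here is a minimal sketch of the count in question. The names `count_refs` and `ptrs_per_elem` are hypothetical, not from the PR; this only illustrates the arithmetic the reviewer is asking about, with the suggested per-element pointer factor applied.

```c
#include <assert.h>
#include <stddef.h>

/* Illustrative sketch only, not the Julia runtime code. Given the byte
 * bounds of an object array's data region and the element size measured
 * in pointer-sized words, compute how many references the mark-loop would
 * scan, multiplied by the number of pointer fields per element (the
 * reviewer's suggested (elem_end - elem_begin) factor). */
static size_t count_refs(const char *begin, const char *end,
                         size_t elsize_words, size_t ptrs_per_elem)
{
    size_t nelems = (size_t)(end - begin) / (elsize_words * sizeof(void *));
    return nelems * ptrs_per_elem;
}
```

With the factor at 1 this reduces to the expression in the diff; with more than one pointer field per element, the counts differ.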
```c
// Initialize GC mark-queue
size_t init_size = (1 << 18);
gc_cache->data_stack = (jl_gc_mark_data_t *)malloc_s(init_size * sizeof(jl_gc_mark_data_t));
```
2 MB seems rather large (though 8k does seem a little small)
I think the chunk-queue shouldn't be large (the number of chunks grows with the depth of graphs of objarrays).
This is over 2 MB per thread though, which seems large
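The tradeoff being discussed can be sketched with a growable stack. The `mark_stack_t` type below is hypothetical (not the actual `jl_gc_markqueue_t`): a large initial allocation (such as `1 << 18` entries) costs memory per thread up front, while a small one (such as 8k) forces early reallocations, but geometric growth keeps push amortized O(1) either way.

```c
#include <assert.h>
#include <stdlib.h>

/* Minimal sketch of a growable mark stack; illustrative only. */
typedef struct {
    void **data;
    size_t len, cap;
} mark_stack_t;

static void ms_init(mark_stack_t *s, size_t cap)
{
    s->data = malloc(cap * sizeof(void *));
    s->len = 0;
    s->cap = cap;
}

static void ms_push(mark_stack_t *s, void *v)
{
    if (s->len == s->cap) {
        s->cap *= 2; /* geometric growth: amortized O(1) pushes */
        s->data = realloc(s->data, s->cap * sizeof(void *));
    }
    s->data[s->len++] = v;
}
```

Starting small and doubling means a thread that never marks deeply never pays the multi-megabyte cost.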
(force-pushed from f594a8d to e82cd9d)
@vtjnash any idea of what could be causing the error in
Try annotating the
It is a heuristic evaluation tool, so any changes to the code can sometimes arbitrarily change what branches it considers and what errors it thinks it might have found. You can run it locally with
(force-pushed from c0cf1ed to 5bb5bab)
@nanosoldier
@nanosoldier
Hm, not sure why this didn't go through. Let's try again: @nanosoldier
Your package evaluation job has completed - possible new issues were detected. A full report can be found here.
(force-pushed from 80f71bb to be4b416)
I'm seeing some regressions after the rebase, so it's not good to merge yet.
NVM, I was running with obj pools disabled.
@nanosoldier
Your package evaluation job has completed - possible new issues were detected. A full report can be found here.
Bump
```diff
@@ -117,6 +117,7 @@ typedef intptr_t ssize_t;
 #define LLT_FREE(x) free(x)

 #define STATIC_INLINE static inline
+#define FORCE_INLINE static inline __attribute__((always_inline))
```
This is already defined in `julia/src/support/MurmurHash3.c` (line 15 in d775750):

```c
#define FORCE_INLINE inline __attribute__((always_inline))
```
```
In file included from /home/mose/repo/julia/src/support/hashing.c:51:
/home/mose/repo/julia/src/support/MurmurHash3.c:15: warning: "FORCE_INLINE" redefined
   15 | #define FORCE_INLINE inline __attribute__((always_inline))
In file included from /home/mose/repo/julia/src/support/hashing.c:7:
/home/mose/repo/julia/src/support/dtypes.h:120: note: this is the location of the previous definition
  120 | #define FORCE_INLINE static inline __attribute__((always_inline))
```
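One common way to silence such a redefinition warning is to guard the macro so an earlier definition wins. This is a general sketch of the pattern, not the fix that was actually applied in the PR; the macro names follow the snippets in this thread.

```c
#include <assert.h>

/* First definition (as in dtypes.h in the thread above). */
#define FORCE_INLINE static inline __attribute__((always_inline))

/* A later file (like MurmurHash3.c) can guard its own definition so it
 * only takes effect when no prior definition exists; with the guard,
 * the compiler never sees two conflicting #defines. */
#ifndef FORCE_INLINE
#define FORCE_INLINE inline __attribute__((always_inline))
#endif

FORCE_INLINE int add1(int x) { return x + 1; }
```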
Previous work
Since #21590, the GC mark-loop was implemented by keeping two manually managed stacks, one of which contained iterator states used to keep track of the object currently being marked. As an example, to mark an array, we would pop the corresponding iterator state from the stack, iterate over the array until we found an unmarked reference, and, if we found one, update the iterator state (to record the index where we left off), "repush" it onto the stack, and proceed with marking the reference we just found.
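The resumable-iterator scheme described above can be sketched as follows. The types and names (`obj_t`, `iter_state_t`, `scan_resume`) are hypothetical simplifications, not the actual runtime structures:

```c
#include <assert.h>
#include <stddef.h>

/* Simplified object: a mark bit plus an array of child references. */
typedef struct obj {
    int marked;
    size_t nchildren;
    struct obj **children;
} obj_t;

/* Iterator state: the object being scanned plus the resume index. */
typedef struct {
    obj_t *parent;
    size_t idx;
} iter_state_t;

/* Resume scanning parent->children at idx; on finding an unmarked child,
 * mark it, leave idx pointing just past it (the recorded resume point),
 * and return it so the caller can "repush" the state and mark the child.
 * Returns NULL once the array is exhausted. */
static obj_t *scan_resume(iter_state_t *st)
{
    while (st->idx < st->parent->nchildren) {
        obj_t *c = st->parent->children[st->idx++];
        if (c && !c->marked) {
            c->marked = 1;
            return c;
        }
    }
    return NULL;
}
```

The cost this PR removes is exactly that per-object bookkeeping: the `iter_state_t` records must themselves be stacked and repushed.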
This PR
This PR eliminates the need to keep iterator states by modifying the object graph traversal code. We keep a single stack of `jl_value_t *` currently being processed. To mark an object, we pop it from the stack, push all of its unmarked references onto the stack, and proceed with marking.

I believe this doesn't break any invariant of the generational GC. Indeed, the age bits are set after marking (if an object survived one GC cycle, it's moved to the old generation), so this new traversal scheme doesn't change whether an object has references to old objects or not. Furthermore, we must not update GC metadata for objects in the `remset`, and we ensure this by calling `gc_mark_outrefs` in `gc_queue_remset` with `meta_updated` set to 1.
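The new traversal can be sketched as below. The types (`node_t`) and the fixed-capacity stack are hypothetical simplifications of the real `jl_value_t` / mark-queue machinery; the point is that no iterator state is ever stored, and marking an object as it is pushed guarantees termination even on cyclic graphs:

```c
#include <assert.h>
#include <stddef.h>

/* Simplified heap object: a mark bit plus outgoing references. */
typedef struct node {
    int marked;
    size_t nrefs;
    struct node **refs;
} node_t;

/* Single-stack mark-loop sketch: pop an object, push each still-unmarked
 * reference (marking it on push), repeat until the stack drains. The
 * caller must provide a stack large enough for the live frontier. */
static void mark_loop(node_t **stack, size_t len)
{
    while (len > 0) {
        node_t *obj = stack[--len];      /* pop */
        for (size_t i = 0; i < obj->nrefs; i++) {
            node_t *ref = obj->refs[i];
            if (ref && !ref->marked) {
                ref->marked = 1;         /* mark on push */
                stack[len++] = ref;      /* push unmarked reference */
            }
        }
    }
}
```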
Additional advantages

It's also possible to change the scheduling of the `jl_gc_markqueue_t` (from LIFO to FIFO, for example) without touching the mark-loop itself, which could enable further exploration of the GC in the future.

Since this PR changes the mark-loop graph traversal code, there are some changes to the heap-snapshot code, though I'm not familiar with that PR.
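The scheduling flexibility mentioned above can be illustrated with a toy queue. The `markqueue_t` type and functions here are hypothetical, not the real `jl_gc_markqueue_t` API: if the mark-loop only interacts with the queue through push and pop, swapping the pop policy switches between depth-first (LIFO) and breadth-first (FIFO) traversal without touching the loop:

```c
#include <assert.h>
#include <stddef.h>

/* Toy mark queue: [head, tail) holds pending objects in buf. */
typedef struct {
    void **buf;
    size_t head, tail;
} markqueue_t;

static void mq_push(markqueue_t *q, void *v) { q->buf[q->tail++] = v; }

/* LIFO pop: take the newest entry (depth-first traversal). */
static void *mq_pop_lifo(markqueue_t *q)
{
    return q->tail > q->head ? q->buf[--q->tail] : NULL;
}

/* FIFO pop: take the oldest entry (breadth-first traversal). */
static void *mq_pop_fifo(markqueue_t *q)
{
    return q->tail > q->head ? q->buf[q->head++] : NULL;
}
```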
Some benchmark results are here: https://hackmd.io/@Idnmfpb3SxK98-OsBtRD5A/H1V6QSzvs.