Change the heuristics to use heap size instead of collected memory #50909
base: master
Conversation
LGTM, assuming that this interpretation of the paper seems better to us
I haven't tested it; this is just what it would look like.
So, testing this, I see mostly similar performance but higher heap sizes overall, which makes me think I might have done something wrong: I had the impression that this would make the GC run more often and that would make the max heap smaller, but that's not what I'm seeing. Further tests show that the moving average doesn't make that big of a difference; in fact it seems to be a not insignificant regression. These heuristics, however, make the linked list benchmark about 30% faster by using a smaller heap.
Restarting the job to see if it's reproducible.
@MarisaKirisame, could you please take a look at this and see if these changes make the use of MemBalancer logic correct? Also, regarding your comment here, do you feel that the …
@kpamnany yes, the change makes more sense.
@d-netto could you run some tests for this? I used the tuning factor from v8, though it might be a bit too aggressive.
At least two tests are failing with …
Yeah, it seems to be ramping very quickly |
The value you get from the paper is for interactive applications, which might be the cause of the high memory use. It might need to be scaled up by another 10x-ish. See https://github.com/MarisaKirisame/MemoryBalancer/blob/main/python/eval.py#L24 for the number we use for JS in a non-interactive setting.
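For readers following along, here is a minimal sketch of the MemBalancer rule being tuned in this thread, under the assumption that the tuning constant divides the extra-heap term (the function and parameter names below are illustrative, not the identifiers used in gc.c): the heap target is the live size L plus sqrt(L·g / (c·s)), where g is the allocation rate, s is the collection rate, and c is the tuning factor, so scaling c up shrinks the extra heap and reduces memory use, consistent with the suggestion above.

```c
#include <math.h>
#include <stdint.h>

/* Illustrative MemBalancer-style heap target (hypothetical names, not gc.c's).
 * live_bytes: bytes surviving the last collection (L)
 * alloc_rate: bytes allocated per second of mutator time (g)
 * gc_rate:    bytes collected per second of GC time (s)
 * tuning:     the trade-off constant discussed above (c); a larger value
 *             penalizes memory more, giving a smaller heap target. */
static uint64_t membalancer_target(uint64_t live_bytes, double alloc_rate,
                                   double gc_rate, double tuning)
{
    double extra = sqrt((double)live_bytes * alloc_rate / (tuning * gc_rate));
    return live_bytes + (uint64_t)extra;
}
```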
This replaces #50909, though notably it does not include the change to use heap size instead of heap memory. It adds the smoothing behavior from that prior PR (to better estimate the long-term rates and ignore transient changes), updates the GC_TIME printing to reflect the switch to the MemBalancer heuristics, and adds some other guardrails so the decisions do not get pushed too far into the future. Since, unlike several other languages that use MemBalancer, we do not have a time-based trigger for GC to update these heuristics continuously, each step needs to be reasonably conservative (against both under- and over-predicting the rate). Finally, this is stricter about observing limits set by the user, strictly limiting the exceedance rate to around 10% while avoiding some prior issues with the hard cut-off being disjoint at the limit. This means we will go over the threshold slowly if the program continues to demand more space; if we would eventually be OOM-killed by the kernel, we would simply have died of OOM now by ourselves.
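As a rough illustration of the "go over the threshold slowly" behavior described above (a hypothetical helper, not the actual code in this PR): once the computed target passes the user-set limit, each collection is only allowed to push it roughly 10% further, rather than either honoring the MemBalancer target outright or cutting off hard at the limit.

```c
#include <stdint.h>

/* Hypothetical sketch of a soft limit: past the user-set limit, the heap
 * target may only creep upward by ~10% per collection, so a program that
 * keeps demanding memory overshoots gradually instead of hitting a
 * disjoint hard cut-off. */
static uint64_t soften_limit(uint64_t target, uint64_t prev_target,
                             uint64_t user_limit)
{
    if (target <= user_limit)
        return target;                    /* under the limit: no clamp */
    uint64_t base = prev_target > user_limit ? prev_target : user_limit;
    uint64_t step = base + base / 10;     /* at most ~10% further      */
    return target < step ? target : step;
}
```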
This also changes things to do a proper moving average instead of averaging with just the last measurement.
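A minimal sketch of what a "proper moving average" could look like, assuming an exponential weighting (the function name and the smoothing weight are placeholders, not what the PR actually uses):

```c
/* Hypothetical exponentially-weighted moving average: each new sample is
 * folded into a long-running estimate, so a single unusual collection
 * cannot swing the rate the way averaging with only the last measurement
 * would. */
static double smooth_rate(double smoothed, double sample)
{
    const double alpha = 0.25;   /* placeholder weight for the new sample */
    if (smoothed == 0.0)
        return sample;           /* first sample seeds the average        */
    return (1.0 - alpha) * smoothed + alpha * sample;
}
```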
@d-netto this would be what the other interpretation of the paper meant (what they implemented in v8 but not in mmtk)