mrs_realloc: avoid copies when space is available #2133

brooksdavis · 2024-06-28T15:19:30Z

When the passed pointer is valid and already has enough space, return it unless it's at least twice the requested size.

Some software uses realloc to extend allocations by small, fixed amounts. In a naive implementation of realloc that blindly performes a new allocation, copies the data, and the frees the old allocation, this is O(n^2). For small values of n this isn't too noticable, but for asymptotically it's quite bad and revocation makes it much worse. With current tuning, we eventually reach a point where approximately every 4th allocating triggers a revocation pass.

We can make this somewhat less visible (or fix it entierly for snmalloc with it's power-of-two sized buckets) by omitting unnecessicary allocations and copies.

Bias towards improveing linear increase performance over space saving and only reallocate to shrink the allocation if we'd save at least 50% space. If we chose a smaller value then we end up doing full reallocs constantly with an allocator like snmalloc.

brooksdavis · 2024-06-28T15:25:05Z

This should improve the situation with running sort(1) with very large files (#2131)

lib/libc/stdlib/malloc/mrs/mrs.c

markjdb · 2024-06-28T16:41:00Z

Unfortunately, this doesn't appear to help much for sort. I haven't yet tried using utrace() probes to see how often we're allocating a new buffer now vs. before, though.

brooksdavis · 2024-07-01T15:05:43Z

Unfortunately, this doesn't appear to help much for sort. I haven't yet tried using utrace() probes to see how often we're allocating a new buffer now vs. before, though.

After switching to an offset check this might actually help a little.

markjdb · 2024-07-01T16:08:38Z

Unfortunately, this doesn't appear to help much for sort. I haven't yet tried using utrace() probes to see how often we're allocating a new buffer now vs. before, though.

After switching to an offset check this might actually help a little.

It helps, but only a little. :)

Sorting the original cscope identifier list takes about 45s with a modified sort, and OOMs when I use unmodified sort, with or without this libc patch.

If I enable utrace MRS probes and sort only the first million lines of the list, I see 353695 UTRACE_MRS_REALLOC records without this patch, versus 280436 with the patch, so a ~20% drop. With async revocation disabled (to making timing less noisy), I don't see any significant difference in runtime with or without the patch.

Not enough to fix the pathological behaviour, but a worthy change anyway IMO.

lib/libc/stdlib/malloc/mrs/mrs.c

bsdjhb · 2024-07-01T19:54:17Z

I wonder if there are a class of reallocs that stock jemalloc, etc. are able to handle in-place that CHERI cannot because we are only able to realloc if the bounds were overly-broad due to precision. That is, I wonder if it might make sense to do some bounds inflation for PAGE_SIZE or greater allocations? Perhaps dtrace on stock aarch64 would give a quick way to determine how many reallocs() have to copy, and what sizes those copy vs non-copy map to? (Maybe you could figure this out with utrace() as well?)

markjdb · 2024-07-02T02:06:12Z

I wonder if there are a class of reallocs that stock jemalloc, etc. are able to handle in-place that CHERI cannot because we are only able to realloc if the bounds were overly-broad due to precision. That is, I wonder if it might make sense to do some bounds inflation for PAGE_SIZE or greater allocations? Perhaps dtrace on stock aarch64 would give a quick way to determine how many reallocs() have to copy, and what sizes those copy vs non-copy map to? (Maybe you could figure this out with utrace() as well?)

Some experiments and code reading suggest that jemalloc always returns tightly bounded pointers, whereas snmalloc only ensures that pointers are bounded to the true allocation size (i.e., whatever malloc_usable_size() would presumably return). In that case, I would expect Brooks' optimization to be much more effective, since for large enough sizes the allocation granules are going to be powers of 2.

I repeated this test with snmalloc as the malloc implementation, and my test finishes much more quickly. However, snmalloc appears to be triggering a bug in the async revoker such that revocation never finishes. I note that snmalloc allocates large amounts of VA space, but the revoker should be able to skip over such ranges very quickly, so something else is going wrong. I will debug further this week, but this result suggests that we should try to make jemalloc behave more like snmalloc in this respect.

brooksdavis · 2024-07-02T10:23:05Z

I wonder if there are a class of reallocs that stock jemalloc, etc. are able to handle in-place that CHERI cannot because we are only able to realloc if the bounds were overly-broad due to precision. That is, I wonder if it might make sense to do some bounds inflation for PAGE_SIZE or greater allocations? Perhaps dtrace on stock aarch64 would give a quick way to determine how many reallocs() have to copy, and what sizes those copy vs non-copy map to? (Maybe you could figure this out with utrace() as well?)

Some experiments and code reading suggest that jemalloc always returns tightly bounded pointers, whereas snmalloc only ensures that pointers are bounded to the true allocation size (i.e., whatever malloc_usable_size() would presumably return). In that case, I would expect Brooks' optimization to be much more effective, since for large enough sizes the allocation granules are going to be powers of 2.

I think there are two things we could look at here:

Change the code to bound to isalloc(ptr) instead of the rounded size. That would let us grow within each extent until we hit the end which would have the effect of making linear growth calls have logarithmic cost. This should be fairly easy. It might be the case that we want this to be configurable because it will have cost to walk the rtree a second time to get the size.
Implementing size class growth with mremap. This should help quite a bit in sort(1) because you fairly quickly land somewhere you can continue growing in place infinitely without . This would require both implementing mremap (at least MREMAP_FIXED support) and adding mremap support to the extension code in jemalloc. (I think I'd require an MREMAP_RESERVATION_CAN_GROW flag to make sure people audit their code for pointers that need updating.)

markjdb · 2024-07-02T13:30:06Z

I wonder if there are a class of reallocs that stock jemalloc, etc. are able to handle in-place that CHERI cannot because we are only able to realloc if the bounds were overly-broad due to precision. That is, I wonder if it might make sense to do some bounds inflation for PAGE_SIZE or greater allocations? Perhaps dtrace on stock aarch64 would give a quick way to determine how many reallocs() have to copy, and what sizes those copy vs non-copy map to? (Maybe you could figure this out with utrace() as well?)

Some experiments and code reading suggest that jemalloc always returns tightly bounded pointers, whereas snmalloc only ensures that pointers are bounded to the true allocation size (i.e., whatever malloc_usable_size() would presumably return). In that case, I would expect Brooks' optimization to be much more effective, since for large enough sizes the allocation granules are going to be powers of 2.

I think there are two things we could look at here:
1. Change the code to bound to `isalloc(ptr)` instead of the rounded size.  That would let us grow within each extent until we hit the end which would have the effect of making linear growth calls have logarithmic cost. This should be fairly easy. It might be the case that we want this to be configurable because it will have cost to walk the rtree a second time to get the size.

2. Implementing size class growth with `mremap`. This should help quite a bit in sort(1) because you fairly quickly land somewhere you can continue growing in place infinitely without . This would require both implementing `mremap` (at least `MREMAP_FIXED` support) and adding `mremap` support to the extension code in jemalloc. (I think I'd require an `MREMAP_RESERVATION_CAN_GROW` flag to make sure people audit their code for pointers that need updating.)

We've talked about adding mremap() to FreeBSD before; I don't think there's any opposition to that.

I don't really understand the last parenthetical - could you please elaborate?

brooksdavis · 2024-07-02T16:11:19Z

I don't really understand the last parenthetical - could you please elaborate?

I'd bet that most users of mremap with MREMAP_FIXED will call it and not update all the places that store pointers to it because with integer addresses that services little purpose (the language's abstract machine might care, but nothing concrete does.) I'd like to add a new flag to enable extension of the reservation in order to provide a signal to the programmer that they have to audit the system for all stored pointers that how have wrong bounds.

markjdb · 2024-07-03T01:29:05Z

I don't really understand the last parenthetical - could you please elaborate?

I'd bet that most users of mremap with MREMAP_FIXED will call it and not update all the places that store pointers to it because with integer addresses that services little purpose (the language's abstract machine might care, but nothing concrete does.) I'd like to add a new flag to enable extension of the reservation in order to provide a signal to the programmer that they have to audit the system for all stored pointers that how have wrong bounds.

Assuming we're talking about the Linux interface, MREMAP_FIXED doesn't mean "remap in place", it just lets you specify the target address. If you want mremap() to either extend the existing mapping or fail, then no flags are needed, otherwise you need to pass MREMAP_MAYMOVE.

Then, I think you're proposing that mremap() with flags == 0 should fail unless the new size is smaller than the old one. I guess that's fine, it would be straightforward to require a flag as you suggest.

brooksdavis · 2024-07-03T12:09:23Z

I don't really understand the last parenthetical - could you please elaborate?

I'd bet that most users of mremap with MREMAP_FIXED will call it and not update all the places that store pointers to it because with integer addresses that services little purpose (the language's abstract machine might care, but nothing concrete does.) I'd like to add a new flag to enable extension of the reservation in order to provide a signal to the programmer that they have to audit the system for all stored pointers that how have wrong bounds.

Assuming we're talking about the Linux interface, MREMAP_FIXED doesn't mean "remap in place", it just lets you specify the target address. If you want mremap() to either extend the existing mapping or fail, then no flags are needed, otherwise you need to pass MREMAP_MAYMOVE.

Then, I think you're proposing that mremap() with flags == 0 should fail unless the new size is smaller than the old one. I guess that's fine, it would be straightforward to require a flag as you suggest.

Indeed, I had misremembered that the default was not moving and should have re-read the flag descriptions. I do think if we implement it we should implement the linux interface more or less not the never-implemented version documented in the 4.3(?) system manual.

markjdb · 2024-07-03T13:14:08Z

I don't really understand the last parenthetical - could you please elaborate?

I'd bet that most users of mremap with MREMAP_FIXED will call it and not update all the places that store pointers to it because with integer addresses that services little purpose (the language's abstract machine might care, but nothing concrete does.) I'd like to add a new flag to enable extension of the reservation in order to provide a signal to the programmer that they have to audit the system for all stored pointers that how have wrong bounds.

Assuming we're talking about the Linux interface, MREMAP_FIXED doesn't mean "remap in place", it just lets you specify the target address. If you want mremap() to either extend the existing mapping or fail, then no flags are needed, otherwise you need to pass MREMAP_MAYMOVE.
Then, I think you're proposing that mremap() with flags == 0 should fail unless the new size is smaller than the old one. I guess that's fine, it would be straightforward to require a flag as you suggest.

Indeed, I had misremembered that the default was not moving and should have re-read the flag descriptions. I do think if we implement it we should implement the linux interface more or less not the never-implemented version documented in the 4.3(?) system manual.

I'll take a stab at changing jemalloc as described, let's see how that goes first.

markjdb · 2024-07-10T15:13:30Z

I don't really understand the last parenthetical - could you please elaborate?

I'd bet that most users of mremap with MREMAP_FIXED will call it and not update all the places that store pointers to it because with integer addresses that services little purpose (the language's abstract machine might care, but nothing concrete does.) I'd like to add a new flag to enable extension of the reservation in order to provide a signal to the programmer that they have to audit the system for all stored pointers that how have wrong bounds.

Assuming we're talking about the Linux interface, MREMAP_FIXED doesn't mean "remap in place", it just lets you specify the target address. If you want mremap() to either extend the existing mapping or fail, then no flags are needed, otherwise you need to pass MREMAP_MAYMOVE.
Then, I think you're proposing that mremap() with flags == 0 should fail unless the new size is smaller than the old one. I guess that's fine, it would be straightforward to require a flag as you suggest.

Indeed, I had misremembered that the default was not moving and should have re-read the flag descriptions. I do think if we implement it we should implement the linux interface more or less not the never-implemented version documented in the 4.3(?) system manual.

I'll take a stab at changing jemalloc as described, let's see how that goes first.

This is #2147.

When the passed pointer is valid and already has enough space, return it unless it's at least twice the requested size. Some software uses realloc to extend allocations by small, fixed amounts. In a naive implementation of realloc that blindly performs a new allocation, copies the data, and then frees the old allocation this is O(n^2). For small values of `n` this isn't too noticable, but asymptotically it's quite bad and revocation makes it much worse. With current mrs tuning, we eventually reach a point where approximately every 4th increment triggers a revocation pass. We can make this somewhat less visible (or fix it entirely for snmalloc with it's power-of-two sized buckets) by omitting unnecessary allocations and copies.

brooksdavis requested review from markjdb and kwitaszczyk June 28, 2024 15:19

markjdb reviewed Jun 28, 2024

View reviewed changes

lib/libc/stdlib/malloc/mrs/mrs.c Outdated Show resolved Hide resolved

lib/libc/stdlib/malloc/mrs/mrs.c Outdated Show resolved Hide resolved

lib/libc/stdlib/malloc/mrs/mrs.c Outdated Show resolved Hide resolved

brooksdavis force-pushed the mrs-realloc-avoid-pointless-copies branch from ee18917 to f0ab240 Compare July 1, 2024 15:04

bsdjhb approved these changes Jul 1, 2024

View reviewed changes

lib/libc/stdlib/malloc/mrs/mrs.c Show resolved Hide resolved

lib/libc/stdlib/malloc/mrs/mrs.c Outdated Show resolved Hide resolved

markjdb mentioned this pull request Jul 2, 2024

sort: Avoid linear realloc loops #2137

Merged

brooksdavis force-pushed the mrs-realloc-avoid-pointless-copies branch from f0ab240 to b2b15d5 Compare July 2, 2024 10:05

markjdb approved these changes Jul 3, 2024

View reviewed changes

brooksdavis force-pushed the mrs-realloc-avoid-pointless-copies branch from b2b15d5 to f0f1bd4 Compare July 3, 2024 13:59

markjdb mentioned this pull request Jul 10, 2024

Make jemalloc bound returned pointers to their usable size #2147

Merged

markjdb mentioned this pull request Jul 17, 2024

sort on a large file never ends with enabled revocations #2131

Open

brooksdavis force-pushed the mrs-realloc-avoid-pointless-copies branch from f0f1bd4 to 815b236 Compare July 23, 2024 23:15

brooksdavis merged commit 028410e into dev Jul 24, 2024
29 checks passed

brooksdavis deleted the mrs-realloc-avoid-pointless-copies branch July 24, 2024 18:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mrs_realloc: avoid copies when space is available #2133

mrs_realloc: avoid copies when space is available #2133

brooksdavis commented Jun 28, 2024

brooksdavis commented Jun 28, 2024 •

edited

Loading

markjdb commented Jun 28, 2024

brooksdavis commented Jul 1, 2024

markjdb commented Jul 1, 2024

bsdjhb commented Jul 1, 2024

markjdb commented Jul 2, 2024

brooksdavis commented Jul 2, 2024

markjdb commented Jul 2, 2024

brooksdavis commented Jul 2, 2024

markjdb commented Jul 3, 2024

brooksdavis commented Jul 3, 2024

markjdb commented Jul 3, 2024

markjdb commented Jul 10, 2024

mrs_realloc: avoid copies when space is available #2133

mrs_realloc: avoid copies when space is available #2133

Conversation

brooksdavis commented Jun 28, 2024

brooksdavis commented Jun 28, 2024 • edited Loading

markjdb commented Jun 28, 2024

brooksdavis commented Jul 1, 2024

markjdb commented Jul 1, 2024

bsdjhb commented Jul 1, 2024

markjdb commented Jul 2, 2024

brooksdavis commented Jul 2, 2024

markjdb commented Jul 2, 2024

brooksdavis commented Jul 2, 2024

markjdb commented Jul 3, 2024

brooksdavis commented Jul 3, 2024

markjdb commented Jul 3, 2024

markjdb commented Jul 10, 2024

brooksdavis commented Jun 28, 2024 •

edited

Loading