-
-
Notifications
You must be signed in to change notification settings - Fork 723
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow P2P to store data in-memory #8279
Conversation
Unit Test ResultsSee test report for an extended history of previous test failures. This is useful for diagnosing flaky tests. 27 files + 1 27 suites +1 15h 49m 14s ⏱️ + 1h 8m 8s For more details on these failures, see this check. Results for commit 255e477. ± Comparison against base commit a8e5dab. This pull request removes 51 and adds 107 tests. Note that renamed tests count towards both.
♻️ This comment has been updated with latest results. |
from distributed.utils import log_errors | ||
|
||
|
||
class MemoryShardsBuffer(ShardsBuffer): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This can be refactored to avoid most of the baggage it inherits from ShardsBuffer
. I'd do this in a follow-up since I expect us to rework buffers anyhow and don't want to spend time on API compatibility with the DiskShardsBuffer
until then.
This is ready for review; I'm running some A/B tests to see the impact. |
This seems bad to me I've been playing with TPC-H recently and have been really appreciating how Dask is able to keep running even when other systems break down because we didn't have as much memory as we thought we did. I'd encourage you to generate scale-100 on your personal computer and then run local TPC-H and see what happens with this PR. My guess is that it isn't immediately obvious how to do this. I should write up instructions probably. |
This is the first iteration of diskless shuffling. It is explicitly "opt-in" by requiring you to set a config option. Falling back to disk in case a worker unexpectedly runs out of memory has always been considered out of scope for this iteration. |
Ah, sorry. Thanks for the clarification.
…On Thu, Oct 19, 2023 at 9:57 AM Hendrik Makait ***@***.***> wrote:
This seems bad to me
This is the first iteration of diskless shuffling. It is explicitly
"opt-in" by requiring you to set a config option. Falling back to disk in
case a worker unexpectedly runs out of memory has always been considered
out of scope for this iteration.
—
Reply to this email directly, view it on GitHub
<#8279 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AACKZTD6IBI2WW2TRFHHGYTYAE5U5AVCNFSM6AAAAAA6E7IYX6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTONZRGE3DINBRGM>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
Includes and blocked by AllowResourceLimiter
to be unlimited #8276pre-commit run --all-files