[Epic] Optionally Limit memory used by DataFusion plan #587

alamb · 2021-06-18T13:18:56Z

Is your feature request related to a problem or challenge? Please describe what you are trying to do.

The basic challenge is that as of today, DataFusion can use an unbounded amount of memory for running a plan and it is neither possible to calculate the memory before hand nor limit the use.

If DataFusion processes individual partitions that are larger than the available memory system memory, right now it will keep allocating memory from the system until it is killed by the OS or container system.

Also, when running multiple datafusion plans in the same process, each will consume memory without limit where it may be desirable to reserve / cap memory usage by any individual plan to ensure the plans don't together exceed the system memory budget

Thus, it would be nice if we could give DataFusion's plans a memory budget which they then stayed under

Describe the solution you'd like

Add an option to ExecutionConfig that has a “total plan memory budget”
Add logic to each node that requires a memory buffer to ensure it stays under the limit.

The operators that can use large amounts of memory today are:

Sort
Join
GroupByHash

There are many potential ways to ensure the limit is respected:

(Simplest) error if the budget is exceeded
(more complex): employ algorithms that can use secondary storage (e.g. temp files) like sort that spills multiple round of partial sorted results and give a final merge phase for the partition global ordering

Describe alternatives you've considered
There are some interesting tradeoffs between “up front allocation” dividing memory up across all operators that would need it and a more dynamic approach.

This is likely something that will require some major efforts over many different issues -- I suggest we use this issue to implement a simple "error if over limit" strategy and then work on more sophisticated strategies subsequently

Progress tracking

Added Jan 2022:

Remaining Work

The text was updated successfully, but these errors were encountered:

edrevo · 2021-06-18T16:28:50Z

I would add Repartition as another operation that might use a bunch of memory.

andygrove · 2021-08-08T14:04:37Z

We should also discuss creating a scheduler in DataFusion (see #64) since it is related to this work. Rather than try and run all the things at once, it would be better to schedule work based on the available resources (cores / memory). We would still need the ability to track/limit memory use within operators but the scheduler could be aware of this and only allocate tasks if there is memory budget available.

alamb · 2021-08-16T15:16:40Z

I filed #898 for tracking memory used by a plan

alamb · 2021-08-16T15:25:57Z

#899 for tracking memory used by individual operators

yjshen · 2021-11-10T03:03:57Z

I created a proposal trying to fix this. Please refer to https://docs.google.com/document/d/1BT5HH-2sKq-Jxo51PNE6l9NNd_F-FyyYcyC3SKTnkIA/edit# for the whole proposal.

alamb · 2022-01-15T12:04:40Z

I have started added a "Progress Tracking" list to the description of this ticket. Please update it with additional items as you discover them.

liukun4515 · 2022-01-15T15:10:01Z

@alamb Maybe we should take the join operation into this track.

alamb · 2022-01-15T18:43:35Z

@alamb Maybe we should take the join operation into this track.

It is a good idea @liukun4515 -- I ran out of ambition while typing up Sort and Grouping. I'll try and write up some thoughts on joins later

liukun4515 · 2022-01-16T00:56:26Z

@alamb Maybe we should take the join operation into this track.

It is a good idea @liukun4515 -- I ran out of ambition while typing up Sort and Grouping. I'll try and write up some thoughts on joins later

I'm not familiar with external operations, I will go through other databases to learn it.

alamb · 2022-01-17T15:00:23Z

I wrote up some thoughts about externalized joins on #1599

alamb · 2022-04-07T19:27:28Z

Hi @hzh0425 -- There is no estimated completion time I know of.

Thanks to @yjshen there is a way to limit the memory used in Sort. The major other operators that need to be memory limited that I now of are Group and Join -- here is hoping someone can contribute time to help in that endeavor.

alamb · 2022-10-24T15:32:31Z

Added #3941 for the project of "error if memory limits are exceeded"

alamb · 2022-11-28T15:12:52Z

Update here is that we are close to having enforced memory limits for grouping and sorting (see #3941 for more details).

We also have ideas on how to improve the grouping code that should make supporting spilling grouping easier to implement -- see #2723 (comment)

alamb · 2023-03-05T12:04:38Z

Update: we have memory limited Grouping and are now working on on joins. @korowa has added limiting for Cross Joins recently #5339 🎉

alamb · 2023-06-11T18:29:44Z

I think this is largely complete and we can track any missing items as smaller follow on PRs

SteveLauC · 2024-01-17T09:10:26Z

Hi, from this thread, it seems that DataFusion can ONLY limit the memory used by those resource-heavy operators, can it limit the memory used by the underlying FileScan operators, like ParquetExec?

Let me give a demo with the following code:

use datafusion::execution::memory_pool::{GreedyMemoryPool, MemoryPool};
use datafusion::execution::runtime_env::{RuntimeConfig, RuntimeEnv};
use datafusion::prelude::{ParquetReadOptions, SessionConfig, SessionContext};
use std::sync::Arc;

#[tokio::main(flavor = "current_thread")]
async fn main() {
    let mem_pool: Arc<dyn MemoryPool> = Arc::new(GreedyMemoryPool::new(0)); // limit memory usage to 0
    let rt_cfg = RuntimeConfig::new().with_memory_pool(mem_pool);
    let rt = RuntimeEnv::new(rt_cfg).unwrap();

    let session_cfg = SessionConfig::new();
    let ctx = SessionContext::new_with_config_rt(session_cfg, Arc::new(rt));

    ctx.register_parquet("foo", "foo.parquet", ParquetReadOptions::default())
        .await
        .unwrap();
    let df = ctx.sql("select * from foo").await.unwrap();
    df.show().await.unwrap();
}

Even though we limit the available memory to 0, the query exeuctes without any issue:

$ ls -l foo.parquet
.rw-r--r-- 484 steve 17 Jan 17:08 foo.parquet

$ cargo r -q
+-----+
| foo |
+-----+
| bar |
| bar |
| bar |
| bar |
| bar |
| bar |
| bar |
| bar |
+-----+

alamb · 2024-01-17T21:07:50Z

Hi, from this thread, it seems that DataFusion can ONLY limit the memory used by those resource-heavy operators, can it limit the memory used by the underlying FileScan operators, like ParquetExec?

That is correct, though it is concievable that we could update ParquetExec to register its memory use with the memory manager

In general DataFusion takes a pragmatic approach to memory management where the intermediate memory used as data streams through the system is not accounted (assumed to be "small") and the largest consumers of memory register their use

This trades off the additional complexity of memory tracking and management with limiting resource usage

There is some small amount more information on https://docs.rs/datafusion/latest/datafusion/execution/memory_pool/trait.MemoryPool.html

SteveLauC · 2024-01-18T01:47:37Z

Thanks for your explanation!

alamb · 2024-01-23T14:56:08Z

Thanks for your explanation!

No worries -- thanks for the good question. I filed #8966 to try and capture some of this rationale in the documentation for future readers

alamb added the enhancement New feature or request label Jun 18, 2021

alamb mentioned this issue Jun 18, 2021

Question: Can DataFusion handle larger than RAM datasets? #464

Closed

alamb mentioned this issue Aug 8, 2021

A global, shared ExecutionContext #824

Closed

alamb mentioned this issue Aug 16, 2021

Track total memory allocation used by DataFusion plans #898

Closed

jon-chuang mentioned this issue Nov 8, 2021

Task assignment between Scheduler and Executors #1221

Closed

yjshen mentioned this issue Nov 9, 2021

Managing memory usage during query execution yjshen/datafusion#3

Closed

yjshen mentioned this issue Jan 7, 2022

Initial MemoryManager and DiskManager APIs for query execution + External Sort implementation #1526

Merged

alamb closed this as completed in #1526 Jan 13, 2022

alamb reopened this Jan 14, 2022

This was referenced Jan 15, 2022

[EPIC] Memory Limited Sort (Externalized / Spill) #1568

Closed

Track memory usage in Non Limited Operators #1569

Closed

Memory Limited GroupBy (Externalized / Spill) #1570

Closed

alamb mentioned this issue Jan 17, 2022

Memory Limited Joins (Externalized / Spill) #1599

Open

5 tasks

alamb mentioned this issue Jan 21, 2022

DiskManager Performs Blocking IO #1637

Closed

yjshen mentioned this issue Jan 28, 2022

Add MemTrackingMetrics to ease memory tracking for non-limited memory consumers #1691

Merged

yjshen mentioned this issue Mar 5, 2022

Add timeout to can_grow_directly when waiting for the Condvar. #1921

Closed

alamb mentioned this issue Oct 24, 2022

[Epic] Generate runtime errors if the memory budget is exceeded #3941

Closed

4 tasks

alamb changed the title ~~Optionally Limit memory used by DataFusion plan~~ EPIC Optionally Limit memory used by DataFusion plan Nov 28, 2022

saikrishna1-bidgely mentioned this issue Jan 14, 2023

Allow SessionContext::read_csv, etc to read multiple files #4908

Merged

RustomMS mentioned this issue Mar 3, 2023

Provide memory usage limits on session creation from startup params GlareDB/glaredb#545

Closed

alamb changed the title ~~EPIC Optionally Limit memory used by DataFusion plan~~ [Epic] Optionally Limit memory used by DataFusion plan Mar 5, 2023

alamb closed this as completed Jun 11, 2023

alamb mentioned this issue Jan 23, 2024

Minor: Document memory management design on MemoryPool #8966

Merged

y-f-u mentioned this issue Aug 6, 2024

Out of Memory when i accelerate parquet files having more than 1Billion records and connect it to apache superset using flightsql spiceai/spiceai#2096

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Epic] Optionally Limit memory used by DataFusion plan #587

[Epic] Optionally Limit memory used by DataFusion plan #587

alamb commented Jun 18, 2021 •

edited

Loading

edrevo commented Jun 18, 2021

andygrove commented Aug 8, 2021

alamb commented Aug 16, 2021

alamb commented Aug 16, 2021

yjshen commented Nov 10, 2021

alamb commented Jan 15, 2022

liukun4515 commented Jan 15, 2022

alamb commented Jan 15, 2022

liukun4515 commented Jan 16, 2022

alamb commented Jan 17, 2022

alamb commented Apr 7, 2022

alamb commented Oct 24, 2022

alamb commented Nov 28, 2022 •

edited

Loading

alamb commented Mar 5, 2023

alamb commented Jun 11, 2023

SteveLauC commented Jan 17, 2024

alamb commented Jan 17, 2024

SteveLauC commented Jan 18, 2024

alamb commented Jan 23, 2024

[Epic] Optionally Limit memory used by DataFusion plan #587

[Epic] Optionally Limit memory used by DataFusion plan #587

Comments

alamb commented Jun 18, 2021 • edited Loading

Progress tracking

edrevo commented Jun 18, 2021

andygrove commented Aug 8, 2021

alamb commented Aug 16, 2021

alamb commented Aug 16, 2021

yjshen commented Nov 10, 2021

alamb commented Jan 15, 2022

liukun4515 commented Jan 15, 2022

alamb commented Jan 15, 2022

liukun4515 commented Jan 16, 2022

alamb commented Jan 17, 2022

alamb commented Apr 7, 2022

alamb commented Oct 24, 2022

alamb commented Nov 28, 2022 • edited Loading

alamb commented Mar 5, 2023

alamb commented Jun 11, 2023

SteveLauC commented Jan 17, 2024

alamb commented Jan 17, 2024

SteveLauC commented Jan 18, 2024

alamb commented Jan 23, 2024

alamb commented Jun 18, 2021 •

edited

Loading

alamb commented Nov 28, 2022 •

edited

Loading