[EPIC] Roadmap for cuda/memory_resource #1502

jrhemstad · 2024-03-06T16:58:15Z

cuda::mr is intended to be the future of heterogenous memory allocation in CUDA C++. It is inspired heavily by lessons learned in RMM and our experience with the device_memory_resource* and friends. cuda::mr does not seek to replace RMM, but instead distill and standardize the best parts of RMM into a more central location. Furthermore, RMM is already in the process of rebasing on top of using the cuda::mr interface.

What we have today is the cuda/memory_resource header that provides

[async_]resource concepts
Property system
[async_]resource_ref polymorphic type

In essence, this just provides the top-level interface for memory allocation and defining properties of the allocated memory.

Implementation plan

Misc

Add NVTX annotations to all memory resources
Come up with plan for coupling between async_resource and resource concept #2313
Explore a stricter separation of synchronous and asynchronous containers and APIs #2891

Concrete types that satisfy the C++ allocator requirements

A cuda::mr::allocator<T, Properties...> capable of preserving concrete type of the resource (no type-erasure)
A cuda::mr::polymorphic_allocator<T, Properties...> constructible from a resource_ref<Properties...>

Questions we'll need to answer along the way:

What lifetime semantics do we want to use for resources + allocators + data structures?
- RMM took a very relaxed approach of using non-owning references everywhere, but this is worth reconsidering (see [DOC] Document memory resource lifetime requirements when taking ownership of device buffers in Python rapidsai/rmm#1492
Do all data structures only take Allocators? Or just resources? Both?
- In RMM, we took an approach of only constructing from resource_refs directly, but this was mostly for expediency and convenience, so it is worth reconsidering.

The text was updated successfully, but these errors were encountered:

miscco · 2024-03-07T17:09:06Z

@harrism You might be interested in this

vyasr · 2024-03-08T00:44:43Z

Is there a long-term plan to pull more of the concrete implementations from rmm into CCCL? That seems like the best way to broaden adoption and usage of these allocators and would satisfy some of the new features mentioned above IIRC.

miscco · 2024-03-08T07:11:50Z

@vyasr yes I believe we want to pull some of the foundational features into cccl. Definitely not all but some

jrhemstad · 2024-03-08T17:20:56Z

Is there a long-term plan to pull more of the concrete implementations from rmm into CCCL? That seems like the best way to broaden adoption and usage of these allocators and would satisfy some of the new features mentioned above IIRC.

Yes, that is what we mean by "Concrete types that satisfy the resource and async_resource concepts"

fbusato · 2024-03-08T18:09:30Z

our RFE:

deallocate/deallocate_async functions should accept const void* to skip const_cast<T*>() on the user side
Allow cuda::mr::* functions in device code
Clarify (or fix) the expected behavior of allocate() deallocate() for async_resource
- Personal thought: remove _async() version of the API and add the stream to allocate/deallocate

jrhemstad · 2024-03-08T18:11:55Z

Clarify (or fix) the expected behavior of allocate() deallocate() for async_resource

Can you elaborate on what you mean? allocate() and deallocate() are expected to always be synchronous.

fbusato · 2024-03-08T18:37:19Z

Can you elaborate on what you mean? allocate() and deallocate() are expected to always be synchronous.

Yes, but what is their purpose if the code uses an async_resource with _async() API. They look redundant and confusing in this case

jrhemstad · 2024-03-08T18:47:07Z

Yes, but what is their purpose if the code uses an async_resource with _async() API. They look redundant and confusing in this case

The thinking is that the async_resource concept is a strict superset of the resource concept. This way, if you have an async_resource object, you can still conveniently pass it to a function that expects a resource.

fbusato · 2024-03-08T18:53:20Z

ok, I didn't interpret async_resource as a superset of the resource concept. In this case, can we please just clarify this point on the doc?

jrhemstad assigned miscco Mar 6, 2024

github-project-automation bot added this to CCCL Mar 6, 2024

github-project-automation bot moved this to Todo in CCCL Mar 6, 2024

miscco added feature request New feature or request. libcu++ For all items related to libcu++ labels Mar 7, 2024

miscco moved this from Todo to In Progress in CCCL Mar 7, 2024

wence- mentioned this issue Apr 2, 2024

[FEA] Ensure that Cython wrappers for allocating objects correctly expose and use memory resources rapidsai/rmm#1515

Open

jrhemstad mentioned this issue Apr 18, 2024

[THEME] CUDA Runtime Modernization #1646

Open

16 tasks

jrhemstad mentioned this issue Jul 15, 2024

[FEA]: Allow default initialization for thrust vectors #1992

Open

1 task

ericniebler added the CUDA Next Feature intended for the Cuda Next experimental library label Jul 23, 2024

ericniebler mentioned this issue Dec 9, 2024

add support for comparing type-erased wrappers to non-type-erased objects #3100

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[EPIC] Roadmap for cuda/memory_resource #1502

[EPIC] Roadmap for cuda/memory_resource #1502

jrhemstad commented Mar 6, 2024 •

edited

Loading

miscco commented Mar 7, 2024

vyasr commented Mar 8, 2024

miscco commented Mar 8, 2024

jrhemstad commented Mar 8, 2024

fbusato commented Mar 8, 2024

jrhemstad commented Mar 8, 2024

fbusato commented Mar 8, 2024 •

edited

Loading

jrhemstad commented Mar 8, 2024

fbusato commented Mar 8, 2024

[EPIC] Roadmap for cuda/memory_resource #1502

[EPIC] Roadmap for cuda/memory_resource #1502

Comments

jrhemstad commented Mar 6, 2024 • edited Loading

Implementation plan

Misc

Concrete types that satisfy the C++ allocator requirements

miscco commented Mar 7, 2024

vyasr commented Mar 8, 2024

miscco commented Mar 8, 2024

jrhemstad commented Mar 8, 2024

fbusato commented Mar 8, 2024

jrhemstad commented Mar 8, 2024

fbusato commented Mar 8, 2024 • edited Loading

jrhemstad commented Mar 8, 2024

fbusato commented Mar 8, 2024

jrhemstad commented Mar 6, 2024 •

edited

Loading

fbusato commented Mar 8, 2024 •

edited

Loading