
Add cuda::mr::cuda_memory_resource #1513

Closed
wants to merge 15 commits

Conversation

@miscco (Collaborator) commented Mar 7, 2024

This introduces a new memory resource that allocates device memory through cudaMalloc / cudaFree

Fixes #1512
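
For readers unfamiliar with the cuda::mr resource concepts, a stand-alone sketch of what such a cudaMalloc / cudaFree wrapper might look like is shown below. The class name, error handling, and default alignment are illustrative assumptions and do not reproduce the exact implementation in this PR.

#include <cuda_runtime_api.h>
#include <cstddef>
#include <new>

// Hypothetical minimal resource: every allocation goes straight to cudaMalloc,
// every deallocation straight to cudaFree.
struct cuda_malloc_resource_sketch
{
  void* allocate(std::size_t bytes, std::size_t alignment = 256) const
  {
    void* ptr = nullptr;
    if (::cudaMalloc(&ptr, bytes) != cudaSuccess)
    {
      throw std::bad_alloc{}; // assumption: report allocation failure via bad_alloc
    }
    return ptr;
  }

  void deallocate(void* ptr, std::size_t, std::size_t = 256) const noexcept
  {
    ::cudaFree(ptr); // errors from cudaFree are ignored in this sketch
  }

  // A stateless resource: any two instances compare equal.
  bool operator==(const cuda_malloc_resource_sketch&) const noexcept { return true; }
  bool operator!=(const cuda_malloc_resource_sketch& other) const noexcept { return !(*this == other); }
};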

@miscco miscco requested review from a team as code owners March 7, 2024 19:13
@miscco (Collaborator, Author) commented Mar 7, 2024

@harrism you might be interested in this

@miscco miscco force-pushed the cuda_memory_resource branch from aae814f to 21ce157 Compare March 7, 2024 19:26
@miscco miscco force-pushed the cuda_memory_resource branch from 21ce157 to 98485fb Compare March 7, 2024 19:28
@miscco miscco force-pushed the cuda_memory_resource branch 4 times, most recently from a76e390 to 842943e Compare March 11, 2024 11:39
@harrism (Contributor) left a comment

Nice addition. I didn't review the refactoring, only the new cuda_memory_resource. I think the refactoring is just splitting into multiple files, and that should be clarified in the PR description. It would have been nice to separate these into separate PRs.

Most of my suggestions are doc clarifications and fixes.

*/
void* allocate(const size_t __bytes, const size_t __alignment) const
{
  _LIBCUDACXX_ASSERT(__alignment <= 256 && (256 % __alignment == 0),
Contributor:

suggestion: I think allocation should throw, not assert. Also note that this assertion/error means that the alignment parameter is NOT ignored.

Contributor:

Question: Just because cudaMalloc aligns to 256 (or possibly 512, even though it is documented at 256) doesn't mean an MR can't support larger alignments. I suppose users who really want that can write a wrapping resource that aligns, but is there another reason we don't support it here? It seems a bit strange that cuda::mr has this parameter, yet our most basic implementation semi-ignores it and adds another method without it that is not part of the concept.

Collaborator (author):

I added an exception / trap when we encounter an invalid alignment. I also merged the two allocate functions via a default argument.
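
Roughly, the merged signature described here might look like the following sketch, assuming 256 is the guaranteed cudaMalloc alignment; the constant name and exception type are illustrative, not the exact code in this PR.

#include <cuda_runtime_api.h>
#include <cstddef>
#include <new>
#include <stdexcept>

inline constexpr std::size_t default_cuda_malloc_alignment = 256; // assumed constant name

void* allocate(std::size_t bytes, std::size_t alignment = default_cuda_malloc_alignment)
{
  // cudaMalloc only guarantees 256-byte alignment, so reject any request it cannot
  // satisfy (larger than 256, or not a divisor of 256).
  if (alignment > default_cuda_malloc_alignment || default_cuda_malloc_alignment % alignment != 0)
  {
    throw std::invalid_argument{"Invalid alignment passed to cuda_memory_resource::allocate."};
  }

  void* ptr = nullptr;
  if (::cudaMalloc(&ptr, bytes) != cudaSuccess)
  {
    throw std::bad_alloc{};
  }
  return ptr;
}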

@jrhemstad (Collaborator), Mar 12, 2024:

> Seems a bit strange that cuda::mr has this parameter and our most basic implementation semi-ignores it and adds another method without it that is not part of the concept.

The existence of the alignment parameter does not mean all possible alignment values are valid. Each resource implementation will have its own limitations and this includes alignment.

cuda_memory_resource is intended to be the simplest possible wrapper on top of cudaMalloc, and so it will inherit all the limitations of cudaMalloc, including alignment. So a user requesting an alignment up to 256 is fine, but beyond that, we throw.

We could add an additional resource that allows for larger alignments (or an adapter!), but that would no longer be a trivial wrapper on top of cudaMalloc.
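
For illustration, the kind of aligning adapter alluded to here could over-allocate from an upstream resource and round the returned pointer up to the requested alignment. The bookkeeping below (a host-side map from aligned pointers to original pointers) is purely a sketch, not how a real CCCL adapter would be implemented.

#include <cstddef>
#include <cstdint>
#include <unordered_map>

// Hypothetical adapter: satisfies alignments larger than the upstream resource
// supports by over-allocating and aligning the returned pointer.
template <class Upstream>
class aligning_adapter_sketch
{
  Upstream* upstream_;
  std::unordered_map<void*, void*> original_; // aligned pointer -> original pointer

public:
  explicit aligning_adapter_sketch(Upstream& upstream) : upstream_(&upstream) {}

  void* allocate(std::size_t bytes, std::size_t alignment)
  {
    // Over-allocate so that some address inside the block has the requested alignment.
    void* raw = upstream_->allocate(bytes + alignment);
    const auto addr    = reinterpret_cast<std::uintptr_t>(raw);
    const auto aligned = (addr + alignment - 1) / alignment * alignment;
    void* result       = reinterpret_cast<void*>(aligned);
    original_[result]  = raw; // not thread-safe; a real adapter would need synchronization
    return result;
  }

  void deallocate(void* ptr, std::size_t bytes, std::size_t alignment)
  {
    const auto it = original_.find(ptr);
    upstream_->deallocate(it->second, bytes + alignment);
    original_.erase(it);
  }
};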

@miscco miscco force-pushed the cuda_memory_resource branch from 8cfde90 to 64088dd Compare March 12, 2024 16:38
@miscco miscco requested a review from a team as a code owner March 12, 2024 16:38
@copy-pr-bot copy-pr-bot bot force-pushed the pull-request/1532 branch from bbd9e91 to 16f9082 Compare March 12, 2024 16:38
Comment on lines 85 to 88:
// We need to ensure that the provided alignment matches the minimal provided alignment
_LIBCUDACXX_ASSERT(
  __default_cuda_malloc_alignment <= __alignment && (__alignment % __default_cuda_malloc_alignment == 0),
  "Invalid alignment passed to cuda_memory_resource::deallocate.");
Collaborator:

suggestion: I think this check is unnecessary. It's illegal to ever call deallocate(p, N, M) for p that was not returned by p = allocate(N, M). Therefore, we would have already checked the alignment during the allocate call.

Collaborator (author):

There is no guarantee that a user matches allocate and deallocate correctly.

This is just a tiny bit safer than having no check, and it does not hurt.
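
For context, the contract being discussed is that deallocate must receive exactly the arguments that produced the pointer. A small usage sketch follows; the header and resource name are taken from this PR, the rest is illustrative.

#include <cuda/memory_resource> // header assumed from this PR

void usage_example()
{
  cuda::mr::cuda_memory_resource resource{};

  void* ptr = resource.allocate(1024, 256); // 1024 bytes with 256-byte alignment
  // ... use the device memory ...
  resource.deallocate(ptr, 1024, 256);      // must match the size and alignment passed to allocate

  // Calling deallocate with mismatched arguments, e.g. resource.deallocate(ptr, 512, 128),
  // is a precondition violation; the assertion above merely catches some such mistakes.
}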

*
* @param __lhs The cuda_memory_resource
* @param __rhs The resource to compare to
* @return Comparison of both resources converted to a resource_ref<>
Contributor:

Suggestion: The semantics are pretty simple, can we just spell it out something like the following (or whatever the incantation would be)?

Suggested change:
- * @return Comparison of both resources converted to a resource_ref<>
+ * @return true if both arguments are convertible to resource_ref<>(cuda_memory_resource&), false otherwise

Collaborator (author):

I went with: "Result of equality comparison of both resources converted to a resource_ref<>"

Contributor:

OK, but then the reader has to go dig up docs on the semantics of comparison of refs. Doesn't that just compare the referenced object types? Is there a section in the docs on equality comparison that you could link to?

std::pmr::memory_resource and RMM (current) memory resources have very different comparison semantics, so I think this should be clearly documented.
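
To make the semantics concrete, here is a small sketch of what the comparison is expected to do, based on the discussion above rather than on final documentation: since cuda_memory_resource holds no state, any two instances compare equal, and the documented operator converts both sides to resource_ref<> before comparing.

#include <cuda/memory_resource> // header assumed from this PR

void comparison_example()
{
  cuda::mr::cuda_memory_resource a{};
  cuda::mr::cuda_memory_resource b{};

  // Expected to be true: both resources are stateless wrappers around cudaMalloc / cudaFree.
  const bool equal_directly = (a == b);

  // Comparing through resource_ref<> should yield the same result as the direct comparison.
  cuda::mr::resource_ref<> ref_a{a};
  cuda::mr::resource_ref<> ref_b{b};
  const bool equal_via_ref = (ref_a == ref_b);

  (void) equal_directly;
  (void) equal_via_ref;
}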

@harrism (Contributor) commented Mar 13, 2024

Typo in the PR title (the hyphen should be an underscore).

@miscco miscco changed the title Add cuda::mr::cuda_memory-resource Add cuda::mr::cuda_memory_resource Mar 13, 2024
@copy-pr-bot copy-pr-bot bot force-pushed the pull-request/1532 branch from 7674e55 to b65158b Compare April 2, 2024 10:02
@miscco miscco force-pushed the cuda_memory_resource branch 2 times, most recently from eee001b to 318af5c Compare April 2, 2024 12:46
@miscco miscco force-pushed the cuda_memory_resource branch from 318af5c to 14ddff1 Compare April 2, 2024 13:08
@copy-pr-bot copy-pr-bot bot deleted the branch NVIDIA:pull-request/1532 April 2, 2024 18:31
@copy-pr-bot copy-pr-bot bot closed this Apr 2, 2024
Successfully merging this pull request may close these issues: [FEA]: Implement a memory_resource using cudaMalloc and cudaFree