Add alignment to cuda_malloc_async_memory_resource. #4923

mzient · 2023-06-23T14:42:02Z

Category:

Bug fix
New feature

Description:

cudaMallocAsync allocates memory aligned to 256 bytes; however, DALI's memory_resource interface can be used to request overaligned memory. Before this PR, if cudaMallocAsync returned an insufficiently aligned pointer, DALI would neither return properly aligned memory nor raise an error.
This PR adds overalignment support by requesting larger blocks for allocations with alignment >256B and aligning the pointer accordingly. The mapping from aligned to original pointer is kept in global state; upon deallocation, the pointer being freed is looked up in the map and, if found, replaced with the original pointer.
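
The overallocate-and-align scheme described above can be illustrated with a host-only sketch. This is not the code from this PR: `std::malloc`/`std::free` stand in for `cudaMallocAsync`/`cudaFreeAsync`, `kBaseAlignment` stands in for the 256B guarantee, and the names `allocate_aligned`, `deallocate_aligned` and `g_aligned_to_original` are hypothetical, chosen only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#include <mutex>
#include <unordered_map>

namespace sketch {

// Alignment guaranteed by the underlying allocator. The PR uses 256 for
// cudaMallocAsync; std::malloc (the stand-in here) guarantees only this much.
constexpr std::size_t kBaseAlignment = alignof(std::max_align_t);

// Global state (assumed layout): aligned pointer -> pointer returned by the allocator.
std::mutex g_mutex;
std::unordered_map<void *, void *> g_aligned_to_original;

bool is_aligned(const void *p, std::size_t alignment) {
  return reinterpret_cast<std::uintptr_t>(p) % alignment == 0;
}

void *allocate_aligned(std::size_t size, std::size_t alignment) {
  // Alignment must be a non-zero power of two.
  assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
  if (alignment <= kBaseAlignment)
    return std::malloc(size);  // the allocator's own guarantee suffices

  // The base pointer is a multiple of kBaseAlignment, so the next address
  // aligned to `alignment` is at most (alignment - kBaseAlignment) bytes away.
  void *original = std::malloc(size + alignment - kBaseAlignment);
  if (!original)
    return nullptr;

  auto addr = reinterpret_cast<std::uintptr_t>(original);
  auto mask = static_cast<std::uintptr_t>(alignment) - 1;
  void *aligned = reinterpret_cast<void *>((addr + mask) & ~mask);

  if (aligned != original) {
    // Remember the original pointer so it can be passed to free() later.
    std::lock_guard<std::mutex> lock(g_mutex);
    g_aligned_to_original[aligned] = original;
  }
  return aligned;
}

void deallocate_aligned(void *ptr) {
  {
    std::lock_guard<std::mutex> lock(g_mutex);
    auto it = g_aligned_to_original.find(ptr);
    if (it != g_aligned_to_original.end()) {
      ptr = it->second;  // replace with the pointer the allocator returned
      g_aligned_to_original.erase(it);
    }
  }
  std::free(ptr);
}

}  // namespace sketch
```

In this sketch, a pointer that already satisfies the requested alignment is never entered into the map, so a failed lookup on deallocation simply means the pointer can be freed as-is.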

Additionally, a tiny performance bug is fixed in AsyncPool tests.

Additional information:

Affected modules and functionalities:

cuda_malloc_async_memory_resource
AsyncPool tests.

Key points relevant for the review:

Tests:

Checklist

Documentation

DALI team only

Requirements

Implements new requirements
Affects existing requirements
N/A

REQ IDs: N/A

JIRA TASK: N/A

Signed-off-by: Michał Zientkiewicz <[email protected]>

dali-automaton · 2023-06-23T14:43:43Z

CI MESSAGE: [8730704]: BUILD STARTED

JanuszL · 2023-06-23T15:04:49Z

dali/core/mm/malloc_resource.cc

+
+class aligned_alloc_helper {
+ public:
+  static constexpr size_t kCudaMallocAlignment = 256;


Is it given once and for all or may change depending on the version?

I think it's given: I asked about it on the CUDA channel, and all functions that allocate global memory return data aligned to at least 256B.

dali-automaton · 2023-06-23T15:19:09Z

CI MESSAGE: [8731034]: BUILD STARTED

dali-automaton · 2023-06-23T22:29:54Z

CI MESSAGE: [8731034]: BUILD FAILED

dali-automaton · 2023-06-26T08:46:42Z

CI MESSAGE: [8731034]: BUILD PASSED

Add alignment to cuda_malloc_async_memory_resource.

f99fdc5

* Overallocate and align allocations with alignment > 256B
* Store a mapping from aligned to original addresses in a global map
* Update tests

Signed-off-by: Michał Zientkiewicz <[email protected]>

JanuszL self-assigned this Jun 23, 2023

JanuszL reviewed Jun 23, 2023

JanuszL reviewed Jun 23, 2023

JanuszL reviewed Jun 23, 2023

mzient force-pushed the cudaMallocAsync_align branch from 51b93c5 to f99fdc5 on June 23, 2023 15:13

Comment fix.

cdb8d6e

Signed-off-by: Michał Zientkiewicz <[email protected]>

JanuszL approved these changes Jun 23, 2023

jantonguirao assigned banasraf Jun 26, 2023

banasraf approved these changes Jun 27, 2023

mzient merged commit bd48d14 into NVIDIA:main Jun 27, 2023

JanuszL mentioned this pull request Sep 6, 2023

Roadmap 2023 #4578

Closed
