[Feature Request] Support DLPack protocol in non-training builds #15963
Labels: feature request (request for unsupported feature or enhancement)
Update: it is possible to use an OrtValue in cupy via its raw data pointer. However, as far as I know, it is not possible to go from a cupy ndarray to an OrtValue without copying the data.
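As an illustration of the zero-copy exchange being requested: NumPy also implements the DLPack protocol, so the handoff can be sketched without GPU hardware. This is only a sketch of the mechanism; `cupy.from_dlpack` works analogously on device memory, and the OrtValue data-pointer trick mentioned above is not shown here.

```python
import numpy as np

# Producer: any DLPack-capable library. Here numpy stands in for an
# OrtValue or cupy array purely for illustration; this is not the ORT API.
producer = np.arange(6, dtype=np.float32)

# Consumer imports the buffer via the DLPack protocol -- no copy is made.
consumer = np.from_dlpack(producer)
assert np.shares_memory(producer, consumer)

# Because the memory is shared, writes through the producer are
# immediately visible to the consumer.
producer[0] = 42.0
assert consumer[0] == 42.0
```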
ashrit-ms pushed a commit that referenced this issue on Feb 11, 2025:
### Description

This PR enables the Python DLPack interface by default.

### Motivation and Context

The DLPack Python interface is useful in inference mode, not only in training mode, since some inference result preprocessing may be written in torch, and unnecessary device transfers should be avoided in those cases.

closes #15963
closes #22061

TODOs:
- [x] Add tests like https://github.com/microsoft/onnxruntime/blob/5407c69028ae6dd4e87521aea147c22153d8e6c7/orttraining/orttraining/test/python/orttraining_test_ortvalue.py that are unrelated to the training feature

Co-authored-by: Xavier Dupré <[email protected]>
Co-authored-by: Justin Chu <[email protected]>
guschmue pushed a commit with the same message that referenced this issue on Mar 6, 2025.
Describe the feature request
Currently, the DLPack protocol can only be used in training builds. See: OrtValue to torch.tensor for GPU? #10327

I believe it would make sense to enable this in the main build and not only the training one. Many AI modules already support this.
Having DLPack support in onnxruntime would allow zero-copy data exchange with these modules. This is not only interesting during training. Often, multiple models are chained, in which case the output of one model is the input of the next. When we want to do pre- or post-processing between these models, we currently cannot do so without moving the data to CPU via .numpy(), which comes with a significant performance cost.

Describe scenario use case
We want to use cupy to process our model outputs in between inference runs. cupy supports the DLPack protocol, which would allow us to do so. One option would be to build with training support, but that makes our package considerably larger, which I'd like to avoid.
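For reference, the protocol itself is small: a producer exposes `__dlpack__` and `__dlpack_device__`, and a consumer calls `from_dlpack`. A minimal sketch using numpy, which implements both sides (cupy's API is analogous on device memory):

```python
import numpy as np

x = np.ones(3, dtype=np.float32)

# Producer side: a PyCapsule wrapping a DLManagedTensor, plus the
# device the data lives on -- (1, 0) is DLPack's code for CPU.
capsule = x.__dlpack__()
assert tuple(x.__dlpack_device__()) == (1, 0)

# Consumer side: import the buffer as a view, not a copy.
y = np.from_dlpack(x)
assert np.shares_memory(x, y)
```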