
unsupported getri_batch/getrf_batch for Nvidia #229

Closed
Soujanyajanga opened this issue Sep 22, 2022 · 9 comments
Labels: help wanted (tasks, issues, or features that could be implemented and contributed to the project)
Soujanyajanga commented Sep 22, 2022

As per the oneMKL LAPACK domain, the APIs getri_batch/getrf_batch are not implemented for the Nvidia backend:

void geqrf_batch(sycl::queue &queue, std::int64_t m, std::int64_t n,
                 sycl::buffer<std::complex<float>> &a, std::int64_t lda, std::int64_t stride_a,
                 sycl::buffer<std::complex<float>> &tau, std::int64_t stride_tau,
                 std::int64_t batch_size, sycl::buffer<std::complex<float>> &scratchpad,
                 std::int64_t scratchpad_size) {
    throw unimplemented("lapack", "geqrf_batch"); // <-- unimplemented
}

void getri_batch(sycl::queue &queue, std::int64_t n, sycl::buffer<std::complex<float>> &a,
                 std::int64_t lda, std::int64_t stride_a, sycl::buffer<std::int64_t> &ipiv,
                 std::int64_t stride_ipiv, std::int64_t batch_size,
                 sycl::buffer<std::complex<float>> &scratchpad,
                 std::int64_t scratchpad_size) {
    throw unimplemented("lapack", "getri_batch"); // <-- unimplemented
}

Can you please let us know when support for these APIs will be available?

We have implemented a workaround using SYCL interop; would you be interested in this?
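
For illustration, this is what a caller currently sees on the CUDA backend (our sketch, not from the thread; it assumes a queue bound to a CUDA device and buffers/sizes set up as in the getri_batch signature above):

    try {
        oneapi::mkl::lapack::getri_batch(queue, n, a, lda, stride_a, ipiv,
                                         stride_ipiv, batch_size, scratchpad,
                                         scratchpad_size);
    } catch (const oneapi::mkl::unimplemented &e) {
        std::cerr << "not implemented for this backend: " << e.what() << '\n';
    }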

AerialMantis commented Sep 27, 2022

Hi @Soujanyajanga, thanks for raising this issue. I can give you an update on the progress of these operations for the Nvidia backend.

For getrf_batch, Nvidia supports an equivalent to getrf but not to getrf_batch, so we need to implement it by manually batching the regular getrf implementation. There is a pull request open right now which does this: #209.

For getri_batch, however, Nvidia does have an equivalent, but it is provided in cuBLAS rather than cuSOLVER, which means supporting it would require some changes to the Nvidia backend. Unfortunately we don't have any immediate plans to do this, but we could incorporate it into our future roadmap.

Edit: I originally stated that getri_batch was not provided by Nvidia; it is in fact provided, but in cuBLAS rather than cuSOLVER.
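
For illustration, a minimal sketch of the manual-batching approach described above (our reconstruction under stated assumptions, not the actual code in #209; getrf_batch_emulated is a hypothetical name, while the oneapi::mkl::lapack calls are the standard oneMKL API): loop over the strided batch and run the non-batched getrf on a sub-buffer view of each matrix.

    #include <sycl/sycl.hpp>
    #include <oneapi/mkl/lapack.hpp>
    #include <complex>

    void getrf_batch_emulated(sycl::queue &queue, std::int64_t m, std::int64_t n,
                              sycl::buffer<std::complex<float>> &a, std::int64_t lda,
                              std::int64_t stride_a, sycl::buffer<std::int64_t> &ipiv,
                              std::int64_t stride_ipiv, std::int64_t batch_size) {
        // One scratchpad sized for a single factorization, reused across the batch.
        std::int64_t scratch_size =
            oneapi::mkl::lapack::getrf_scratchpad_size<std::complex<float>>(queue, m, n, lda);
        sycl::buffer<std::complex<float>> scratch{sycl::range<1>(scratch_size)};
        for (std::int64_t i = 0; i < batch_size; ++i) {
            // Sub-buffers view the i-th matrix and pivot vector of the batch
            // (assumes the strides satisfy SYCL's sub-buffer alignment rules).
            sycl::buffer<std::complex<float>> a_i(a, sycl::id<1>(i * stride_a),
                                                  sycl::range<1>(stride_a));
            sycl::buffer<std::int64_t> ipiv_i(ipiv, sycl::id<1>(i * stride_ipiv),
                                              sycl::range<1>(stride_ipiv));
            oneapi::mkl::lapack::getrf(queue, m, n, a_i, lda, ipiv_i, scratch, scratch_size);
        }
    }

A real backend implementation would likely batch at the native cuSOLVER level rather than through sub-buffers, but the control flow is the same: one non-batched factorization per matrix in the batch.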

AidanBeltonS (Contributor) commented Sep 27, 2022

> We have implemented a workaround using SYCL interop; would you be interested in this?

I have a quick question: what native function have you been using as your workaround? As far as I am aware, getri does not have a native cuSOLVER equivalent.

AidanBeltonS (Contributor) commented

I have managed to answer my own question: cuSOLVER does not implement getri, but cuBLAS does:
https://docs.nvidia.com/cuda/cublas/index.html#cublas-lt-t-gt-getribatched

I think this is something we can support, but it may take a bit of additional work in the backend to get the appropriate cuBLAS handles, etc.
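
For reference, the entry point linked above has the following prototype in the cuBLAS documentation (single-precision complex variant):

    cublasStatus_t cublasCgetriBatched(cublasHandle_t handle, int n,
                                       const cuComplex *const Aarray[], int lda,
                                       const int *PivotArray,
                                       cuComplex *const Carray[], int ldc,
                                       int *infoArray, int batchSize);

Note that it takes arrays of device pointers rather than a stride, and it writes the inverses out of place into Carray, so mapping oneMKL's strided, in-place getri_batch onto it would also require building the pointer arrays and copying the results back.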

Soujanyajanga (Author) commented

> > We have implemented a workaround using SYCL interop; would you be interested in this?
>
> I have a quick question: what native function have you been using as your workaround? As far as I am aware, getri does not have a native cuSOLVER equivalent.

For getri_batch, the CUDA equivalent API is cublasCgetriBatched. We have integrated the CUDA API with SYCL interop as a workaround. Are you interested in this approach?

AerialMantis commented

@Soujanyajanga yes, I think this is the approach we would take. If you can share your workaround, that could be useful, thanks. I've added this to our roadmap, so someone will take a look at it.

Soujanyajanga (Author) commented

> @Soujanyajanga yes, I think this is the approach we would take. If you can share your workaround, that could be useful, thanks. I've added this to our roadmap, so someone will take a look at it.

Here is the workaround implemented using SYCL interop:

    // dpct::get_default_queue() is the DPC++ Compatibility Tool helper
    // returning the default SYCL queue.
    static sycl::queue *handle;
    error = (handle = &dpct::get_default_queue(), 0);

    // ... (creating/adjusting the parameters for the CUDA API)

    cublasStatus_t err;
    cublasHandle_t handle_cuda;
    CUstream streamId = sycl::get_native<sycl::backend::cuda>(*handle);
    err = cublasCreate(&handle_cuda);
    err = cublasSetStream(handle_cuda, streamId);
    err = cublasCgetriBatched(handle_cuda, n, (cuFloatComplex **)A_array, n, dipiv,
                              (cuFloatComplex **)Ainv_array, n, dinfo_array, batch);
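
As a side note, the snippet above creates a fresh cuBLAS handle on every call, never destroys it, and does not order the cuBLAS call against other work on the SYCL queue. One way to address this (a hedged sketch of our own, not code from this thread; it reuses the variable names from the snippet above) is to run the call inside a host_task:

    handle->submit([&](sycl::handler &cgh) {
        cgh.host_task([=](sycl::interop_handle ih) {
            // Run the cuBLAS call on the queue's native stream so it is
            // ordered with other submissions on the queue.
            CUstream stream = ih.get_native_queue<sycl::backend::cuda>();
            cublasHandle_t h;
            cublasCreate(&h);
            cublasSetStream(h, stream);
            cublasCgetriBatched(h, n, (cuFloatComplex **)A_array, n, dipiv,
                                (cuFloatComplex **)Ainv_array, n, dinfo_array, batch);
            // getriBatched is asynchronous: synchronize before destroying the
            // handle. A production backend would cache one handle per queue.
            cudaStreamSynchronize(stream);
            cublasDestroy(h);
        });
    });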

@mkrainiuk added the "help wanted" label on Sep 4, 2024
JackAKirk (Contributor) commented
@hdelan should this issue be closed?

hdelan (Contributor) commented Oct 3, 2024

@JackAKirk I think so. I don't have the permissions to close the issue, but maybe @Rbiessy can.

Rbiessy (Contributor) commented Oct 3, 2024

Thanks for catching this. It looks like all the issues mentioned have been addressed, so I am closing this.

Rbiessy closed this as completed on Oct 3, 2024.