Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding optional CUDA DLLs when installing onnxruntime_gpu #22506

Open
wants to merge 50 commits into
base: main
Choose a base branch
from

Conversation

jchen351
Copy link
Contributor

Description

This code change enable user to install Nvidia CUDA DLLs when installing onnxruntime_gpu. with pip install onnxruntime_gpu[cuda_dlls].

It will also enable onnxruntime_gpu to use dynamic libraries under site-packages/nvidia that contain .dll files for Windows and .so files for Linux by temporary updating the environmental variables within an ORT Inferencing session.

Motivation and Context

Request by

setup.py Show resolved Hide resolved
@snnn snnn requested a review from jywu-msft October 22, 2024 01:16
@snnn
Copy link
Member

snnn commented Oct 23, 2024

There are some test failures, Please fix them. We will remove the "orttraining-linux-gpu-ci-pipeline". The others still need to be taking care of.

Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You can commit the suggested changes from lintrunner.

onnxruntime/python/onnxruntime_cuda_temp_env.py Outdated Show resolved Hide resolved
@jchen351 jchen351 requested a review from tianleiwu November 7, 2024 22:31
@jchen351 jchen351 closed this Nov 11, 2024
onnxruntime/__init__.py Outdated Show resolved Hide resolved
onnxruntime/__init__.py Outdated Show resolved Hide resolved
onnxruntime/__init__.py Outdated Show resolved Hide resolved
@jchen351 jchen351 requested a review from tianleiwu December 16, 2024 20:27
onnxruntime/__init__.py Outdated Show resolved Hide resolved
@jchen351 jchen351 requested a review from tianleiwu December 17, 2024 02:05
onnxruntime/__init__.py Outdated Show resolved Hide resolved
onnxruntime/__init__.py Outdated Show resolved Hide resolved
@jchen351 jchen351 requested a review from tianleiwu December 19, 2024 03:53
setup.py Outdated
if cuda_version:
f.write(f"cuda_version = '{cuda_version}'\n")
# cudart_versions are integers
cudart_versions = find_cudart_versions(build_env=True)
Copy link
Contributor

@tianleiwu tianleiwu Dec 20, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

find_cudart_versions only works on Linux. I think we can add a check of linux before calling find_cudart_versions to avoid a warning message in Windows.



# Load nvidia libraries from site-packages/nvidia if the package is onnxruntime-gpu
if cuda_version is not None and cuda_version != "":
Copy link
Contributor

@tianleiwu tianleiwu Dec 20, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In my test, cuda_version is still empty string. It is imported from onnxruntime.capi.onnxruntime_validation in line 73. That class only outputs cuda_version for training as below:

cuda_version = ""
if has_ortmodule:

We can remove the line of if has_ortmodule there.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does it mean the following code usually won't get executed?

@jchen351 jchen351 requested a review from tianleiwu December 22, 2024 02:33

try: # noqa: SIM105
from .build_and_package_info import cuda_version
except Exception:

Check notice

Code scanning / CodeQL

Empty except Note

'except' clause does nothing but pass and there is no explanatory comment.
@snnn snnn requested a review from Copilot December 30, 2024 23:05

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.

@snnn
Copy link
Member

snnn commented Dec 30, 2024

@gedoensmax , any comment?

@snnn
Copy link
Member

snnn commented Dec 31, 2024

@jchen351, I tried to run nightly pipelines with your changes but there were some failures. Would you please update your branch with main, so that I can re-run the pipelines again to check if the problem still exists? Before merging this PR, we should generate some test packages and manually test them locally.

@snnn
Copy link
Member

snnn commented Jan 9, 2025

@jchen351 , I tried the new package, but it didn't work.

# pip install onnxruntime-gpu[cuda_dlls] --pre --index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/ORT-Nightly/pypi/simple/
Looking in indexes: https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/ORT-Nightly/pypi/simple/
Collecting onnxruntime-gpu[cuda_dlls]
  Downloading https://aiinfra.pkgs.visualstudio.com/2692857e-05ef-43b4-ba9c-ccf1c22c437c/_packaging/7982ae20-ed19-4a35-a362-a96ac99897b7/pypi/download/onnxruntime-gpu/1.21.dev20250108002/onnxruntime_gpu-1.21.0.dev20250108002-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (291.9 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 291.9/291.9 MB 11.5 MB/s eta 0:00:00
Requirement already satisfied: flatbuffers in /usr/local/lib/python3.10/dist-packages (from onnxruntime-gpu[cuda_dlls]) (24.12.23)
Requirement already satisfied: protobuf in /usr/local/lib/python3.10/dist-packages (from onnxruntime-gpu[cuda_dlls]) (5.29.3)
Requirement already satisfied: numpy>=1.21.6 in /usr/local/lib/python3.10/dist-packages (from onnxruntime-gpu[cuda_dlls]) (2.2.1)
Requirement already satisfied: sympy in /usr/local/lib/python3.10/dist-packages (from onnxruntime-gpu[cuda_dlls]) (1.13.3)
Requirement already satisfied: coloredlogs in /usr/local/lib/python3.10/dist-packages (from onnxruntime-gpu[cuda_dlls]) (15.0.1)
Requirement already satisfied: packaging in /usr/local/lib/python3.10/dist-packages (from onnxruntime-gpu[cuda_dlls]) (24.2)
Requirement already satisfied: humanfriendly>=9.1 in /usr/local/lib/python3.10/dist-packages (from coloredlogs->onnxruntime-gpu[cuda_dlls]) (10.0)
Requirement already satisfied: mpmath<1.4,>=1.1.0 in /usr/local/lib/python3.10/dist-packages (from sympy->onnxruntime-gpu[cuda_dlls]) (1.3.0)
Installing collected packages: onnxruntime-gpu
Successfully installed onnxruntime-gpu-1.21.0.dev20250108002

Could you please verify?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
4 participants