-
Notifications
You must be signed in to change notification settings - Fork 68
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[serving] Updates onnxruntime to 1.20.0 and add integration tests #2615
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -16,7 +16,10 @@ ARG djl_version | |
ARG djl_serving_version | ||
ARG python_version=3.11 | ||
ARG djl_torch_version=2.5.1 | ||
ARG djl_onnx_version=1.19.0 | ||
ARG djl_onnx_version=1.20.0 | ||
|
||
# djl converter wheel for text-embedding use case | ||
ARG djl_converter_wheel="https://publish.djl.ai/djl_converter/djl_converter-0.31.0-py3-none-any.whl" | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. once we update this wheel to 0.32.0, will we be able to include it in the requirements file? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I can make some changes to include it in the requirements file There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Ideally we should be putting all python dependencies in the requirements file so that we can see any dependency conflicts at build time. The torch dependencies are outside for two reasons:
Other than that, we should put deps into the requirements file (torch is also included in the requirements file to ensure no version conflicts with other deps) There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Because requirements file does not support --no-deps, I have to move djl-convert wheel outside of requirements. I can remove the onnxruntime dependency from djl-convert, so we don't need --no-deps for djl-convert. |
||
|
||
EXPOSE 8080 | ||
|
||
|
@@ -81,6 +84,7 @@ RUN apt-get update && apt-get install -yq libaio-dev libopenmpi-dev g++ unzip cu | |
COPY requirements-lmi.txt ./requirements.txt | ||
RUN pip3 install torch==2.5.1 torchvision==0.20.1 --index-url https://download.pytorch.org/whl/cu124 && pip3 cache purge | ||
RUN pip3 install -r requirements.txt \ | ||
&& pip3 install ${djl_converter_wheel} --no-deps \ | ||
&& git clone https://github.com/neuralmagic/AutoFP8.git \ | ||
&& cd AutoFP8 \ | ||
&& git reset --hard 4b2092c \ | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
typo? should this be 1.20.1?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Onnxruntime java only has 1.20.0: https://mvnrepository.com/artifact/com.microsoft.onnxruntime/onnxruntime_gpu
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just realized I have 1.20.1 in title, fixed it.