Add Onnxruntime parity checks to CI #1938
Run the three parity checks that are used for onnxruntime as part of CI. This can be expanded later if needed.
…y tests
Do the following:
- Split build_and_test.sh into build_and_install.sh and test_onnxrt_unit_test.py
- Add parity tests to test_onnxrt_parity_tests.sh
- Parallelize unit and parity tests for MIGraphX-Onnxruntime integration in jenkins
- Add changes to dockerfile for additional run scripts.

The idea here is that unit tests take a while. In the meantime, if we perform a wheel build of the onnxruntime library and then run the appropriate parity tests, we should be able to catch any odd changes in parity that are missed by the unit tests and that would also appear in additional testing.
Check results before merge 🔆
🔴 torchvision-inceptionv3_1: FAILED: MIGraphX is not within tolerance - check verbose output
🔴 cadene-dpn92_1: FAILED: MIGraphX is not within tolerance - check verbose output
🔴 slim-inceptionv4_1: FAILED: MIGraphX is not within tolerance - check verbose output
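For readers unfamiliar with what "not within tolerance" means in the results above, here is a minimal sketch of an element-wise tolerance check. It is not the check the bot actually runs. The function names and the `rtol`/`atol` defaults are assumptions chosen for illustration only.

```python
import math


def within_tolerance(reference, candidate, rtol=1e-3, atol=1e-3):
    """Return True when every candidate value is close to its reference value.

    rtol and atol are illustrative defaults, not the thresholds used in CI.
    """
    if len(reference) != len(candidate):
        return False
    return all(
        math.isclose(r, c, rel_tol=rtol, abs_tol=atol)
        for r, c in zip(reference, candidate)
    )


def report(model_name, reference, candidate, **tolerances):
    """Format a one-line pass/fail result for a single (hypothetical) model run."""
    ok = within_tolerance(reference, candidate, **tolerances)
    return f"{model_name}: {'PASSED' if ok else 'FAILED'}"
```

A real comparison would operate on full output tensors, but the pass/fail decision follows the same shape: compare each output against a reference and fail the model if any element falls outside the allowed band.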
Using groovy to debug issues in jenkins file
Fixing new line and names
Codecov Report
@@ Coverage Diff @@
## develop #1938 +/- ##
========================================
Coverage 91.49% 91.49%
========================================
Files 430 430
Lines 16129 16129
========================================
Hits 14758 14758
Misses 1371 1371
need to call the script the proper name
odd seeing
@@ -127,3 +129,8 @@ ENV LD_LIBRARY_PATH=$PREFIX/lib
ENV UBSAN_OPTIONS=print_stacktrace=1
ENV ASAN_OPTIONS=detect_stack_use_after_return=1:check_initialization_order=1:strict_init_order=1
RUN ln -s /opt/rocm/llvm/bin/llvm-symbolizer /usr/bin/llvm-symbolizer

#install dependancies used for parity checks
RUN pip3 install psutil==5.9.5 onnx==1.10.2 coloredlogs==15.0.1 packaging==23.1 transformers==4.29.2 sympy==1.12
We now support the latest onnx version. 1.10.2 is too old and doesn't have a bug fix about model location changes. I would rather there be a requirements file to handle test specific items.
cd /workspace/onnxruntime/onnxruntime/test/python/transformers/

#Install latest stable torch version
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm5.4.2
Is 5.4.2 really the latest stable? Would this even be compatible with ROCm 5.7 or 6.0? How useful would this be if the version of migx being tested is 5.7 or 6.0?
I haven't touched this in a while; I can update it.
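One way to address the version concern raised in this thread is to derive the wheel index URL from the ROCm version under test instead of hard-coding `rocm5.4.2`. The sketch below assumes the URL pattern seen in the diff (`https://download.pytorch.org/whl/rocm<version>`) holds for other versions. It does not check that a wheel index actually exists for a given ROCm release, so that still needs to be confirmed separately. The helper names are hypothetical.

```python
import re

# Base URL taken from the pip command in the diff above.
PYTORCH_WHEEL_BASE = "https://download.pytorch.org/whl"


def rocm_index_url(rocm_version):
    """Build a wheel index URL for a ROCm version string such as '5.4.2'.

    Assumption: the index follows the 'rocm<version>' naming seen in the diff.
    Whether an index exists for that version is NOT verified here.
    """
    if not re.fullmatch(r"\d+\.\d+(\.\d+)?", rocm_version):
        raise ValueError(f"unexpected ROCm version: {rocm_version!r}")
    return f"{PYTORCH_WHEEL_BASE}/rocm{rocm_version}"


def pip_install_command(rocm_version):
    """Return the pip argument list for installing torch against that index."""
    return [
        "pip", "install", "torch", "torchvision", "torchaudio",
        "--index-url", rocm_index_url(rocm_version),
    ]
```

Passing the ROCm version in as a parameter keeps the torch install aligned with whichever ROCm release the MIGraphX build is being tested against.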
@@ -1,7 +1,7 @@
#####################################################################################
# The MIT License (MIT)
#
# Copyright (c) 2015-2022 Advanced Micro Devices, Inc. All rights reserved.
# Copyright (c) 2015-2023 Advanced Micro Devices, Inc. All rights reserved.
License checker will flag this. Change to 2024.
Closing this out. Superseded by more recent syncs to Onnxruntime main.
Add onnxruntime parity checks to our CI after we build onnxruntime. The goal of this changeset is to ensure the tests run without errors or issues; it isn't concerned with performance.
Parallelize the parity tests with the existing unit test run, and organize this so we can keep unit, install, and parity checks separate in our pipeline. ORT is now built with --build_wheel, and the resulting wheel is used by the final two stages (unit tests and parity).
Related to the issues seen in DLM, where parity tests fail when we move ROCm versions. This change is to help get ahead of things.
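The idea of running the unit and parity stages side by side after a shared build can be illustrated outside Jenkins with a small sketch. This is not the Jenkinsfile from this PR. The stage commands are placeholders standing in for the real unit and parity scripts, and the helper names are hypothetical.

```python
import subprocess
import sys
from concurrent.futures import ThreadPoolExecutor


def run_stage(name, command):
    """Run one stage as a subprocess and return (name, exit code)."""
    completed = subprocess.run(command, capture_output=True, text=True)
    return name, completed.returncode


def run_parallel(stages):
    """Run all stages concurrently and return {stage name: exit code}."""
    with ThreadPoolExecutor(max_workers=max(1, len(stages))) as pool:
        futures = [pool.submit(run_stage, name, cmd) for name, cmd in stages.items()]
        return dict(future.result() for future in futures)


# Placeholder commands; a real pipeline would call the unit and parity scripts.
STAGES = {
    "unit": [sys.executable, "-c", "print('unit tests placeholder')"],
    "parity": [sys.executable, "-c", "print('parity tests placeholder')"],
}

if __name__ == "__main__":
    results = run_parallel(STAGES)
    failed = [name for name, code in results.items() if code != 0]
    print("failed stages:", failed or "none")
```

Because each stage reports its own exit code, a parity failure stays visible even when the slower unit stage is still running or passes.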
Related issues
#1935 & #1877