Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Skip MultiHeadAttentionTest for ROCmEP #11

Merged
merged 1 commit into from
Jun 8, 2023

Conversation

groenenboomj
Copy link

The rest of the test suite is skipped for ROCm. Adding skips for new tests.

Description

Add skips for ROCm for relevant tests in suite MultiHeadAttentionTest to match the rest of the suite.

Motivation and Context

Enable consistency in test results and intents.

The rest of the test suite is skipped for ROCm. Adding skips
for new tests.
@groenenboomj groenenboomj requested a review from jeffdaily June 8, 2023 21:45
@groenenboomj groenenboomj self-assigned this Jun 8, 2023
@groenenboomj groenenboomj merged commit 1b9f079 into rocm5.7_internal_testing Jun 8, 2023
jagadish-amd pushed a commit to jagadish-amd/onnxruntime that referenced this pull request Sep 10, 2024
… transient connection exceptions. (microsoft#21612)

### Description
Improve docker commands to make docker image layer caching works.
It can make docker building faster and more stable.
So far, A100 pool's system disk is too small to use docker cache.
We won't use pipeline cache for docker image and remove some legacy
code.

### Motivation and Context
There are often an exception of
```
64.58 + curl https://nodejs.org/dist/v18.17.1/node-v18.17.1-linux-x64.tar.gz -sSL --retry 5 --retry-delay 30 --create-dirs -o /tmp/src/node-v18.17.1-linux-x64.tar.gz --fail
286.4 curl: (92) HTTP/2 stream 0 was not closed cleanly: INTERNAL_ERROR (err 2)
```
Because Onnxruntime pipeline have been sending too many requests to
download Nodejs in docker building.
Which is the major reason of pipeline failing now

In fact, docker image layer caching never works.
We can always see the scrips are still running
```
ROCm#9 [3/5] RUN cd /tmp/scripts && /tmp/scripts/install_centos.sh && /tmp/scripts/install_deps.sh && rm -rf /tmp/scripts
ROCm#9 0.234 /bin/sh: warning: setlocale: LC_ALL: cannot change locale (en_US.UTF-8)
ROCm#9 0.235 /bin/sh: warning: setlocale: LC_ALL: cannot change locale (en_US.UTF-8)
ROCm#9 0.235 /tmp/scripts/install_centos.sh: line 1: !/bin/bash: No such file or directory
ROCm#9 0.235 ++ '[' '!' -f /etc/yum.repos.d/microsoft-prod.repo ']'
ROCm#9 0.236 +++ tr -dc 0-9.
ROCm#9 0.236 +++ cut -d . -f1
ROCm#9 0.238 ++ os_major_version=8
....
ROCm#9 60.41 + curl https://nodejs.org/dist/v18.17.1/node-v18.17.1-linux-x64.tar.gz -sSL --retry 5 --retry-delay 30 --create-dirs -o /tmp/src/node-v18.17.1-linux-x64.tar.gz --fail
ROCm#9 60.59 + return 0
...
```

This PR is improving the docker command to make image layer caching
work.
Thus, CI won't send so many redundant request of downloading NodeJS.
```
ROCm#9 [2/5] ADD scripts /tmp/scripts
ROCm#9 CACHED

ROCm#10 [3/5] RUN cd /tmp/scripts && /tmp/scripts/install_centos.sh && /tmp/scripts/install_deps.sh && rm -rf /tmp/scripts
ROCm#10 CACHED

ROCm#11 [4/5] RUN adduser --uid 1000 onnxruntimedev
ROCm#11 CACHED

ROCm#12 [5/5] WORKDIR /home/onnxruntimedev
ROCm#12 CACHED
```

###Reference
https://docs.docker.com/build/drivers/

---------

Co-authored-by: Yi Zhang <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant