Move Python Training CUDA 12.2 pipeline to another pool. #21745

Merged
mszhanyi merged 3 commits into main from zhanyi/updatepool
Aug 15, 2024

Conversation

mszhanyi
Contributor

@mszhanyi mszhanyi commented Aug 15, 2024

Description

Motivation and Context

The Python Training CUDA 12.2 pipeline has been consistently cancelled by the remote provider since Aug 2nd.
Other workflows using the same pool don't have this issue.
It looks like something is going wrong on the Azure DevOps side.
Switching to another pool works. In fact, the new pool's SKU is smaller than the old one's.

Verification

https://dev.azure.com/aiinfra/Lotus/_build?definitionId=1308&_a=summary

@mszhanyi mszhanyi requested a review from a team as a code owner August 15, 2024 06:52
@mszhanyi mszhanyi added the release:1.19.0 Cherry pick to ORT 1.19 label Aug 15, 2024
@mszhanyi mszhanyi merged commit 8a59b4d into main Aug 15, 2024
94 of 97 checks passed
@mszhanyi mszhanyi deleted the zhanyi/updatepool branch August 15, 2024 09:31
prathikr pushed a commit that referenced this pull request Aug 15, 2024
### Description
<!-- Describe your changes. -->



### Motivation and Context
The [Python Training CUDA 12.2
pipeline](https://dev.azure.com/aiinfra/Lotus/_build?definitionId=1308&_a=summary)
has been consistently cancelled by the remote provider since Aug 2nd.
Other workflows using the same pool don't have this issue.
It looks like something is going wrong on the Azure DevOps side.
Switching to another pool works. In fact, the new pool's SKU is smaller
than the old one's.

### Verification
https://dev.azure.com/aiinfra/Lotus/_build?definitionId=1308&_a=summary
prathikr pushed a commit that referenced this pull request Aug 17, 2024
prathikr pushed a commit that referenced this pull request Aug 20, 2024
Labels
release:1.19.0 Cherry pick to ORT 1.19