What happened:
When a workload using the preferred topology annotation cannot be admitted, it sometimes reports a negative number of pods in the workload's status message.
What you expected to happen:
Negative pod counts should never be reported; they do not make sense.
How to reproduce it (as minimally and precisely as possible):
I have a reproducible scenario, but it is not yet simplified enough to be included here. I will simplify and post.
Basically, I create multiple jobs using the preferred topology annotation, and the workload corresponding to the non-admitted job reports a message like:
status:
  conditions:
  - lastTransitionTime: "2024-11-07T13:01:01Z"
    message: 'couldn''t assign flavors to pod set main: insufficient unused quota
      for cpu in flavor tas-flavor, 30 more needed, topology "gke-default" allows
      to fit only -18 out of 40 pod(s)'
    observedGeneration: 1
    reason: Pending
    status: "False"
    type: QuotaReserved
mimowo changed the title from "TAS: when using preferred topology annotation a non-admitted workload may report negative values of pods" to "TAS: may report negative values of pods in messages for non-admitted workloads" on Nov 8, 2024
After investigation, it turns out that this is not only related to the preferred annotation, so I retitled the issue.
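The root cause is that the usage of TAS workloads is multiplied by the number of resources, because of this loop, which iterates over pairs of (resource, flavor). So it would add the same usage for (cpu, tas-flavor), (memory, tas-flavor), and (gpu, tas-flavor). Once the over-counted usage exceeds the real capacity, the remaining capacity becomes negative, which is how a negative pod count ends up in the message.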
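A minimal sketch of this kind of over-counting, not the actual Kueue code. The type flavorResource, the functions buggyPodUsage and dedupedPodUsage, and all numbers are hypothetical and chosen only for illustration. Deduplicating by flavor is one possible way to avoid the problem and is not necessarily the fix that was applied.

```go
package main

import "fmt"

// flavorResource is a hypothetical (flavor, resource) pair, standing in for
// the pairs that the loop described above iterates over.
type flavorResource struct {
	Flavor   string
	Resource string
}

// buggyPodUsage adds the workload's pod count once per (flavor, resource)
// pair, so a flavor listed for three resources is counted three times.
func buggyPodUsage(pairs []flavorResource, pods int) map[string]int {
	usage := map[string]int{}
	for _, fr := range pairs {
		usage[fr.Flavor] += pods
	}
	return usage
}

// dedupedPodUsage adds the pod count only once per flavor, regardless of how
// many resources reference that flavor.
func dedupedPodUsage(pairs []flavorResource, pods int) map[string]int {
	usage := map[string]int{}
	seen := map[string]bool{}
	for _, fr := range pairs {
		if seen[fr.Flavor] {
			continue
		}
		seen[fr.Flavor] = true
		usage[fr.Flavor] += pods
	}
	return usage
}

func main() {
	pairs := []flavorResource{
		{"tas-flavor", "cpu"},
		{"tas-flavor", "memory"},
		{"tas-flavor", "gpu"},
	}
	const capacity = 50 // hypothetical number of pods the topology can hold
	const admitted = 20 // hypothetical pods of an already admitted workload

	buggy := buggyPodUsage(pairs, admitted)["tas-flavor"]
	fixed := dedupedPodUsage(pairs, admitted)["tas-flavor"]

	fmt.Printf("buggy: usage=%d free=%d\n", buggy, capacity-buggy) // usage=60 free=-10
	fmt.Printf("fixed: usage=%d free=%d\n", fixed, capacity-fixed) // usage=20 free=30
}
```

With three resources on the same flavor, the usage is counted three times, so the free capacity drops below zero even though the topology still has room.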
Anything else we need to know?:
It was reported first here: #3211 (comment)