Replies: 1 comment
-
Seems like a bug in label selectors we use for both queries. For example this shows more correct results:
I suggest to open a bug report in kubernetes-mixin project |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Trying to determine what the difference is between the kube metrics as seen and leveraged in two different ways by OCP out-of-box custom records in Prometheus. If you simply sum each by pod and subtract from eachother you'll see the differences are very small, but on some higher CPU use pods like ETCD the difference it gets more skewed (greater diff).
My guess is that while both use "container_cpu_usage_seconds_total", one looks to sum/avg[5m] of active containers, whereas the later filters to a different metric that has the total CPU seconds for the entire life of the pod including any init container not still active?
{container!="POD",image!="",job="kubelet",metrics_path="/metrics/cadvisor"}
vs
{container="",pod!=""}
OR
What's further confusing, is taking the same raw metric sum/rate queries those records are based on and specifying them in your own query, same moment in time, I get different results.
Beta Was this translation helpful? Give feedback.
All reactions