-
Notifications
You must be signed in to change notification settings - Fork 894
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Metric standard SDK behavior specification #259
Comments
I am confused "non-monotonic counter needs a firm mapping", this is the Prometheus Gauge and probably that's why people are also confused about our definition of Gauge. |
@bogdandrutu what do you think about case 3? |
For 3 that is a Prometheus Counter. Everything that is monotonic is a Counter in Prometheus. We have two options to implement keep the last value and record delta if we want to use their main API or implement our own registry in Prometheus that |
Also Prometheus is moving to a non-negative histogram/summary based on OpenMetrics. |
"For 3 that is a Prometheus Counter". The problem is that Prometheus Counter doesn't support |
Correct, but we need to understand the difference between API and data model in Prometheus. From data model perspective that is a Counter and in order to achieve this as I mentioned in the previous post we can use a Registry instead of using directly the API. This is how they also do it for another counter cpu-usage. |
Sorry I used the wrong term "Registry". We will use Here is an example for the cpu-usage and few other metrics: https://github.com/prometheus/client_golang/blob/3525612fea19680dd3d49c768802ec082ca853b2/prometheus/process_collector.go |
Cool, this makes sense. |
Here is a table that tries to summarize the discussion so far:
|
It sounds to me like this should just be part of the spec, in the same way that If the argument for keeping aggregation and export logic out of the API package is the same as for tracing then it seems like the spec would be incomplete without a description of the SDK. We've effectively got two APIs, one for library devs in the "API" package, and another for vendors in the "SDK" package. |
I've posted a prototype metrics SDK to help with this issue. |
I've posted a complete metric exporter: |
JS now also has a complete metric exporter open-telemetry/opentelemetry-js#483
In the JS version, if it is a counter we just destroy and recreate it with the same name and set the new value. There is no internal ID so the prometheus backend has no way of knowing that we are resetting the counter each time it changes and it "just works". |
@mayurkale22 asked that I post a summary of our discussions in the JS SIG here. It is my opinion that we should be only mapping to a different data type in cases where it causes issues. Too many behind the scenes changes could be confusing for the users. For gauges and counters I would suggest the following mapping, see open-telemetry/opentelemetry-js#553 (comment) for context:
See open-telemetry/opentelemetry-js#561 for an example of this behavior. |
This issue is dangerously ambiguous and covers many topics. For the time being, note that #347 contains the current draft of the metrics SDK specification that was requested in this issue. I will close this issue. |
In #250 we've tried to steer away from specifying the behavior of the library (as there are so many possibilities) and instead focus on the semantics of the events.
It's become clear that people are confused about the behavior of the SDK because it's literally unspecified as part of the API. I believe we should prepare a companion specification to explain what the initial OpenTelemetry SDK is targeting for metrics export. This could go as far as describing an API to configure views and aggregations, or a YAML syntax to support standard configuration.
For a prometheus exporter, we need to decide on certain behaviors that are "not semantic" and therefore haven't been described in the API specification. For each of the 6 sub-kinds of metric, we need to choose an export type for Prometheus. For example, our monotonic counter becomes a prometheus counter, but our non-monotonic counter needs a firm mapping. For measure metrics, I believe we should choose a histogram metric export w/ default bucket ranges. I could imagine making the choice of Histogram vs. Summary metric be an option for the API or YAML syntax mentioned above, with support for configurable bucket ranges and configurable quantiles.
The text was updated successfully, but these errors were encountered: