
feat: delta to cumulative prometheus #9919

Merged

Conversation

@locmai (Contributor) commented May 10, 2022

Description:

An attempt to fix issue #4153 by adding delta-to-cumulative conversion to the Prometheus exporter.

Converting delta points to cumulative points is inherently a stateful operation. To translate successfully, all incoming delta points need to reach one destination that can keep the current counter state.

For the Prometheus exporter, that state is kept in the maps of the lastValueAccumulator, and the accumulateSum function is the destination where incoming delta points are merged.

I handle the first drop case (unspecified aggregation temporality) and then drop all non-monotonic delta aggregations, so non-monotonic values are never added to the cumulative sum. Step by step:

Upon receiving the first Delta point for a given counter we set up the following:

  • A new counter which stores the cumulative sum, set to the initial counter.
  • A start time that aligns with the start time of the first point.
  • A "last seen" time that aligns with the time of the first point.

This is already handled by the line that loads from registeredMetrics: if no entry exists, we create a new one exactly as defined above.

v, ok := a.registeredMetrics.Load(signature)
if !ok {
    // First point for this series: register a new cumulative metric
    // seeded with the incoming delta point.
    m := createMetric(metric)
    m.Sum().SetIsMonotonic(metric.Sum().IsMonotonic())
    m.Sum().SetAggregationTemporality(pmetric.MetricAggregationTemporalityCumulative)
    ip.CopyTo(m.Sum().DataPoints().AppendEmpty())
    a.registeredMetrics.Store(signature, &accumulatedValue{value: m, resourceAttrs: resourceAttrs, instrumentationLibrary: il, updated: now})
    n++
    continue
}

The next three conditions handle the follow-up data points:

If the next point aligns with the expected next-time window (see detecting delta restarts):

  • Update the "last seen" time to align with the time of the current point.
  • Add the current value to the cumulative counter
  • Output a new cumulative point with the original start time and current last seen time and count.

The lines I added do this: they keep the original start time, update the "last seen" time (the point's Timestamp), and add the new value to the running sum.

// Delta-to-Cumulative
if doubleSum.AggregationTemporality() == pmetric.MetricAggregationTemporalityDelta {
    ip.SetStartTimestamp(mv.value.Sum().DataPoints().At(0).StartTimestamp())
    ip.SetIntVal(ip.IntVal() + mv.value.Sum().DataPoints().At(0).IntVal())
}

If the current point precedes the start time, then drop this point.

This is already handled by the existing check: if ip.Timestamp().AsTime().Before(…) is true, the point is dropped.

if ip.Timestamp().AsTime().Before(mv.value.Sum().DataPoints().At(0).Timestamp().AsTime()) {
    // only keep the datapoint with the latest timestamp
    continue
}

If the next point does NOT align with the expected next-time window, then reset the counter following the same steps performed as if the current point was the first point seen.

This last one is tricky because I couldn't find any definition of the "expected next-time window". For the Prometheus exporter, we have the expirationTime for metrics, which automatically deletes them from the registeredMetrics map; I'm hoping that can serve as the way to reset the cumulative sum back to a new start time, and then let the Prometheus scrape handle the rest.
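
For illustration, here is a rough sketch of how expiration could act as that reset path. It is based on the fields shown in the snippets above (registeredMetrics, updated); the expiration field name and the loop itself are assumptions, not the exporter's exact code:

a.registeredMetrics.Range(func(key, value interface{}) bool {
    v := value.(*accumulatedValue)
    // If a series has not been updated within the expiration window, drop its
    // state; the next delta point for it re-registers a fresh cumulative sum
    // with a new start time.
    if now.Sub(v.updated) > a.metricExpiration {
        a.registeredMetrics.Delete(key)
    }
    return true
})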

Testing:

I have a simple config.yaml to set the metrics pipeline up:

receivers:
  statsd:
    endpoint: "0.0.0.0:8127"
    aggregation_interval: 5s
    enable_metric_type: true
    is_monotonic_counter: true
    timer_histogram_mapping:
      - statsd_type: "histogram"
        observer_type: "summary"
      - statsd_type: "timing"
        observer_type: "summary"
processors:
  batch:
exporters:
  prometheus:
    endpoint: "0.0.0.0:9090"
    metric_expiration: 180m
  file:
    path: ./metrics.json
service:
  pipelines:
    metrics:
      receivers: [statsd]
      processors: [batch]
      exporters: [prometheus]
    metrics/file:
      receivers: [statsd]
      processors: [batch]
      exporters: [file]

Then I built the binary from source and ran it with the config.yaml from the bin directory:

make otelcontribcol
otelcontribcol --config config.yaml

Then I tested with a few nc commands (against the statsd receiver) to see whether the sum accumulates correctly:

echo "test.metric:10|c|#myKey:myVal" | nc -w 1 -u localhost 8127
echo "test.metric:10|c|#myKey:myVal" | nc -w 1 -u localhost 8127
echo "test.metric:20|c|#myKey:myVal" | nc -w 1 -u localhost 8127

I also tested with a different statsd type, gauge:

echo "test.gauge:20|g|#myKey:myVal" | nc -w 1 -u localhost 8127
echo "test.gauge:10|g|#myKey:myVal" | nc -w 1 -u localhost 8127
echo "test.gauge:30|g|#myKey:myVal" | nc -w 1 -u localhost 8127

Documentation: Updated CHANGELOG.md

Link to tracking Issue: #4153

Some discussion in the previous PR: #7156

A review thread on the added Delta-to-Cumulative lines:

// Delta-to-Cumulative
if doubleSum.AggregationTemporality() == pmetric.MetricAggregationTemporalityDelta {
    ip.SetStartTimestamp(mv.value.Sum().DataPoints().At(0).StartTimestamp())
    ip.SetIntVal(ip.IntVal() + mv.value.Sum().DataPoints().At(0).IntVal())
Contributor:

Does this line need to support float64-valued counters separately?

Contributor Author:

Yeah the Prometheus counter is float64-based, so I think we should do that.

Should it be:

ip.SetIntVal(ip.IntVal() + mv.value.Sum().DataPoints().At(0).IntVal())
ip.SetDoubleVal(ip.DoubleVal() + mv.value.Sum().DataPoints().At(0).DoubleVal())

Or should we convert the IntVal to the DoubleVal type and then add them up?

Or just do a simple if/else to check whether the current delta value / last cumulative value is Int or Double?

Related to this, let me check whether any other receiver produces delta counters, since the StatsD receiver always parses its counters to Int.

Contributor:

I see -- what you have is correct for statsd to PRW, but I was thinking of other OTLP senders (e.g., an SDK) configured for delta temporality that might use floating point. There is a pdata.MetricValueType that indicates what the incoming point has. I would say that since PRW exports floating points always, possibly the best solution is to convert points from integer (if present) to double somewhere above the accumulator, so that the stored point is always a floating point.
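
As a rough illustration of that suggestion (not code from this PR), normalizing an integer-valued point to double before it reaches the accumulator could look like this, assuming the pmetric NumberDataPoint API of this release:

if ip.ValueType() == pmetric.NumberDataPointValueTypeInt {
    // SetDoubleVal also switches the point's value type to Double,
    // so every stored point ends up floating-point.
    ip.SetDoubleVal(float64(ip.IntVal()))
}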

Contributor Author:

Roger. Working on this :loading:

Comment on lines +215 to +219
switch ip.ValueType() {
case pmetric.NumberDataPointValueTypeInt:
    ip.SetIntVal(ip.IntVal() + mv.value.Sum().DataPoints().At(0).IntVal())
case pmetric.NumberDataPointValueTypeDouble:
    ip.SetDoubleVal(ip.DoubleVal() + mv.value.Sum().DataPoints().At(0).DoubleVal())
Contributor Author:

The final value is converted to float64 in convertSum():

switch ip.ValueType() {
case pmetric.NumberDataPointValueTypeInt:
    value = float64(ip.IntVal())
case pmetric.NumberDataPointValueTypeDouble:
    value = ip.DoubleVal()
}

Contributor:

If a client were to mix number types, something irregular happens here, e.g., if the stored point has a floating point value and the new point has an integer value. I'm OK ignoring this case, since there are already caveats about what we're doing here. Instead of changing the code for corner cases, I recommend documenting what this will do.

As it stands, your change means that a Prometheus exporter can aggregate a single stream of delta temporality counter data into a single cumulative metric. This will be the case when there is one statsd receiver. If a single OTLP SDK exports delta temporality to this OTC, a single delta temporality counter metric will be correctly aggregated here.

However, if multiple statsd receivers or OTel SDKs generate the same stream using delta temporality, this code will not be able to correctly aggregate; the same is true if one stream contains mixed number types, but that hardly seems important given this other limitation.

To overcome the "Single stream" limitation, the exporter can either:

  1. Blindly apply all deltas, regardless of timing. As long as all the producers behave correctly, there is little opportunity for incorrectness except due to replayed data.
  2. Maintain a map of start-times already applied, possibly with resource information. If the same resource updates a cumulative metric repeatedly, that's when the start_time==end_time test adds correctness.

I think that this change is useful even with the caveat that it only supports one stream, for now. OTOH, blindly applying all deltas isn't very wrong and is very simple. What do you think?
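
As a hypothetical illustration of option 2 (not part of this PR; all names here are invented), the exporter could remember the last start time applied per stream, keyed by signature and resource, and skip replayed or overlapping deltas:

import (
    "sync"

    "go.opentelemetry.io/collector/pdata/pcommon"
)

// streamKey identifies a stream by its metric/attribute signature plus a
// serialized form of the producing resource's attributes.
type streamKey struct {
    signature string
    resource  string
}

// deltaTracker remembers the last start timestamp applied for each stream so
// that replayed or overlapping deltas can be detected and dropped (or reported).
type deltaTracker struct {
    mu      sync.Mutex
    applied map[streamKey]pcommon.Timestamp
}

func (t *deltaTracker) shouldApply(k streamKey, start pcommon.Timestamp) bool {
    t.mu.Lock()
    defer t.mu.Unlock()
    if last, ok := t.applied[k]; ok && start <= last {
        return false // overlapping or replayed delta
    }
    t.applied[k] = start
    return true
}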

Contributor Author (@locmai) May 13, 2022:

From the specification of the single-writer principle:

Multiple writers for a metric stream is considered an error state, or misbehaving system. Receivers SHOULD presume a single writer was intended and eliminate overlap / deduplicate.

If I understood that correctly, I would consider that case a configuration error, and it should be the responsibility of the receivers to handle it properly.

In this case, I would prefer to assume that all deltas arriving at the exporter come from a single stream with correct timing.

Instead of blindly applying all the deltas, we could handle it:

OpenTelemetry collectors SHOULD export telemetry when they observe overlapping points in data streams, so that the user can monitor for erroneous configurations.

So if any data points fall into the overlapping case, we report it?

Contributor (@jmacd) May 13, 2022:

I like your reasoning.
I was under the impression that the exporter at this point does not keep the Resource to distinguish the sender of the stream; it may be that two senders with different resources are producing the same metric and attributes--that's the case I was thinking of. In any case, you've done a good thing here and I don't want to block it, let's document what it does and move on!

Contributor Author:

Nice nice nice! Thank you for helping me out with this one!

Hi @Aneurysm9, could you also take a look? This is for the very old issue we discussed before about fixing the dropped-metrics problem.

Member:

I agree that documentation of the expected behavior here would be helpful. Also some tests that exercise this capability.

As for the case of two producers of the same metric with different resources, we do have the resource attributes available at this point but don't seem to include them in the timeseries signature. Would doing that remove the concern about improper accumulation? What knock-on effects might that have?

Contributor Author:

For documenting the expected behavior, I believe I should open another PR to update the Prometheus data model specification for this change?

With the resource attributes, regardless of the single-writer principle, we could identify the two producers and therefore handle the accumulation properly for the overlapping case or when nextDataPoint.startTimestamp > lastDataPoint.Timestamp. I see no side effects yet.
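
For illustration only, here is a sketch of folding resource attributes into the timeseries signature so two producers of the same metric and attributes are accumulated separately. It is loosely modeled on the exporter's signature helper; the exact function name and parameters are assumptions:

func timeseriesSignature(ilmName string, metric pmetric.Metric, attributes pcommon.Map, resourceAttrs pcommon.Map) string {
    var b strings.Builder
    b.WriteString(metric.DataType().String())
    b.WriteString("*" + ilmName)
    b.WriteString("*" + metric.Name())
    attributes.Sort().Range(func(k string, v pcommon.Value) bool {
        b.WriteString("*" + k + "*" + v.AsString())
        return true
    })
    // Including resource attributes keeps streams from two producers of the
    // same metric name and attribute set from sharing one accumulator entry.
    resourceAttrs.Sort().Range(func(k string, v pcommon.Value) bool {
        b.WriteString("*" + k + "*" + v.AsString())
        return true
    })
    return b.String()
}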

Contributor Author:

Adding the spec update here: open-telemetry/opentelemetry-specification#2570

I'll start working on the tests this week.

Contributor Author:

Hi folks @jmacd @Aneurysm9, I just updated the specification (merged open-telemetry/opentelemetry-specification#2570) and the tests for this PR.

Bumped the coverage from 97.7% to 99.3% of statements.

The original TestAccumulateDeltaAggregation simply tested the cases where metrics got dropped for delta aggregation of both Sum and Histogram (since they are non-monotonic by default), so I moved some of them to a new TestAccumulateDroppedMetrics alongside the MetricAggregationTemporalityUnspecified cases.

@jmacd (Contributor) commented Jun 2, 2022

I'm still enthusiastic about this PR. I would like to see the same functional change in the prometheusremotewriteexporter.

@leocavalcante commented:

Is this working already?

I have the following config at 0.82.0:

receivers:
  statsd:
    endpoint: 0.0.0.0:8125
    aggregation_interval: 5s
    is_monotonic_counter: true

exporters:
  logging:
    verbosity: detailed
  prometheus:
    endpoint: 0.0.0.0:9090

service:
  telemetry:
    logs:
      level: "debug"
  pipelines:
    metrics:
      receivers: [statsd]
      exporters: [logging, prometheus]

When I send 3 delta counters to statsd:

foo:1|c
foo:1|c
foo:1|c

I expect to see it accumulated at /metrics but I see 1 instead of 3:

# HELP foo 
# TYPE foo counter
foo 1

Labels: comp:prometheus (Prometheus related issues)
7 participants