Allow one observer to observe multiple metrics #549

marwan-at-work · 2020-04-07T13:23:40Z

Summary

In opentelemetry-go, one metric == one observer.

This proposal stresses the need for one observer to observe multiple metrics.

The best use case is for reading various system metrics in one call such as:

import "runtime"

var ms runtime.MemStats
runtime.ReadMemStats()

This operation (and others) can be expensive. This one in particular can populates ~30 variables where each one is fit to be in its own metric/measure. Therefore, the current way it would work would have to be the following:

mm := metric.Must(meter)

mm.RegisterInt64Observer("process/heap_alloc", func(result metric.Int64ObserverResult) {
	var ms runtime.MemStats
	runtime.ReadMemStats(&ms)
	result.Observe(int64(ms.HeapAlloc))
})
mm.RegisterInt64Observer("process/pause_ns", func(result metric.Int64ObserverResult) {
	var ms runtime.MemStats
	runtime.ReadMemStats(&ms)
	result.Observe(int64(ms.PauseNs[(ms.NumGC+255)%256]))
})
mm.RegisterInt64Observer("process/sys_heap", func(result metric.Int64ObserverResult) {
	var ms runtime.MemStats
	runtime.ReadMemStats(&ms)
	result.Observe(int64(ms.Sys))
})
// etc etc...

I hope you can see that this becomes inefficient fairly quickly.

Workaround

As a workaround, it was suggested on Gitter that we can just use Labels to substitute for multiple metrics. This has two issues:

You have to stick with either int64 or float64, you cannot use both in the same observer.
Using labels feels like a stretch. For an example, see https://github.com/marwan-at-work/otelruntime/blob/master/otelruntime.go#L52

Proposal

I propose that users should be able to register one call back that will populate multiple metrics as such:

mm.RegisterBatchInt64Observer(func(result metric.BatchInt64ObserverResult) {
	var ms runtime.MemStats
	runtime.ReadMemStats(&ms)
	result.Observe("process/heap_alloc", int64(ms.HeapAlloc)))
	result.Observe("process/sys_heap", int64(ms.PauseNs[(ms.NumGC+255)%256]))
	result.Observe("process/sys_heap", int64(ms.Sys)))
})

If metric names must be passed ahead of time, then you'd probably need to pass the metric names in the RegisterBatch<T>Observer function.

Otherwise, I'm happy to hear any other suggestions.

Thanks

The text was updated successfully, but these errors were encountered:

jmacd · 2020-04-07T15:58:06Z

From Gitter:

alrex
@codeboten
Apr 06 11:41
i thought about implementing this with a single callback and using labels instead of separate observers, but i wasn't sure if that would make things better or worst

Tristan Sloughter
@tsloughter
Apr 06 11:52
I was thinking about how to do similar for the Erlang VM metrics and asked in Java channel since they would need similar. John pointed me to the Java code and they use labels instead of many observers

Tristan Sloughter
@tsloughter
Apr 06 11:53
like https://github.com/open-telemetry/opentelemetry-java/blob/e248ea7ad74b3ac9774f871636c8f6b839be998f/contrib/runtime_metrics/src/main/java/io/opentelemetry/contrib/metrics/runtime/MemoryPools.java

alrex
@codeboten
Apr 06 12:56
right, it's similar to what's done in some examples in the python repo https://github.com/open-telemetry/opentelemetry-python/blob/master/docs/examples/basic_meter/observer.py#L39

obecny · 2020-04-07T22:08:33Z

the proposal -> this is how I implemented it in JS
And I think the spec says the same:
Multiple observations are permitted in a single callback invocation.
https://github.com/open-telemetry/oteps/blob/master/text/0072-metric-observer.md#observer-calling-conventions

jmacd · 2020-04-08T04:55:00Z

OTEP 72 does say that multiple observations are permitted per callback, but the intention at that point was to support multiple (value, label set) observations for a single metric. The language is unclear, but the pseudocode in the OTEP is somewhat clear.

In getting to OTEP 72 we discussed several related issues. We removed "bound" observer instruments, justifying this because LabelSets were still part of the API and support caching the label encoding used in an exporter.

Now that we have removed LabelSets from the API, the notion of a bound observer starts to look like a real optimization. What would an API look like to support multiple metrics to be observed from one callback?

We would register callbacks with the SDK directly, since they are not associated with metrics directly. We could have three analogous calling patterns to the synchronous instruments: direct calls, bound calls, and batch calls. Callbacks would be executed sequentially in the order they were registered. For example:

var instrument1 = global.Meter(...).RegisterInt64Observer(...)
var instrument2 = global.Meter(...).RegisterFloat64Observer(...)
var boundInst2 = instrument2.Bind(labels...)
var instrument3 = global.Meter(...).RegisterInt64Observer(...)

func Callback(result metric.ObserverResult) {

   // direct call
   result.Observe(instrument1, value1, labels1...)

   // bound call
   boundInst2.Observe(result, value2)

   // batch call
   result.ObserveBatch([]core.KeyValue{labels3}, instrument1.Measurement(value1), instrument3.Measurement(value3), ...)
}

This sort of change requires us to mention that callbacks are executed in order, whereas we didn't have to before. Last-value wins was previously applied to the result from one callback, but now it has to be applied to the result from all callbacks.

Thoughts? @bogdandrutu this is a significant problem, please have a look.

jkwatson · 2020-04-08T19:52:06Z

I like the idea of adding a callback interface to the Meter.

Java pseudocode:

meter.addObserver(Callback<Meter> meterCallback)

where Callback<T> looks something like this:

interface Callback<R> {
  void update(R input);
}

obecny · 2020-04-08T20:05:41Z

What if instead of function as a callback it would be an observable object. This way the problem with sync / async would be solved.
for example

const observable1 = new ObservableObject();
observerResult.observe(observable1, labels);


// anywhere in a code doesn't matter when 
observable1.next(1);
//or
setTimeout(()=>{
  observable1.next(1);
}, 1000)
// etc.

so every time you call observable1.next(1); it will simply update the metric.

The main problem when the callback is a function is that you have no easy way of getting data which requires async calls. And also you will have problem with blocking the sync call for longer than it is needed.
WDYT ?

jmacd · 2020-04-08T22:06:59Z

@obecny What you've written looks like a synchronous instrument through and through. There are reasons for Observers to be asynchronous, they're supposed to be a solution. The reasons:

Optimization for expensive-to-read measurements
Semantics of LastValue relationship
Context-free reporting API.

We had made the Observer callback support a single metric merely out of a desire to keep things as simple as possible. I believe this example demonstrates that we have oversimplified. The proposal I made above for bound observers surfaced in early in the 0.3 conversation IIRC, and we dropped it.

bogdandrutu · 2020-04-08T22:42:59Z

I think it should be possible to have the "batch-callback" similar to batch record. Probably not one per meter, but allow batching-callback is indeed something that we can support.

bogdandrutu · 2020-04-08T22:54:03Z

@jmacd I would rely on a combination of the BatchRecorder and ObserverResult to achieve the performance you mentioned:

var instrument1 = global.Meter(...).RegisterInt64Observer(...)
var instrument2 = global.Meter(...).RegisterFloat64Observer(...)
var instrument3 = global.Meter(...).RegisterInt64Observer(...)
global.Meter(...).RegisterBatchCallback(Callback)

func Callback(result metric.ObserverBatchResult) {
   // batch call
   result.ObserveBatch([]core.KeyValue{labels3}, instrument1.Measurement(value1), instrument3.Measurement(value3), ...)
}

// OR

func Callback(result metric.ObserverBatchResult) {
   // batch call
   result.BatchRecorder([]core.KeyValue{labels3})
             .add(instrument1.Measurement(value1))
             .add(instrument3.Measurement(value3))
             .record()
}

jmacd · 2020-04-09T06:54:04Z

I like this. There are a few details to work out, like:

Instruments may pass a null callback, they can be registered w/o defining a callback due to batch observations
Callbacks will be called in the ordered they were registered, whether on individual instruments or batch observer callbacks, to preserve last-value-wins semantics
I take it you mean that the above two calling patterns are both valid options, and may vary by language? I am on-board with this. I think Golang will take the first of these options, but recognize that others will adopt the second choice.

As we have removed LabelSets, the BatchRecorder pattern that you bring up stands in as a replacement for LabelSet. I wonder if you would allow this pattern:

var (
   meter = global.Meter("library")

   metricRecorder = meter.AsyncBatchRecorder(labels...) // Or "ObserverBatchRecorder"

    _ = meter.RegisterBatchCallback(
       func (result metric.ObserverBatchResult) {
              metricRecorder.record(result, 
                   instrument1.Measurement(value1),
                   instrument3.Measurement(value3),
               )
        })
)

I have the same question for synchronous usage, e.g.:

func F(ctx context.Context, ch <-chan T) {
   meter := global.Meter("library")
   metricRecorder := meter.SyncBatchRecorder(labels...) // Or "MeasureBatchRecorder"

   for elem := range ch {
              metricRecorder.record(ctx, 
                   instrument1.Measurement(elem.X),
                   instrument3.Measurement(elem.Y),
               )
        })
)

Note the only difference between Sync- and Async- BatchRecorders is that Sync- recorders take Context, while Async- recorders take ObserverBatchResult.

bogdandrutu · 2020-04-09T16:26:06Z

Instruments may pass a null callback, they can be registered w/o defining a callback due to batch observations
Callbacks will be called in the ordered they were registered, whether on individual instruments or batch observer callbacks, to preserve last-value-wins semantics

I would like to think of a solution where we can ensure only one callback is register at a time for every instrument, what do you think?

jmacd · 2020-04-09T17:33:05Z

I can't immediately think of how I'd do that. Do you have an API in mind?

jmacd · 2020-04-13T22:34:57Z

Possible OTel-Go APIs are being discussed here: open-telemetry/opentelemetry-go#634

jmacd · 2020-04-27T18:51:45Z

This API review is underway: open-telemetry/opentelemetry-go#634

marwan-at-work changed the title ~~[Proposa] Allow one observer to observe multiple metrics~~ [Proposal] Allow one observer to observe multiple metrics Apr 7, 2020

jmacd mentioned this issue Apr 7, 2020

metric/selector: return LastValue aggregator for Observer kinds open-telemetry/opentelemetry-go#619

Closed

jmacd mentioned this issue Apr 10, 2020

Metric Observer API changes for batch reporting open-telemetry/opentelemetry-go#633

Closed

bogdandrutu mentioned this issue Apr 11, 2020

Rationalize naming of metric instruments and their default aggregations open-telemetry/oteps#96

Closed

jmacd mentioned this issue Apr 12, 2020

Batch Observer API open-telemetry/opentelemetry-go#634

Closed

jmacd changed the title ~~[Proposal] Allow one observer to observe multiple metrics~~ Allow one observer to observe multiple metrics Apr 13, 2020

jmacd added the spec:metrics Related to the specification/metrics directory label Apr 13, 2020

This was referenced May 12, 2020

Batch Observer callback support open-telemetry/opentelemetry-go#717

Merged

Complete the metrics/api.md TODOs #598

Closed

Metrics API specification: incorporate old content on calling conventions, label sets. #601

Merged

bogdandrutu closed this as completed in #601 May 21, 2020

carlosalberto added this to the v0.5 milestone May 28, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow one observer to observe multiple metrics #549

Allow one observer to observe multiple metrics #549

marwan-at-work commented Apr 7, 2020

jmacd commented Apr 7, 2020

obecny commented Apr 7, 2020 •

edited

Loading

jmacd commented Apr 8, 2020

jkwatson commented Apr 8, 2020

obecny commented Apr 8, 2020 •

edited

Loading

jmacd commented Apr 8, 2020

bogdandrutu commented Apr 8, 2020

bogdandrutu commented Apr 8, 2020 •

edited

Loading

jmacd commented Apr 9, 2020 •

edited

Loading

bogdandrutu commented Apr 9, 2020 •

edited

Loading

jmacd commented Apr 9, 2020

jmacd commented Apr 13, 2020

jmacd commented Apr 27, 2020

Allow one observer to observe multiple metrics #549

Allow one observer to observe multiple metrics #549

Comments

marwan-at-work commented Apr 7, 2020

Summary

Workaround

Proposal

jmacd commented Apr 7, 2020

obecny commented Apr 7, 2020 • edited Loading

jmacd commented Apr 8, 2020

jkwatson commented Apr 8, 2020

obecny commented Apr 8, 2020 • edited Loading

jmacd commented Apr 8, 2020

bogdandrutu commented Apr 8, 2020

bogdandrutu commented Apr 8, 2020 • edited Loading

jmacd commented Apr 9, 2020 • edited Loading

bogdandrutu commented Apr 9, 2020 • edited Loading

jmacd commented Apr 9, 2020

jmacd commented Apr 13, 2020

jmacd commented Apr 27, 2020

obecny commented Apr 7, 2020 •

edited

Loading

obecny commented Apr 8, 2020 •

edited

Loading

bogdandrutu commented Apr 8, 2020 •

edited

Loading

jmacd commented Apr 9, 2020 •

edited

Loading

bogdandrutu commented Apr 9, 2020 •

edited

Loading