When using Go auto-instrumentation, we need to make the performance trade-offs very clear to the end user.
We need to run performance tests to measure the throughput that the Go auto-instrumentation supports. We have two knobs that we can tweak
Related to this topic, but more on the eBPF side:
As a general note, I think the performance of the eBPF code has a more "direct" impact on the probed application than the performance of our Go code.
Currently, we use a perf buffer to transfer events from the eBPF code to the user code in Go.
I think we should switch from the perf buffer to a ring buffer, as it provides better performance in almost all scenarios, as explained in this blog post and in more detail in this patch.
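To make the switch concrete, here is a minimal sketch of the eBPF side, assuming a hypothetical `events` map and `struct event` (the real probe code and event layout differ): the perf-buffer map and `bpf_perf_event_output()` call are replaced by a `BPF_MAP_TYPE_RINGBUF` map and `bpf_ringbuf_output()`.

```c
// Sketch only: map name, event struct and probe section are illustrative.
#include "vmlinux.h"
#include <bpf/bpf_helpers.h>

struct event {
    __u64 start_time;
    __u64 end_time;
};

// BPF_MAP_TYPE_RINGBUF is a single shared buffer (not per-CPU like the perf
// buffer); max_entries is its size in bytes and must be a power-of-two
// multiple of the page size.
struct {
    __uint(type, BPF_MAP_TYPE_RINGBUF);
    __uint(max_entries, 1 << 24);
} events SEC(".maps");

SEC("uprobe/example")
int uprobe_example(struct pt_regs *ctx)
{
    struct event e = {};
    e.start_time = bpf_ktime_get_ns();

    // Perf buffer equivalent would have been:
    //   bpf_perf_event_output(ctx, &events, BPF_F_CURRENT_CPU, &e, sizeof(e));
    // With the ring buffer the event is copied in and the reader is woken up
    // (flags == 0 keeps the default "notify on every submit" behaviour).
    bpf_ringbuf_output(&events, &e, sizeof(e), 0);
    return 0;
}

char LICENSE[] SEC("license") = "GPL";
```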
A key takeaway from the above links is that the biggest performance cost of using a ring/perf buffer is the mechanism by which the kernel signals the user code (which blocks on reading from the buffer) that an event is ready. Today we signal for each event. A much more efficient approach would be to not signal each time we push an event to the buffer, but to take the 'sampled' approach discussed in the links: wake the Go user code every X events, where X may be a configurable parameter. Alternatively, X can be a percentage of the buffer: signal only once more than X% of the buffer is full.
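As a rough sketch of how the sampled signaling could look on the eBPF side (reusing the hypothetical `events` map and `struct event` from the snippet above; `WAKEUP_THRESHOLD` stands in for the configurable X% parameter): `bpf_ringbuf_output()` accepts `BPF_RB_NO_WAKEUP`/`BPF_RB_FORCE_WAKEUP` flags, and `bpf_ringbuf_query()` exposes the current fill level.

```c
// Continues the previous snippet (same includes, map and event struct).
// WAKEUP_THRESHOLD is a hypothetical stand-in for the configurable X%.
#define WAKEUP_THRESHOLD(ring_size) ((ring_size) / 2)

static __always_inline void submit_event(struct event *e)
{
    // BPF_RB_NO_WAKEUP submits the event without signalling the epoll'ed
    // reader; BPF_RB_FORCE_WAKEUP signals it unconditionally.
    __u64 flags = BPF_RB_NO_WAKEUP;

    // bpf_ringbuf_query() lets the probe see how much unconsumed data is
    // sitting in the buffer, so we only wake the Go reader once it is X% full.
    __u64 avail = bpf_ringbuf_query(&events, BPF_RB_AVAIL_DATA);
    __u64 size  = bpf_ringbuf_query(&events, BPF_RB_RING_SIZE);

    if (avail >= WAKEUP_THRESHOLD(size))
        flags = BPF_RB_FORCE_WAKEUP;

    bpf_ringbuf_output(&events, e, sizeof(*e), flags);
}
```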
I did some basic testing using bpftool prog profile, and it looks like when we signal from the eBPF code to the user code, the time spent in the uprobe increases drastically. Hence, implementing the sampled ring buffer approach sounds like a good idea to me.
Another point to consider is what happens if the event throughput decreases and we wait a long time for X events without signaling the Go code. For this case, I did this PR, which adds the ability for the user code to flush the buffer after a timeout in case it didn't get a signal. This timeout can be another configurable parameter.
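To illustrate the timeout idea, here is a sketch of a user-space consumer loop using libbpf's C ring buffer API (our actual user code is Go, so this only shows the shape of the logic; `flush_timeout_ms` is the hypothetical configurable parameter): `ring_buffer__poll()` blocks up to the timeout, and on timeout we drain whatever has accumulated with `ring_buffer__consume()`.

```c
// Sketch only: libbpf user-space consumer with a flush timeout.
#include <bpf/libbpf.h>

static int handle_event(void *ctx, void *data, size_t len)
{
    // Decode and forward the event here.
    return 0;
}

int consume_loop(struct bpf_map *events_map, int flush_timeout_ms)
{
    struct ring_buffer *rb =
        ring_buffer__new(bpf_map__fd(events_map), handle_event, NULL, NULL);
    if (!rb)
        return -1;

    for (;;) {
        // ring_buffer__poll() returns when the kernel signals the buffer or
        // the timeout expires. With sampled wakeups, a low event rate can
        // mean no signal for a long time, so on timeout we flush whatever
        // has accumulated with ring_buffer__consume().
        int n = ring_buffer__poll(rb, flush_timeout_ms);
        if (n == 0)
            ring_buffer__consume(rb);
        else if (n < 0)
            break;
    }

    ring_buffer__free(rb);
    return 0;
}
```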
We should derive recommended values for the above parameters and also make them configurable.