[processor/tailsampling] Support hot sampling policy loading #37014

portertech · 2025-01-02T23:05:57Z

Description

Adding a feature. This pull-request adds support for hot sampling policy loading to the tail sampling processor. This allows the collector (or another service using the processor) to dynamically update tail sampling policy without needing to restart the processor (or the entire collector). This greatly minimizes the impact of sampling policy modifications on pipeline availability and processing. Changes to policy are safely applied on the next tick loop.

A collector (and/or other service) could use OpAMP to remotely manage sampling policy with little to no negative impact on pipeline availability and performance. This is what the https://tailctrl.io/ agent did.

Usage

Currently need to define a custom interface in order to set sampling policy.

type SamplingProcessor interface {
	processor.Traces

	SetSamplingPolicy(cfgs []tailsamplingprocessor.PolicyCfg)
}

factory := tailsamplingprocessor.NewFactory()

tsp, _ := factory.CreateTraces()
sp = tsp.(SamplingProcessor)

sp.SetSamplingPolicy(cfgs)

Testing

Added a test to ensure changes to policy are loaded. Using the changes in a private project.

Signed-off-by: Sean Porter <[email protected]>

portertech · 2025-01-02T23:14:02Z

Will add that changelog entry.

Signed-off-by: Sean Porter <[email protected]>

portertech · 2025-01-03T00:30:11Z

processor/tailsamplingprocessor/processor.go

-		initialDecisions := make([]sampling.Decision, lenPolicies)
-		for i := 0; i < lenPolicies; i++ {
-			initialDecisions[i] = sampling.Pending
-		}


I discovered this little chunk of no-op code, leftover from a decision refactor.

processor/tailsamplingprocessor/processor.go

Co-authored-by: Matthew Wear <[email protected]>

Signed-off-by: Sean Porter <[email protected]>

portertech · 2025-01-06T19:10:50Z

@mwear thank you for the review, applied your suggested changes 👍

mwear

The code looks good to me. I'll let the codeowners weigh in on the feature addition.

jpkrohling

This looks good, thanks! I wonder if you have data about the performance before and after this change.

jpkrohling · 2025-01-07T08:41:27Z

processor/tailsamplingprocessor/processor.go

+}
+
+func (tsp *tailSamplingSpanProcessor) SetSamplingPolicy(cfgs []PolicyCfg) {
+	tsp.logger.Debug("Setting pending sampling policy", zap.Int("pending.len", len(cfgs)))


For a follow-up PR: it would be useful to have a counter, stating how many times the sampling policy has been set.

jpkrohling · 2025-01-07T08:46:05Z

processor/tailsamplingprocessor/processor_test.go

+	}
+	tsp.SetSamplingPolicy(cfgs)
+
+	assert.Len(t, tsp.policies, 2)


this confused me for a little moment, as I was expecting "3" here -- perhaps add another assertion, with Len(t, tsp.pendingPolicy, 3), to highlight that the cfgs has been accepted by SetSamplingPolicy?

jpkrohling · 2025-01-07T08:48:25Z

I'm merging, the comments I left can be addressed on a follow-up PR.

…lemetry#37014) #### Description Adding a feature. This pull-request adds support for hot sampling policy loading to the tail sampling processor. This allows the collector (or another service using the processor) to dynamically update tail sampling policy without needing to restart the processor (or the entire collector). This greatly minimizes the impact of sampling policy modifications on pipeline availability and processing. Changes to policy are safely applied on the next tick loop. A collector (and/or other service) could use OpAMP to remotely manage sampling policy with little to no negative impact on pipeline availability and performance. This is what the https://tailctrl.io/ agent did. #### Usage Currently need to define a custom interface in order to set sampling policy. ``` go type SamplingProcessor interface { processor.Traces SetSamplingPolicy(cfgs []tailsamplingprocessor.PolicyCfg) } factory := tailsamplingprocessor.NewFactory() tsp, _ := factory.CreateTraces() sp = tsp.(SamplingProcessor) sp.SetSamplingPolicy(cfgs) ``` #### Testing Added a test to ensure changes to policy are loaded. Using the changes in a private project. --------- Signed-off-by: Sean Porter <[email protected]> Co-authored-by: Matthew Wear <[email protected]>

support hot sampling policy loading

0d47747

Signed-off-by: Sean Porter <[email protected]>

portertech requested review from jpkrohling and a team as code owners January 2, 2025 23:05

github-actions bot assigned codeboten Jan 2, 2025

github-actions bot added the processor/tailsampling Tail sampling processor label Jan 2, 2025

added changelog entry

535b20e

Signed-off-by: Sean Porter <[email protected]>

portertech commented Jan 3, 2025

View reviewed changes

mwear reviewed Jan 6, 2025

View reviewed changes

processor/tailsamplingprocessor/processor.go Outdated Show resolved Hide resolved

processor/tailsamplingprocessor/processor.go Outdated Show resolved Hide resolved

portertech and others added 3 commits January 6, 2025 10:27

moved debug log event outside of lock

152db30

Co-authored-by: Matthew Wear <[email protected]>

use basic mutex (instead of rwmutex)

2fd892b

Co-authored-by: Matthew Wear <[email protected]>

fixed minor lint error

02c95c7

Signed-off-by: Sean Porter <[email protected]>

mwear approved these changes Jan 6, 2025

View reviewed changes

jpkrohling approved these changes Jan 7, 2025

View reviewed changes

jpkrohling changed the title ~~[tailsamplingprocessor] Support hot sampling policy loading~~ [processor/tailsampling] Support hot sampling policy loading Jan 7, 2025

jpkrohling merged commit 5f9d943 into open-telemetry:main Jan 7, 2025
162 checks passed

github-actions bot added this to the next release milestone Jan 7, 2025

portertech mentioned this pull request Jan 17, 2025

[processor/tailsampling] Added @portertech to codeowners #37299

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[processor/tailsampling] Support hot sampling policy loading #37014

[processor/tailsampling] Support hot sampling policy loading #37014

portertech commented Jan 2, 2025 •

edited

Loading

portertech commented Jan 2, 2025

portertech Jan 3, 2025

portertech commented Jan 6, 2025

mwear left a comment

jpkrohling left a comment

jpkrohling Jan 7, 2025

jpkrohling Jan 7, 2025

jpkrohling commented Jan 7, 2025

[processor/tailsampling] Support hot sampling policy loading #37014

[processor/tailsampling] Support hot sampling policy loading #37014

Conversation

portertech commented Jan 2, 2025 • edited Loading

Description

Usage

Testing

portertech commented Jan 2, 2025

portertech Jan 3, 2025

Choose a reason for hiding this comment

portertech commented Jan 6, 2025

mwear left a comment

Choose a reason for hiding this comment

jpkrohling left a comment

Choose a reason for hiding this comment

jpkrohling Jan 7, 2025

Choose a reason for hiding this comment

jpkrohling Jan 7, 2025

Choose a reason for hiding this comment

jpkrohling commented Jan 7, 2025

portertech commented Jan 2, 2025 •

edited

Loading