
[processor/groupbyattrsprocessor] allow empty keys for compaction #7793

Merged: 7 commits merged into open-telemetry:main on Mar 2, 2022

Conversation

@pmm-sumo (Contributor) commented on Feb 10, 2022:

Description:

Extension of the groupbyattrsprocessor for compacting data when it is spread across multiple ResourceSpans/ResourceMetrics/ResourceLogs entries with matching Resource and InstrumentationLibrary

Link to tracking Issue: #2265 in core

Testing: Several unit tests added

Documentation: Clarified the docs on the usage of empty keys and provided an example of compaction
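
For reference, a minimal sketch of the configuration this change enables (an empty keys list means nothing is regrouped; the processor only compacts matching entries):

    processors:
      groupbyattrs:
        # empty list: no grouping attributes, only compaction of entries
        # with matching Resource and InstrumentationLibrary
        keys: []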

Benchmark results (compacting 100 spans in different layouts):

BenchmarkCompacting
BenchmarkCompacting/instrumentation_library_count=1,_spans_per_library_count=100
BenchmarkCompacting/instrumentation_library_count=1,_spans_per_library_count=100-16         	   27966	     42352 ns/op
BenchmarkCompacting/instrumentation_library_count=10,_spans_per_library_count=10
BenchmarkCompacting/instrumentation_library_count=10,_spans_per_library_count=10-16         	   23912	     50266 ns/op
BenchmarkCompacting/instrumentation_library_count=100,_spans_per_library_count=1
BenchmarkCompacting/instrumentation_library_count=100,_spans_per_library_count=1-16         	   16819	     71327 ns/op

@pkositsyn (Contributor) commented:

I have two insights:

  1. There is a comment in config.go about prohibiting empty keys; it should be changed as well
  2. I have an example with the jaegerexporter, which sends different ResourceSpans independently, in separate calls. Given that, I feel like in many cases setting batch processor -> groupbyattrs (with an empty set of keys) -> jaeger exporter will benefit performance a lot (see the sketch after this list). Might be a good idea to run a performance test and add something to the jaeger exporter docs as well
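
A hypothetical sketch of that pipeline ordering (the receiver setup and the Jaeger endpoint are placeholders, not taken from this thread):

    receivers:
      otlp:
        protocols:
          grpc:

    processors:
      batch:
      groupbyattrs:
        keys: []   # compact matching Resource/InstrumentationLibrary entries after batching

    exporters:
      jaeger:
        endpoint: jaeger-all-in-one:14250   # placeholder endpoint
        tls:
          insecure: true

    service:
      pipelines:
        traces:
          receivers: [otlp]
          processors: [batch, groupbyattrs]
          exporters: [jaeger]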

@pkositsyn (Contributor) commented:

I have done some tests with adding groupbyattrs with empty keys in front of the jaeger exporter.

The following screenshots compare two versions: the first window is without groupbyattrs, the second is with this processor.

[Screenshot: spans rate in the jaeger exporter, 2022-02-13 20-55-38]

[Screenshot: CPU usage of the collector, 2022-02-13 20-55-45]

The benchmark was conducted via the tracegen docker-compose example, applying this diff to the current branch:

diff --git a/examples/tracegen/docker-compose.yml b/examples/tracegen/docker-compose.yml
index 015068b5e..d6842c6ce 100644
--- a/examples/tracegen/docker-compose.yml
+++ b/examples/tracegen/docker-compose.yml
@@ -5,7 +5,16 @@ services:
     image: jaegertracing/all-in-one:latest
     ports:
       - "16686:16686"
-      - "14250"
+      - "14250" 
+
+  prometheus:
+    image: prom/prometheus
+    ports:
+      - "9090:9090"
+    volumes:
+      - ./prometheus-config.yml:/etc/prometheus/prometheus.yml
+    depends_on:
+      - otel-collector
 
   otel-collector:
     build:
@@ -25,8 +34,11 @@ services:
       - otel-collector:4317
       - -otlp-insecure
       - -rate
-      - "1"
+      - "1000"
+      - -workers
+      - "4"
       - -duration
-      - 10000000s
+      - 600s
     depends_on:
       - otel-collector
+      - prometheus
diff --git a/examples/tracegen/otel-collector-config.yml b/examples/tracegen/otel-collector-config.yml
index e07abfae1..c3518e0ae 100644
--- a/examples/tracegen/otel-collector-config.yml
+++ b/examples/tracegen/otel-collector-config.yml
@@ -12,10 +12,13 @@ exporters:
 
 processors:
   batch:
+  groupbytrace:
+  groupbyattrs:
+    keys: []
 
 service:
   pipelines:
     traces:
       receivers: [otlp]
       exporters: [jaeger, logging]
-      processors: [batch]
+      processors: [groupbytrace, batch, groupbyattrs]

I added groupbytrace to split the batches (possibly) sent by tracegen and make every pdata.ResourceSpans contain only one span. (It is actually a common situation for groupbytrace to be set before batching.) The runs on the screenshots differ only by adding/removing groupbyattrs in the pipeline

@jpkrohling self-requested a review on February 15, 2022 09:56
@pmm-sumo (Contributor, Author) commented:

@pkositsyn @jpkrohling I updated the README, added some test cases and examples. Let me know what you think

@jpkrohling (Member) commented:

I'll add this to my review queue and should provide some feedback soon.

@pmm-sumo marked this pull request as ready for review on February 17, 2022 14:27
@pmm-sumo requested a review from a team on February 17, 2022 14:27
@jpkrohling (Member) left a review comment:

So, basically, there wasn't a change in the processing itself, only in the constraint checking, right? I like it :-)

Because this feels like an esoteric feature, I recommend documenting it well, including when and how it should be used and when/how it should not be used.

@@ -83,6 +86,64 @@ Notes:
* The specified "grouping" attributes that are set on the new *Resources* are also **removed** from the metric *DataPoints*
* While not shown in the above example, the processor also merges collections of records under matching InstrumentationLibrary

### Compaction

In some cases, the data might come in single requests to the collector, and even after batching there might be multiple duplicated ResourceSpans/ResourceLogs/ResourceMetrics objects, which leads to additional memory consumption and increased processing costs. As a remedy, the `groupbyattrs` processor can be used to compact data that has matching Resource and InstrumentationLibrary properties.
@jpkrohling (Member) commented on this section:

To me, the appealing aspect of this feature is getting better performance while sending data out. Without calling this out explicitly, people might not realize that the advantages apply on the transport side as well.

@pmm-sumo (Contributor, Author) replied:

If I understand correctly, this will reduce the size of the message for many of the formats (e.g. OTLP). In some cases (Jaeger) it will also reduce the number of RPC calls, since the Jaeger model maps one batch to one ResourceSpans - if I got it right. Perhaps it would be worth calling out in the jaegerexporter docs that groupbyattrs is recommended (or worth considering)?
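
For illustration, a hypothetical before/after of this compaction, with the pdata rendered as a YAML-like pseudostructure (attribute and library names invented for the example):

    # before: two ResourceSpans carrying an identical Resource
    resourceSpans:
      - resource: {attributes: {host.name: host-a}}
        instrumentationLibrarySpans:
          - instrumentationLibrary: {name: mylib}
            spans: [span-1]
      - resource: {attributes: {host.name: host-a}}
        instrumentationLibrarySpans:
          - instrumentationLibrary: {name: mylib}
            spans: [span-2]

    # after groupbyattrs with keys: []
    resourceSpans:
      - resource: {attributes: {host.name: host-a}}
        instrumentationLibrarySpans:
          - instrumentationLibrary: {name: mylib}
            spans: [span-1, span-2]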

(Resolved review thread on processor/groupbyattrsprocessor/README.md)
@pmm-sumo force-pushed the group-by-attrs-compaction branch 2 times, most recently from c7b691c to 6600874 on February 18, 2022 15:41

## Example
It is recommended to use the `groupbyattrs` processor together with the [batch](https://github.com/open-telemetry/opentelemetry-collector/tree/main/processor/batchprocessor) processor as a consecutive step, as this will reduce the fragmentation of data (by grouping records together under matching Resource/Instrumentation Library)
@pmm-sumo (Contributor, Author) commented on this section:

@jpkrohling @pkositsyn I put the note on the batch processor in this section, also updated the wording and included it in the examples. If you have any suggestions on how to express this better, you are more than welcome :)

@jpkrohling (Member) commented:

Looks good to me!

@pmm-sumo (Contributor, Author) commented on Mar 2, 2022:

Thank you @jpkrohling! Just rebased

@jpkrohling merged commit 40b09b2 into open-telemetry:main on Mar 2, 2022