confgenerator: move modify filter before the processors #419

ridwanmsharif · 2022-02-18T04:44:41Z

This change moves the modify filter to before the processors.
This is useful so the processors can reference the LogName moving
forward. This will be needed for modify_fields and exclude_logs.

This change also updates the parser component to preserve existing
fields on the record when parsing. We do this so the LogName is not
blown away when the parser is used.

This change moves the modify filter to before the processors. This is useful so the processors can reference the LogName moving foward. This will be needed for `modify_fields` and `exclude_logs`. This change also updates the `parser` component to preserve existing fields on the record when parsing. We do this so the LogName is not blown away when the parser is used. Signed-off-by: Ridwan Sharif <[email protected]>

quentinmit · 2022-03-03T16:01:11Z

confgenerator/fluentbit/processors.go

+
+			// We need to preserve existing fields (like LogName) that are present
+			// before parsing.
+			"Reserve_Data": "True",


Unfortunately this will cause user-visible behavior changes. :( We need to make sure we still reset all the other fields by default. (As part of my modify_fields work I'm going to have to expose this as a boolean for users, since we can't just override it.)

Hm... to preserve only the one key, I'm thinking you'd need to "nest" the whole log entry, parse with Reserve_Data: True, pull out the log name field, and then delete the nested field.

Right, using the alternative approach we'd do the following when using the parse processor:

Nest all keys under a temporary field

Parse the key that we care about (with "Reserve_Data:" "True")

Lift only the log name field from nested temporary field

Delete the temporary field (effectively erasing all the other fields that existing in record pre-parsing)

And in the modify_fields work you're working on, we'd just skip all of that and use "Reserve_Data:" "True"

Based on my understanding above I have a few thoughts:

Our current documentation doesn't specify what the expected behaviour is with the fields that are not the parse key field (https://cloud.google.com/stackdriver/docs/solutions/agents/ops-agent/configuration#logging-processors). I expected them to actually be preserved by default but turns out it wasn't true (I don't know if this was a conscious decision or a result of using the defaults from Fluent Bit). We might not be able to fix forward but figured I'd leave a note here so it is considered

This alternative approach might cause a lot of performance regression (see b/169784211#comment21 for more details about how a single add field and nest operation impacts performance).

It seems to me that we could alternatively also ask for a feature request in the Fluent Bit plugin directly where we as for an option like "Reserve_Fields": ["field_1", "field_2",...] and it will only reserve the specified fields. This way we can default to only preserving the LogName (or other fields that the ops agent adds in the future like resource_name in Add resource_name to log entries using modify + nest filter #414).

ridwanmsharif requested review from quentinmit, a team and igorpeshansky and removed request for a team February 18, 2022 04:52

ridwanmsharif force-pushed the ridwanmsharif-log-name branch from c9cb0b9 to 50758cd Compare February 24, 2022 19:01

ridwanmsharif removed request for quentinmit and igorpeshansky February 24, 2022 21:56

ridwanmsharif force-pushed the ridwanmsharif-log-name branch 2 times, most recently from a9d7ef4 to c5ef87e Compare March 1, 2022 15:28

ridwanmsharif force-pushed the ridwanmsharif-log-name branch from c5ef87e to db6fd78 Compare March 1, 2022 15:30

ridwanmsharif requested review from a team, davidbtucker and quentinmit and removed request for a team March 1, 2022 16:49

quentinmit requested changes Mar 3, 2022

View reviewed changes

ridwanmsharif mentioned this pull request Jun 6, 2022

fluentbit/processor: Update the parser to reserve data while avoiding duplication of keys #653

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

confgenerator: move modify filter before the processors #419

confgenerator: move modify filter before the processors #419

ridwanmsharif commented Feb 18, 2022 •

edited

Loading

quentinmit Mar 3, 2022

ridwanmsharif Mar 3, 2022

confgenerator: move modify filter before the processors #419

Are you sure you want to change the base?

confgenerator: move modify filter before the processors #419

Conversation

ridwanmsharif commented Feb 18, 2022 • edited Loading

quentinmit Mar 3, 2022

Choose a reason for hiding this comment

ridwanmsharif Mar 3, 2022

Choose a reason for hiding this comment

ridwanmsharif commented Feb 18, 2022 •

edited

Loading