Dissect tag on parsing error #8751

ph · 2018-10-25T17:10:26Z

Previously, when a parsing error occurred, the event was returned untouched
and an error was logged. If you don't look at your logs, you have no
idea that the tokenizer was not able to match your string.

Instead, when a parsing error occurs in the Dissect processor, we now
add a tag named 'dissect_parsing_error' to the 'log.flags' field.
With that information, you can reprocess your data or filter on it
in the UI.

Fixes: #8123
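
For readers following along, here is a minimal, self-contained sketch of the behavior described above. It is not the code from this PR: the `event` type, `appendFlag`, `dissectKV`, and `process` are hypothetical stand-ins, and returning the error together with the flagged event is an assumption. Only the field name `log.flags` and the value `dissect_parsing_error` come from the description.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// Field name and flag value as given in the PR description.
const (
	flagField        = "log.flags"
	flagParsingError = "dissect_parsing_error"
)

// event is a simplified, hypothetical stand-in for a Beats event.
type event map[string]interface{}

var errNoMatch = errors.New("could not match the message")

// appendFlag appends flag to the string list stored at the dotted path,
// creating intermediate maps when they do not exist yet.
func appendFlag(e event, path, flag string) error {
	keys := strings.Split(path, ".")
	m := map[string]interface{}(e)
	for _, k := range keys[:len(keys)-1] {
		next, ok := m[k]
		if !ok {
			child := map[string]interface{}{}
			m[k] = child
			m = child
			continue
		}
		child, ok := next.(map[string]interface{})
		if !ok {
			return fmt.Errorf("field %q is not an object", k)
		}
		m = child
	}
	last := keys[len(keys)-1]
	switch v := m[last].(type) {
	case nil:
		m[last] = []string{flag}
	case []string:
		m[last] = append(v, flag)
	default:
		return fmt.Errorf("field %q is not a list of strings", path)
	}
	return nil
}

// dissectKV is a hypothetical tokenizer that only understands "key=value".
func dissectKV(msg string) (key, value string, err error) {
	parts := strings.SplitN(msg, "=", 2)
	if len(parts) != 2 || parts[0] == "" {
		return "", "", errNoMatch
	}
	return parts[0], parts[1], nil
}

// process extracts a field from "message". On a parsing error the event is
// left otherwise untouched, but a flag is recorded so the failure is visible
// in the data itself and not only in the logs.
func process(e event) (event, error) {
	msg, _ := e["message"].(string)
	k, v, err := dissectKV(msg)
	if err != nil {
		if ferr := appendFlag(e, flagField, flagParsingError); ferr != nil {
			return e, ferr
		}
		return e, err
	}
	e[k] = v
	return e, nil
}

func main() {
	for _, msg := range []string{"user=alice", "no separator here"} {
		e, err := process(event{"message": msg})
		fmt.Printf("event=%v err=%v\n", e, err)
	}
}
```

The second message cannot be tokenized, so its event keeps the original `message` and gains the flag under `log.flags`, while the first event gets the extracted field and no flag.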


CHANGELOG.asciidoc

ruflin

Let's be consistent and call it flag everywhere in the code and docs.

ruflin · 2018-10-26T11:15:07Z

libbeat/processors/dissect/processor.go

@@ -27,6 +27,8 @@ import (
 	"github.com/elastic/beats/libbeat/processors"
 )

+const tagParsingError = "dissect_parsing_error"


Can we also call it flag?

ruflin · 2018-10-26T11:15:25Z

libbeat/processors/dissect/processor_test.go

@@ -176,3 +176,59 @@ func TestFieldAlreadyExist(t *testing.T) {
 		})
 	}
 }
+
+func TestErrorTagging(t *testing.T) {


ph · 2018-10-26T12:08:17Z

@ruflin Now with more flags (tm) :)

ruflin · 2018-10-29T09:28:36Z

libbeat/beat/event.go

@@ -24,6 +24,9 @@ import (
 	"github.com/elastic/beats/libbeat/common"
 )

+// FlagField fields used to keep information or errors when events are parsed.
+const FlagField = "log.flags"


I'm wondering if dissect should write its flags into log.flags or rather event.flags? The reason is that dissect is not only for logs but more generic.

Should have spotted this earlier.

@webmat I think we need event.flags in the future in ECS.

@ruflin Would it be the same for when an event is truncated?

I did create elastic/ecs#100 a while ago for the log tag field. There is no issue yet for a more generic set of flags.

I agree with @ruflin that the dissect error should not be set on log.flags.

event.flags is a bit better. But I think this approach still mixes up pipeline & processing metadata with userland data (like the error discussion we had last week, @ruflin). The following idea hasn't been fleshed out yet, but I've been thinking we should introduce a section that's clearly about stuff that happened in the processing pipeline. E.g. pipeline.error, pipeline.tags (or flags), if someone wants to note down timings of each step in their pipeline, they'd do it under pipeline. as well, etc. However this will have to come after ECS 1.0/GA, so don't wait on this being defined for what needs to happen in Beats.

In the meantime, what I would suggest instead is to do what we've been doing for years, and add this dissect tag to tags directly, like Logstash does with _grok_parse_failure.

And @ph, to answer your more recent question, I would consider the truncation to be userland information, about the log itself. So I do think having truncated right on log.flags makes sense.

This is the new field where the multiline tag is also being added, correct? (Sorry I haven't been following these developments very closely)

I believe it's the same field, correct.

Ok thanks for confirming. So my opinion for now is that flags that are descriptive of the log itself or the log entry should be added to log.flags, so multiline, truncated, as they are now.

Parsing flags like dissect_parsing_error, on the other hand, should be added to tags, until we define a more general place to put pipeline errors and details.

To not block this PR, let's go with log.flags for now. Let's open a more general discussion about where information from processing should go.
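
Since the final location of the marker is still open in this thread, a consumer that wants to select events for reprocessing could check both candidate places. The following is only a sketch under that assumption: `lookup`, `hasMarker`, and `needsReprocessing` are hypothetical helpers, not part of libbeat, and events are modeled as plain nested maps.

```go
package main

import (
	"fmt"
	"strings"
)

// lookup walks a dotted path through nested maps and returns the value found.
func lookup(fields map[string]interface{}, path string) (interface{}, bool) {
	var cur interface{} = fields
	for _, k := range strings.Split(path, ".") {
		m, ok := cur.(map[string]interface{})
		if !ok {
			return nil, false
		}
		cur, ok = m[k]
		if !ok {
			return nil, false
		}
	}
	return cur, true
}

// hasMarker reports whether the list stored at path contains marker.
// It accepts []string as well as []interface{} (as produced by JSON decoding).
func hasMarker(fields map[string]interface{}, path, marker string) bool {
	v, ok := lookup(fields, path)
	if !ok {
		return false
	}
	switch list := v.(type) {
	case []string:
		for _, s := range list {
			if s == marker {
				return true
			}
		}
	case []interface{}:
		for _, s := range list {
			if s == marker {
				return true
			}
		}
	}
	return false
}

// needsReprocessing checks log.flags (the location chosen in this PR) and
// tags (the alternative suggested in the discussion).
func needsReprocessing(fields map[string]interface{}) bool {
	const marker = "dissect_parsing_error"
	return hasMarker(fields, "log.flags", marker) || hasMarker(fields, "tags", marker)
}

func main() {
	events := []map[string]interface{}{
		{"message": "ok", "log": map[string]interface{}{"flags": []string{"multiline"}}},
		{"message": "bad", "log": map[string]interface{}{"flags": []string{"dissect_parsing_error"}}},
		{"message": "bad too", "tags": []string{"dissect_parsing_error"}},
	}
	for _, e := range events {
		fmt.Printf("%v reprocess=%v\n", e["message"], needsReprocessing(e))
	}
}
```

Checking both fields keeps such a filter working if the marker later moves out of log.flags, at the cost of one extra lookup per event.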

For tags in LS: We should probably also tackle this.


ph added the in progress, libbeat, and :Processors labels Oct 25, 2018

ph force-pushed the fix/dissect-add-flags branch from 074371a to 97e3e19 Compare October 25, 2018 17:12

ph added the review label and removed the in progress label Oct 25, 2018

ruflin reviewed Oct 26, 2018

ruflin reviewed Oct 26, 2018

flag instead of tag

5e4fc09

ruflin reviewed Oct 29, 2018

ruflin approved these changes Oct 30, 2018

ph merged commit 8dbfed2 into elastic:master Oct 30, 2018

ph added the needs_backport label Oct 30, 2018

ph mentioned this pull request Oct 30, 2018

Cherry-pick #8751 to 6.x: Dissect tag on parsing error #8818

Merged

ph added the v6.6.0 label and removed the needs_backport label Oct 30, 2018

webmat mentioned this pull request Oct 30, 2018

Using ECS to track the source of errors or exceptions elastic/ecs#154

Open

ph commented Oct 25, 2018

ruflin left a comment

ruflin Oct 26, 2018

ruflin Oct 26, 2018

ph commented Oct 26, 2018

ruflin Oct 29, 2018

ruflin Oct 29, 2018

ph Oct 29, 2018

webmat Oct 29, 2018

webmat Oct 29, 2018 (edited)

ph Oct 29, 2018

webmat Oct 29, 2018 (edited)

ruflin Oct 30, 2018
