
New journald source #327

Closed
binarylogic opened this issue May 7, 2019 · 16 comments
Assignees
Labels
needs: requirements Needs a list of requirements before work can begin. type: feature A value-adding code addition that introduces new functionality.

Comments

@binarylogic
Contributor

binarylogic commented May 7, 2019

This is a new source that makes it easy to ingest journald entries. The Rust systemd library should make this easier.

Specification

The first order of business for this source is to spec it out. I would like to start with an investigative process, filling in the blanks in my comment below. The questions I have so far (I'm sure I'm missing some):

  1. How do we filter by service? Does the JournalRecord type include the service source? And would this be a post-ingestion filter?
  2. What else can we filter by?
  3. How does checkpointing / cursor position work? Ex: If we restart Vector how do we resume where we left off?

Prior art

@binarylogic binarylogic changed the title Journald source New journald source Jun 20, 2019
@binarylogic binarylogic added the needs: approval Needs review & approval before work can begin. label Jul 12, 2019
@binarylogic
Contributor Author

binarylogic commented Jul 19, 2019

@bruceg I'd like for you to start by finishing off the following spec. Please make any adjustments you feel are necessary. There are a number of behavior questions we have (as noted in the original issue) that I would like to solidify before we begin work:


Specification

Config Example

[sources.journald]
  type = "journald"
  current_runtime_only = true # default
  local_only = true # default
  include = { unit = "nginx.service" }
  exclude = { message = ".*ignore this.*", priority = "debug" }
  units = [ "apache2", "system.slice" ]

Requirements

  • current_runtime_only - boolean - if true, include only journal entries from the current boot. If false, include all entries.
  • local_only - boolean - if true, include only journal entries originating from localhost. If false, include all entries.
  • include - a whitelist table of filters/matches to include. If empty or not present, all entries are accepted.
  • exclude - a blacklist table of filters/matches to drop. Defaults to none. Note: this applies after include and so takes precedence.
  • units - an array of unit names to monitor. If empty or not present, all units are accepted. Unit names lacking a . will have .service appended to make them a valid service unit name.
  • Vector will checkpoint the record most recently processed and continue from that point on restart, to avoid duplicating logs.

@binarylogic binarylogic added needs: requirements Needs a list of requirements before work can begin and removed needs: approval Needs review & approval before work can begin. labels Jul 19, 2019
@bruceg
Member

bruceg commented Jul 20, 2019

Should the include & exclude keys be arbitrary journald fields (like MESSAGE or _COMM etc) or a fixed set of allowed possibilities (like message or command), or some combination where pre-programmed keys have fixed behavior but all others reference literal journald fields?

I think some will have to be fixed, like priority where it's a number and you'd want to compare greater than for include and less than for exclude, but it would be useful to be able to search or filter on any key.
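For the fixed priority case, the numeric comparison could look roughly like this. This is a minimal sketch, not Vector code: the level names and numbers are the standard syslog levels journald uses for its PRIORITY field (0 = emerg through 7 = debug, so "more severe" means a lower number), and the function names here are made up for illustration.

```rust
/// Parse a priority given either as a syslog level name or as a
/// numeric string; journald's PRIORITY uses 0 (emerg) .. 7 (debug).
fn parse_priority(value: &str) -> Option<u8> {
    match value {
        "emerg" => Some(0),
        "alert" => Some(1),
        "crit" => Some(2),
        "err" => Some(3),
        "warning" => Some(4),
        "notice" => Some(5),
        "info" => Some(6),
        "debug" => Some(7),
        other => other.parse().ok().filter(|p| *p <= 7),
    }
}

/// A record passes an include threshold when it is at least as severe,
/// i.e. its numeric priority is less than or equal to the threshold.
fn at_least_as_severe(record: u8, threshold: u8) -> bool {
    record <= threshold
}
```

With this numbering, `include = { priority = "warning" }` would keep emerg through warning and drop notice, info, and debug.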

@bruceg
Member

bruceg commented Jul 20, 2019

How do we foresee handling the privilege boundary issues? The files for journald are not directly readable by non-privileged users (on my system, readable by root or groups systemd-journal, adm, or wheel). The easiest option would be to just require that vector is run with the appropriate supplementary group where journald support is required (which obviously needs to be well documented), but another option would be a separate privileged process to feed in the logs.

@bruceg
Member

bruceg commented Jul 20, 2019

1. How do we filter by service? Does the `JournalRecord` type include the service source? And would this be a post-ingestion filter?

journald records include a large number of fields to filter on. For selecting a service, there is SYSLOG_FACILITY, SYSLOG_IDENTIFIER, _COMM (executable path), _CMDLINE (command and arguments), _SYSTEMD_UNIT and _SYSTEMD_USER_UNIT (for things launched by systemd), _KERNEL_SUBSYSTEM (for kernel log messages), and occasionally UNIT (though this isn't a "trusted" field).

2. What else can we filter by?

See `man systemd.journal-fields`.

3. How does checkpointing / cursor position work? Ex: If we restart Vector how do we resume where we left off?

The systemd crate referenced above has 3 methods of producing a checkpoint value and later seeking back to that checkpoint. This would require storing that checkpoint in a file or table somewhere, and being careful not to allow a race to drop records.
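Stored as a flat file, that checkpoint persistence could look roughly like the sketch below. This is a std-only illustration, not the systemd crate's API: the file names are made up, and the write-temp-then-rename pattern is one way to avoid the partial-write race mentioned above, since a rename within the same filesystem is atomic and a crash mid-write leaves either the old checkpoint or the new one, never a truncated mix.

```rust
use std::fs;
use std::io::Write;
use std::path::Path;

/// Persist the journal cursor string atomically: write to a temporary
/// file, flush it to disk, then rename it over the real checkpoint file.
fn save_checkpoint(dir: &Path, cursor: &str) -> std::io::Result<()> {
    let tmp = dir.join("checkpoint.new");
    let dst = dir.join("checkpoint");
    let mut f = fs::File::create(&tmp)?;
    f.write_all(cursor.as_bytes())?;
    f.sync_all()?; // make sure the bytes hit disk before the rename
    fs::rename(tmp, dst)
}

/// Read back the last saved cursor, if any; on first run there is no
/// checkpoint and the source would start from the beginning (or tail).
fn load_checkpoint(dir: &Path) -> Option<String> {
    fs::read_to_string(dir.join("checkpoint")).ok()
}
```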

@binarylogic
Contributor Author

Thanks @bruceg! So that I don't create misdirection, I'd like to use planning tomorrow morning to obtain consensus on a direction. I'll follow up with answers tomorrow.

@binarylogic
Contributor Author

We spent some time discussing this during planning, so I'll answer your questions in order:

Should the include & exclude keys be arbitrary journald fields (like MESSAGE or _COMM etc) or a fixed set of allowed possibilities (like message or command), or some combination where pre-programmed keys have fixed behavior but all others reference literal journald fields?

To start, we'd like to only include service filters. You are welcome to rename this filter if you find it to be clearer (ex: include_services, exclude_services). The filters for this source should be obvious, fundamental filters that relate directly to the source behavior. Otherwise, you get into questions like "would a filter be better off in a separate filter transform?" Does that make sense?

How do we foresee handling the privilege boundary issues?

I think we go with what you suggested for now, which is "The easiest option would be to just require that vector is run with the appropriate supplementary group where journald support is required (which obviously needs to be well documented)".

The systemd crate referenced above has 3 methods of producing a checkpoint value and later seeking back to that checkpoint. This would require storing that checkpoint in a file or table somewhere, and being careful not to allow a race to drop records.

Currently, the file source stores checkpoint values in individual files in the data_dir, but @lukesteensen recommended using leveldb as a way to store this data. We're on the fence with the best approach here and would like to delegate that decision to you -- whatever you think is the easiest. I would recommend designing this in such a way that we could swap out the underlying persistence strategy in the future.
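One way to keep the persistence strategy swappable is to hide it behind a small trait, so flat files, leveldb, or anything else can satisfy the same interface. This is a hypothetical sketch; none of these names exist in Vector.

```rust
use std::collections::HashMap;

/// A pluggable checkpoint store: the journald source only needs
/// get/set of an opaque cursor string, keyed by source id.
trait CheckpointStore {
    fn get(&self, source_id: &str) -> Option<String>;
    fn set(&mut self, source_id: &str, cursor: &str);
}

/// In-memory implementation, handy for tests; a file- or
/// leveldb-backed store would implement the same trait.
struct MemoryStore {
    entries: HashMap<String, String>,
}

impl MemoryStore {
    fn new() -> Self {
        MemoryStore { entries: HashMap::new() }
    }
}

impl CheckpointStore for MemoryStore {
    fn get(&self, source_id: &str) -> Option<String> {
        self.entries.get(source_id).cloned()
    }
    fn set(&mut self, source_id: &str, cursor: &str) {
        self.entries.insert(source_id.to_string(), cursor.to_string());
    }
}
```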


That should answer most of your questions. This section will outline other details we discussed:

Fields / Schema

We'd like to take all default fields and leave them as unaltered root keys. For example, _SYSTEMD_UNIT would remain exactly as that field:

{
  // ...
  "_SYSTEMD_UNIT": "vector.service",
  "SYSLOG_FACILITY": "debug",
  "SYSLOG_PID": 123,
  // ...
}

The only change we'd like to make is mapping the relevant keys to our default schema: timestamp, message, and host. These should be configurable via timestamp_field, message_field, and host_field configuration options. Any other transformations the user would like to make, such as dropping or renaming fields, can be accomplished with a separate transform component. Does that make sense?
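As a sketch of that mapping: the journald field names below (MESSAGE, _HOSTNAME, __REALTIME_TIMESTAMP) come from systemd.journal-fields, but the function and the choice of schema keys are illustrative, not Vector's actual code, and record values are simplified to strings.

```rust
use std::collections::HashMap;

/// Rename journald's well-known fields onto the default schema keys
/// (message, host, timestamp) and pass every other field through
/// untouched as a root-level key.
fn map_record(record: HashMap<String, String>) -> HashMap<String, String> {
    record
        .into_iter()
        .map(|(k, v)| {
            let key = match k.as_str() {
                "MESSAGE" => "message".to_string(),
                "_HOSTNAME" => "host".to_string(),
                "__REALTIME_TIMESTAMP" => "timestamp".to_string(),
                _ => k, // e.g. _SYSTEMD_UNIT stays exactly as-is
            };
            (key, v)
        })
        .collect()
}
```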

First Version

Keep in mind, we don't need everything in the first version. Perhaps it makes more sense to start with a simple source that does not include any filtering or field mapping, then we can work on follow up changes to add those features.


Let me know if that answers everything, happy to clarify further.

@bruceg
Member

bruceg commented Jul 22, 2019

To start, we'd like to only include service filters. You are welcome to rename this filter if you find it to be clearer (ex: include_services, exclude_services). The filters for this source should be obvious, fundamental filters that relate directly to the source behavior. Otherwise, you get into questions like "would a filter be better off in a separate filter transform?" Does that make sense?

By "service" do you really mean any systemd unit, or specifically only a systemd .service unit? (there are also .device, .mount, .target units, etc).

Currently, the file source stores checkpoint values in individual files in the data_dir, but @lukesteensen recommended using leveldb as a way to store this data. We're on the fence with the best approach here and would like to delegate that decision to you -- whatever you think is the easiest. I would recommend designing this in such a way that we could swap out the underlying persistence strategy in the future.

If we are going to have multiple sources and/or sinks doing checkpointing, it would be a good idea to have a unified storage scheme for them. I have no strong preference between individual files or a database. Files are easier to debug but a database can make storing additional data easier. There are some potential consistency issues (like partial writes) that would best be hidden behind a common interface.

I am curious why LevelDB is being proposed over other key/value stores, particularly LMDB.

The only change we'd like to make is mapping the relevant keys to our default schema: timestamp, message, and host. These should be configurable via timestamp_field, message_field, and host_field configuration options. Any other transformations the user would like to make, such as dropping or renaming fields, can be accomplished with a separate transform component. Does that make sense?

I don't see a point in having those configurable, at least initially. Are there any field names other than MESSAGE used for the message text? The timestamp and host are also trusted fixed fields injected by journald.

Keep in mind, we don't need everything in the first version. Perhaps it makes more sense to start with a simple source that does not include any filtering or field mapping, then we can work on follow up changes to add those features.

Given some of the disagreement, this is probably best. Get the minimum working and then increment as needs arise.

@bruceg
Member

bruceg commented Jul 22, 2019

Oh, I forgot to ask. The initial spec included a "paths" configuration, but the systemd crate doesn't expose this functionality yet. Is it safe to presume I can drop this from the initial spec?

@binarylogic
Contributor Author

By "service" do you really mean any systemd unit, or specifically only a systemd .service unit? (there are also .device, .mount, .target units, etc).

Interesting, I hadn't thought about that. I'm not entirely sure if collecting log data for .device, .mount, .target units is useful. From what I can gather, it doesn't seem to be. I'm leaning towards just .service units, but if you understand the other unit types better, please feel free to use your best judgement.

If we are going to have multiple sources and/or sinks doing checkpointing, it would be a good idea to have a unified storage scheme for them.

👍 from me on that. If you'd like to extract that out, that would be fine. I'm slightly concerned that could expand scope on this PR. What do you think about the initial PR forgoing check-pointing, and then we can add that in a follow up PR? Again, I'll defer to you on the best way to approach this.

I am curious why LevelDB is being proposed over other key/value stores, particularly LMDB.

We currently use leveldb for our on-disk buffering, and I believe it had a number of requirements we needed there, specifically around ordering. We suggested leveldb because we already use it for that and we're trying to reduce the number of dependencies we need.

I don't see a point in having those configurable, at least initially. Are there any field names other than MESSAGE used for the message text? The timestamp and host are also trusted fixed fields injected by journald.

Agree, let's skip that for now.

Oh, I forgot to ask. The initial spec included a "paths" configuration, but the systemd crate doesn't expose this functionality yet. Is it safe to presume I can drop this from the initial spec?

Yep, we can drop the paths configuration for this first version. We might not need it.

@bruceg
Member

bruceg commented Jul 23, 2019

Interesting, I hadn't thought about that. I'm not entirely sure if collecting log data for .device, .mount, .target units is useful. From what I can gather, it doesn't seem to be. I'm leaning towards just .service units, but if you understand the other unit types better, please feel free to use your best judgement.

Then I will mimic systemd's behavior: automatically append .service if the unit name contains no dot. That way, most valid uses will simply be the service name, but all other units could potentially be used as well.
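That fixup is a one-liner; a sketch of the behavior described above (the function name is made up):

```rust
/// If the unit name contains no dot, treat it as a service and append
/// ".service"; otherwise keep the explicit unit type (.slice, .mount, ...).
fn fixup_unit_name(name: &str) -> String {
    if name.contains('.') {
        name.to_string()
    } else {
        format!("{}.service", name)
    }
}
```

So `units = [ "apache2", "system.slice" ]` from the config example would match `apache2.service` and `system.slice`.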

+1 from me on that. If you'd like to extract that out, that would be fine. I'm slightly concerned that could expand scope on this PR. What do you think about the initial PR forgoing check-pointing, and then we can add that in a follow up PR? Again, I'll defer to you on the best way to approach this.

The problem with forgoing check-pointing is that each time the journal is opened, it is re-read from the start. This can potentially be a huge amount of data on long-running machines. I will start without this capability, but it will only be useful for testing so I'll try to work it in for the initial PR.

@binarylogic
Contributor Author

The problem with forgoing check-pointing is that each time the journal is opened, it is re-read from the start.

Yep, understood. We wouldn't advertise the integration until checkpointing is done. I just thought it might be more focused, from a development and review perspective, to break them out.

@bruceg
Member

bruceg commented Jul 30, 2019

Checkpointing is going to depend on issue #644 / pull #673, in particular the global data_dir option.

@binarylogic
Contributor Author

@bruceg nice work on these changes. We can close this, correct?

@bruceg
Member

bruceg commented Sep 13, 2019

I did not close it as two of the spec points are unfinished (the include and exclude filters). There should be a transform that can do this, but there is none that completely fits.

@LucioFranco
Contributor

@bruceg I think we should open an issue for that transform and then close this one.

@bruceg
Member

bruceg commented Sep 17, 2019

Closing this via #882

@bruceg bruceg closed this as completed Sep 17, 2019
@binarylogic binarylogic added type: feature A value-adding code addition that introduces new functionality. and removed type: new feature labels Jun 16, 2020
3 participants