Docs: clarify node source(s) on kubernetes_logs #21474

Firehed · 2024-10-10T18:29:54Z

A note for the community

Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request.
If you are interested in working on this issue or have submitted a pull request, please leave a comment.

Problem

On the docs for the kubernetes_logs source, it's unclear to me whether any given instance of Vector collects logs from:

a) the host node of the Vector instance
b) all nodes in the cluster, regardless of which node Vector instance(s) run on
c) some configurable combination of the above
d) something else entirely

Basically, I'd like to know what the default configuration will provide depending on the deployment topology, so I can de-risk logs getting either omitted or duplicated (or both!), especially when running in an autoscaling cluster.

I'm assuming that running Vector as a DaemonSet with a minimal config for the kubernetes_logs source will generally do the expected thing (implying the source is "all logs from the host node, and no other nodes"), but I've run into easily-misconfigured tools for log aggregation before. Being very explicit in the documentation on this would help ensure everyone is able to use it in the way they intend.

This could literally be a change as small as

-Collects Pod logs from Kubernetes Nodes, automatically enriching data with metadata via the Kubernetes API.
+Collects Pod logs from Vector's host Kubernetes Node, automatically enriching data with metadata via the Kubernetes API.

(assuming this is actually the case, of course)

Configuration

No response

Version

0.41.0

Debug Output

No response

Example Data

No response

Additional Context

No response

References

I searched open issues and couldn't find anything!

jszwedko · 2024-10-10T22:59:58Z

Thanks @Firehed! I agree the docs are unclear here. Your assumption is correct, though: Vector only collects logs from pods running on the same node as the Vector instance. I'll submit your diff as a PR.
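For readers who land here, a minimal sketch of the setup being discussed may help. It assumes Vector runs as a DaemonSet and that the pod spec sets the VECTOR_SELF_NODE_NAME environment variable from the pod's spec.nodeName. The source and sink names (k8s_logs, stdout) are placeholders and do not come from this issue, and self_node_name is written out only to make the default visible.

```yaml
# vector.yaml: illustrative sketch, not taken from the issue.
sources:
  k8s_logs:                  # hypothetical source name
    type: kubernetes_logs
    # This is the documented default, shown explicitly. The source uses
    # this node name to limit collection to pods on the same node.
    self_node_name: "${VECTOR_SELF_NODE_NAME}"

sinks:
  stdout:                    # hypothetical sink name
    type: console
    inputs: ["k8s_logs"]
    encoding:
      codec: json
```

With one such pod per node, each node's logs should be collected by exactly one Vector instance, which is the "host node only" behavior confirmed above.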

Firehed added the "type: bug" label Oct 10, 2024

jszwedko added a commit that referenced this issue Oct 10, 2024

docs(kubernetes_logs source): Clarify that the source only tails logs from the host

9cf5726

Closes: #21474 Signed-off-by: Jesse Szwedko <[email protected]>

jszwedko mentioned this issue Oct 10, 2024

docs(kubernetes_logs source): Clarify that the source only tails logs from the host #21477

Merged

github-merge-queue bot pushed a commit that referenced this issue Oct 11, 2024

docs(kubernetes_logs source): Clarify that the source only tails logs from the host (#21477)

45ac1b9

Closes: #21474 Signed-off-by: Jesse Szwedko <[email protected]>

jszwedko closed this as completed in #21477 Oct 11, 2024
