Implement backoff for Kafka output #16777

ycombinator · 2020-03-03T22:01:29Z

Describe the enhancement:

Currently the Kafka output does not support any sort of backoff for publishing events in the situation where the Kafka broker might've temporarily gone away. We should add support for this, similar to what the Redis and Elasticsearch outputs do.

Describe a specific use case for the enhancement or feature:

To prevent the Kafka output from aggressively retrying to publish events to a Kafka broker that might have temporarily gone away.

ycombinator · 2020-03-03T22:02:09Z

Potentially useful for implementation:

beats/libbeat/outputs/kafka/config.go

Line 256 in 95626b8

// TODO: k.Producer.Retry.Backoff = ?
https://github.com/Shopify/sarama/blob/58123455d1a70c7f438871597d9a1715462e5d1c/config.go#L218-L229

ycombinator · 2020-03-09T15:48:25Z

@faec @urso See the link in the previous comment for backoff/retry options for the producer in Sarama.

urso · 2020-03-10T14:40:31Z

BackoffFunc is "new". We should give it a try. From experience, the 'Backoff' setting did not always work correctly, which led to us using 100% CPU (depending on where an error occurred and whether the internal write buffer was full). Not sure if this has been improved.

We try to move most 'retry' handling to libbeat, because sarama does not support infinite retry. I presume this might affect how we use BackoffFunc. Plus, with exponential backoff we will need a way to reset the wait state upon success.

ZHumphries · 2020-03-23T15:06:20Z

Just experienced this issue. Someone changed Kafka to use authentication, and all winlogbeat instances on every server spiked to 100% CPU once the buffer was full. Fortunately the winlogbeat rollout wasn't complete, so VMware didn't grind completely to a halt.

toby-sutor · 2020-03-25T07:57:44Z

It's worth mentioning that a temporary workaround is to add the Kafka IPs to the local /etc/hosts file of the Beats nodes to relieve the DNS servers until this has been implemented.

ycombinator added enhancement libbeat :Outputs Team:Integrations labels Mar 3, 2020

andresrc added [zube]: Inbox [zube]: Backlog and removed [zube]: Inbox labels Mar 4, 2020

andresrc added [zube]: Ready and removed [zube]: Backlog labels Apr 13, 2020

ycombinator self-assigned this Apr 17, 2020

ycombinator added [zube]: In Progress and removed [zube]: Ready labels Apr 17, 2020

ycombinator mentioned this issue Apr 17, 2020

Implement backoff for Kafka output #17808

Merged

6 tasks

ycombinator closed this as completed in #17808 May 5, 2020

zube bot added [zube]: Done and removed [zube]: In Progress labels May 5, 2020

andresrc removed the [zube]: Done label May 12, 2020
