feat: do not send Fluent Bit application logs to CloudWatch TDE-1016 #374

Merged
merged 1 commit into from
Jan 3, 2024

Conversation

paulfouquet
Collaborator

@paulfouquet paulfouquet commented Jan 3, 2024

Motivation

The Fluent Bit application pods have been hitting network issues, which has dramatically increased the volume of logs and has a considerable impact on our cloud storage cost. One solution (possibly temporary, until these network errors stop or slow down, if we find a way to fix them) is to stop sending the Fluent Bit application pod logs to CloudWatch.

Modification

Tell Fluent Bit to ignore its own logs using the exclude annotation.
As a consequence, these logs are no longer shipped through our system and will not be visible in Elasticsearch. We can still view them directly on the EKS cluster, which we have access to.
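
For illustration, the exclude annotation that the Fluent Bit Kubernetes filter honours is `fluentbit.io/exclude: "true"`, provided the filter has `K8S-Logging.Exclude On`. The sketch below shows one way such an annotation could be merged into a pod's annotations. The helper name `withLogExclusion` and the `example.com/owner` annotation are hypothetical and not taken from this PR; how the annotation is actually wired into the deployment is an assumption.

```typescript
// Minimal sketch, not the PR's actual change: helper and sample annotation are hypothetical.
type Annotations = Record<string, string>;

// Annotation key read by the Fluent Bit Kubernetes filter when K8S-Logging.Exclude is On.
const FluentBitExcludeAnnotation = 'fluentbit.io/exclude';

/** Return a copy of the given pod annotations with log exclusion switched on. */
function withLogExclusion(annotations: Annotations = {}): Annotations {
  // Kubernetes annotation values are strings, so the flag is the string 'true'.
  return { ...annotations, [FluentBitExcludeAnnotation]: 'true' };
}

// 'example.com/owner' is a placeholder for whatever annotations the pod already carries.
const podAnnotations = withLogExclusion({ 'example.com/owner': 'platform' });
console.log(podAnnotations);
```

Because the helper returns a new object, existing annotations are preserved and the original input is left unmodified.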

Checklist

  • Tests updated - N/A
  • Docs updated
  • Issue linked in Title

@paulfouquet paulfouquet added this pull request to the merge queue Jan 3, 2024
Merged via the queue into master with commit 628900c Jan 3, 2024
4 checks passed
@paulfouquet paulfouquet deleted the fix/reducing-log-volume-fluentbit-tde-1016 branch January 3, 2024 22:05
github-merge-queue bot pushed a commit that referenced this pull request Apr 3, 2024
#### Motivation

Fluent Bit is experiencing a lot of network errors connecting to
`logs.ap-southeast-2.amazonaws.com`. The volume of these errors increases
the log storage cost, see
#374.
This is a known issue for which the Fluent Bit team made [some
recommendations to reduce
it](aws/aws-for-fluent-bit#340). This PR
applies one of these recommendations and has been tested successfully
on non-prod.

#### Modification

- Remove [the patch](#374)
that stops sending Fluent Bit application logs to CloudWatch
- Set the Fluent Bit `keepalive idle timeout` to 4s (default is 1.5s)
following [the recommendations made
here](aws/aws-for-fluent-bit#340).
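
As a rough sketch of the second change, Fluent Bit exposes the idle timeout as the `net.keepalive_idle_timeout` networking option (in seconds) on an output. The snippet below renders a classic-format `[OUTPUT]` section with that option set to 4. The `renderOutputSection` helper is hypothetical, and the use of the `cloudwatch_logs` output with `Match *` is an assumption rather than something confirmed by this commit. The fragment is also incomplete, since a real CloudWatch output needs further settings such as a log group.

```typescript
// Minimal sketch under stated assumptions: the helper and the chosen output plugin are illustrative.
type OutputOptions = Record<string, string | number>;

/** Render a classic-format Fluent Bit [OUTPUT] section from key/value options. */
function renderOutputSection(options: OutputOptions): string {
  // Each option becomes an indented "key value" line under the section header.
  const lines = Object.entries(options).map(([key, value]) => `    ${key} ${value}`);
  return ['[OUTPUT]', ...lines].join('\n');
}

const cloudWatchOutput = renderOutputSection({
  Name: 'cloudwatch_logs', // assumed output plugin
  Match: '*', // assumed match rule
  region: 'ap-southeast-2', // region of the endpoint named in the motivation
  'net.keepalive_idle_timeout': 4, // idle keepalive timeout in seconds
});
console.log(cloudWatchOutput);
```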

#### Checklist

- [ ] Tests updated - N/A
- [x] Docs updated
- [x] Issue linked in Title

---------

Co-authored-by: Victor Engmark <[email protected]>