Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

issue with disk buffer - data_dir filling up with ldb files and data even though data is making it upstream #6602

Closed
ryn9 opened this issue Mar 2, 2021 · 6 comments
Assignees
Labels
domain: buffers Anything related to Vector's memory/disk buffers type: bug A code related bug.

Comments

@ryn9
Copy link

ryn9 commented Mar 2, 2021

I am having an issue with disk buffer in an http sink.
The data_dir filling up with ldb files and data even though data is making it upstream.

Vector Version

From timberio/vector:0.11.1-alpine

vector 0.11.1 (v0.11.1 x86_64-unknown-linux-musl 2020-12-17)

Vector Configuration File

Numerous input and transforms, but everything end up going out the one sink...

sinks:
  logs_out:
    type: http
    inputs:
      - logs_remove_fields
    compression: gzip
    uri: https://url.domain.name:10010/logs
    tls:
      ca_file: /etc/vector/ssl_auth/ca-certificates.crt
      crt_file: /etc/vector/ssl_auth/client.crt
      key_file: /etc/vector/ssl_auth/client.key
      verify_hostname: true
    encoding:
      codec: ndjson
      timestamp_format: rfc3339
    buffer:
      max_size: 10737418240
      type: disk
      when_full: block

Debug Output

Not performed

Expected Behavior

The data_dir should have buffered data only that has not been flushed.

Actual Behavior

The data_dir filling up with ldb files and data even though data is making it upstream.

Example Data

N/A

Additional Context

N/A

References

N/A

Additional Notes

I started asking about this on this Discord Timber server, in the support channel.
Please feel free to reach out via that method as well if more information is needed.

@ryn9 ryn9 added the type: bug A code related bug. label Mar 2, 2021
@binarylogic binarylogic added the domain: buffers Anything related to Vector's memory/disk buffers label Mar 3, 2021
@binarylogic
Copy link
Contributor

Thanks @ryn9, I think #6597 should resolve this. We'll have @ktff take a look.

@lukesteensen
Copy link
Member

@binarylogic As noted on discord, this is a related but different issue from the one you linked. We requested that this one also be opened to track both aspects of the problem.

@streeck
Copy link

streeck commented Mar 3, 2021

I observe exactly the same issue but on a statsd sink. I'm running it under Docker and the same image version as @ryn9.

My sink configuration:

[sinks.metrics_out]
  type = "statsd"
  inputs = ["metrics_in"]
  address = "address:port"
  mode = "udp"
  path = "/tmp/"

  encoding.codec = "json"

  buffer.type = "disk"
  buffer.max_size = 1073741824

@ktff
Copy link
Contributor

ktff commented Mar 3, 2021

I'm running it under Docker

@ryn9 reported the same so it seams the original issue still happens in Docker.

@ktff
Copy link
Contributor

ktff commented Jun 1, 2021

@ryn9 @streeck the fix for this has landed on master and is currently on nightly, while v0.14 should be released soon . Could you both check it out?

@ktff
Copy link
Contributor

ktff commented Jun 16, 2021

This is fixed in v0.14 but we can reopen it if it turns out that there are more causes.

@ktff ktff closed this as completed Jun 16, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
domain: buffers Anything related to Vector's memory/disk buffers type: bug A code related bug.
Projects
None yet
Development

No branches or pull requests

5 participants