-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Crash on startup #1362
Comments
I suspect this could happen due to one of the following reasons:
I do know that clearing all log files fixes the problem, but we cannot do that every time this happens. UPDATE: I have tried to bump the backpressure to 200M and input buffer to 20M, but it doesn't seem to fix the problem. |
I have found the issue, the following line:
The Would it be possible to just have fluent-bit ignore this sort of lines instead of crashing without any indication why? |
+1 to this as i've seen something similar with just the following logs:
not 100% if its related yet though but i am sending logs to splunk also and seen the below stack trace as well:
also i've increased the FYI, once i start seeing the above error, the only way i can recover was to |
Crash on startup will happen when the storage file system path have more files than the fd system limit, I am working in a solution for the problem |
Would you please give it a try to the latest version on GIT Master ? I've pushed several improvements on storage startup and chunks handling. |
Will try on Monday. |
@edsiper tried it and I don't see any obvious issues. It will be difficult to test the scenario though, but I will keep an eye on it and if it occurs I will let you know. |
In our case, it seems to be an issue in one of our filters (possibly Lua) that sometimes ends up producing
After a lot of those, fluent-bit (1.1.3-debug) then crashes
This might be specific to Splunk output, so let me know if you want me to create a new issue for this. |
please try to come up with a small reproducible case with Lua where it crashes Fluent Bit (we might need to add extra checks and enforce data types when using Lua scripts) |
ping |
@edsiper I haven't seen the error anymore. |
new storage improvements fix the issue, closing. |
It also happens with splunk ... using fluentbit 1.13
debug logs does not show more than time format warn
|
Same with fluent-bit 1.2.2
|
same with 1.3.2
|
Looks like a bad formed record is messing up things, any clue from where those records are coming from? If you disable the filters do u still see the issue? |
How to determine which earliest version has this fix available ? I am guessing 1.2.1 or 1.2.2 but not sure. |
Bug Report
Describe the bug
fluent-bit keeps crashing on startup.
To Reproduce
I'm running fluent-bit as a service, so here are the logs for journal:
And the fluent-bit logs:
Expected behavior
Fluent bit should not crash.
Screenshots
Your Environment
Additional context
I'm building fluent-bit instead of using the td-agent-bit with the following script:
And here are the current log file sizes:
The text was updated successfully, but these errors were encountered: