-
Notifications
You must be signed in to change notification settings - Fork 712
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conntrack loses events due to ENOBUFS #1137
Comments
User was running kernel 4.3.5-300 |
This is kinda by design; on a system with a large rate of connections we can't keep up with conntract and it will fail. We degrade gracefully, falling back to polling. |
Wouldn't it be reasonable to dynamically adjust the buffer size? |
In the worst case there will always be too many events to keep up with. Its reasonable to fall back to polling. |
I think it would be worth confirming with the user whether he was actually reaching a point in which we want to fall back to polling or whether we want to extend the buffers a bit further. Also, it would be helpful to print a friendlier error with an explanation similar to the one in this ticket. |
I've seen this again in our service:
|
Note that when encountering this (or any other) error we immediately spawn another conntrackWalker, so we generally won't fall back to polling for long. If spikes are frequent though then a larger buffer would be better. Plus the message is alarming to users. So we should consider
|
(1) done in #1896 |
The error message we log from conntrack is now rather misleading since it suggests that the buffer size can be adjusted in Perhaps the default value should be read from |
We increased the default buffer size significantly in #2739. I don't think it's worth doing anything more here. |
Full logs: https://gist.github.com/janwillies/54083a50358718a4fb21
Context: https://weaveworks.slack.com/archives/scope-public/p1457352064000743
The text was updated successfully, but these errors were encountered: