-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
crash looping on GKE: wal-e pipe capacity issue with newer kernels #154
Comments
Thanks for the report @tmc! What provider and k8s version did you use to deploy Workflow? That can help us nail down the issue so we can figure out a fix we could propose upstream and then bump wal-e to a release with that fix. Did the fix in wal-e/wal-e#270 (comment) work for you? |
It might be a neat experiment to try a slightly older kernel version with kubernetes and see if this issue still persists. |
Yes that fix worked, GKE, k8s 1.4 |
this is still present on k8s 1.4.6 on GKE |
Unfortunately there is nothing we can do on our end to fix this other than to use the provided workaround or to fix it in wal-e and bump the installed version. If you can provide a patch that fixes this issue for you, please make a PR upstream and we can bump wal-e forwards to the fix once it's merged. I haven't seen this issue in the wild on GKE with k8s 1.4+ so I don't have a reliable test case (or even the slightest idea how this issue crops up) to test a fix against. Until then I cannot help you. |
Hey all, WAL-E maintainer here. I will accept a patch with a lower pipe size that doesn't tank performance that works with defaults or some adaptive code to deal with this new limit. I suspect the adaptive approaches may be more trouble than its worth, but if someone can surprise me, that'd be great. |
I'm on k8s 1.5.2 on GKE. When I try the workaround from wal-e/wal-e#270 (comment) I get:
Am I doing something wrong? |
For anyone else that runs into the same problem I had, I was able to solve it by downloading the workflow chart, unpacking it and editing |
I ran into this again on an upgrade. |
|
I think one could write a patch with some arithmetic to solve this (by
reading the value, generously estimating how many pipes will be created,
and dividing).
…On Wed, May 10, 2017 at 12:29 PM Travis Cline ***@***.***> wrote:
I ran into this again on an upgrade.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#154 (comment)>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/AAAcF-tkSOpzFHFhZVfGPHnO8RPJaDDFks5r4hAWgaJpZM4Kd9Z8>
.
|
This is apparently due to wal-e/wal-e#270
example spew:
The text was updated successfully, but these errors were encountered: