-
-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[DEBIAN Base] Possibly Fix Random Down, DNS Timeout, EAGAIN, EAI_AGAIN #295
Comments
See my comment in #275 for slightly more detail , but TLDR; switching from alpine did nothing to resolve the issue for me. I did resolve it successfully though by ensuring kuma was going directly to my DNS server by setting DNS on the container instead of requests going through the docker gateway. Does kuma's container appropriately cache and respect TTL's for DNS records? |
Currently trying the uptime-kuma:1.5.0-debian image, will report back when it has run for a while. |
Just in case someone else wants to try this, I manually added these dns servers to uptime-kuma's docker-compose.yml file and my response times have also gone down.
|
So far there has been no timeout issues for me. So it looks like it has helped? Will let it run over the weekend too and report back. |
From my understanding the DNS issues affecting Alpine happens if the host is using a Kernel older than 5.x. When the host is using 5.x or newer the issue doesn't happen. Commit here: torvalds/linux@4e35c1c Another thing that helps with DNS issues in alpine is installing the |
Interesting, but is it just a dns cli tools? Why it could affect that?👀 |
I think you are right, it may be just be a cli tool. We use it since this comment in the Alpine Repo (gliderlabs/docker-alpine#539 (comment)). It helps when doing healthchecks in Kubernetes. Another suggestion would be to update the documentation and recommend running uptime-kuma Docker container with:
|
Hi @louislam I have also been experiencing these issues and just started using the debian based image. I will report back in a couple of days. |
No more random downtimes due to mentioned errors on my end for days, seems to be fixed by this. |
Any test hostname that can be used to test this issue using the Alpine container? |
No timeouts on my end either. It has been running for 5 days, looks like it works perfectly fine with the Debian image. |
Thank you for you guys' reports. Combining the reports here and the similar reports from the Internet, I am pretty sure Alpine Linux is having some kind of dns problem that they are not going to fix. Plus, armv7 error in Alpine Linux >= 3.13, 3.14 (#41), I am going to fade out Alpine Linux in the next release. If you are a developer in any other projects and to future me, my advice is that you should keep away from Alpine Linux. Imagine that your application need to call payment gateway API such as Paypal and it throws EAI_AGAIN randomly, that could be a disaster. |
Hmm... I'm still seeing EAI_AGAIN with the Debian image. 🤔definitely doesn't occur as often as before though. |
Then your local dns server is actually busy, you can config your container to use Google DNS / Cloudflare DNS directly. |
Hi @louislam So you mentioning that my DNS server is busy got me thinking, I remember seeing a post on /r/selfhosted about PiHole rate-limiting queries since FTL v5.7. This might be why I'm seeing I have just added the |
Oh, Pihole again. Some users were using Pihole that also got the error. My recommendation is you should don't put serious things behind Pihole. Just use it for the browsing and watching Youtube. |
You can just run two instances on different hardware; it's no different to any other DNS resovler, you want redundancy. As for Alpine, most of the Linuxserver fleet is using an Alpine base (mix of 3.14, 3.13, and a few older versions) and we've never encountered any widespread DNS issues - with the scale of use of our images I would have expected to get a lot of reports if it was something systemic with Alpine. Edit: Also yikes, the image is nearly a gig with the Debian "slim" base. |
I actually can't reproduce the dns problem, but the fact is a few users keep reporting such problem in this repo and alpine's repo (gliderlabs/docker-alpine#255). I still build alpine image, you can use |
hello i noticed a few random EAI_AGAIN errors but http website status says still UP after updating phihole to Docker Tag 2022.01.1 and uptimekuma to 1.11.3 adding only if i user dns monitor and specific ask pihole i get status down error hope this helps edit |
If you are experiencing such issues, it is worth to try this debian base docker image, and let me know whether it is solved. Thank you!
Docker Image
Story:
There were some Uptime Kuma users who were facing weird connectivity problems which I cannot explain.
getaddrinfo EAI_AGAIN
Errors #114Recently, one of our contributions @chakflying found out that it may be related to Alpine Docker which Uptime Kuma is based on it.
The text was updated successfully, but these errors were encountered: