-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
sporadic cluster creation timeout #412
Comments
You have several errors about no space left in the device sys/fs/cgroup/blkio/docker/ebd0b4c8f8840ef15d77d256089b3c79bdfe85ab8152559f5abd5ee5b67c4463/system.slice: no space left on device Is it possible that's the cause? |
The "no space left" messages are definitly worrisome. I wonder, if something is wrong with disk space limits in view of three levels of containerization. |
quick response between meetings, apologies for terseness! kubelet will see the actual backing host disk. disk space has no isolation, and kind does not do disk eviction etc. because we cannot guarantee how much space users will reserve on the host. if you're actually out of space though, there's really not much kind can do. |
I checked the disk usage and it seems that there should be space. Yet, I am not sure how the interpret the output of
In addition, here is the output of
|
Good (bad) news: The creation failure is permanent now (at least the last couple of times) ;-( |
I seems that our problem is related to leaking cgroups. The error message "no space left on device" is given while accessing I assume that the problem has nothing to do with kind and therefore close the ticket. |
For the record: restarting the node helped. |
FTR: pretty sure this is running out inotify watches google/cadvisor#1581 (comment) |
@swachter can you upload somewhere the cluster logs with |
* Upload keos installer images first version * Add temp yaml * Add example * Remove comment * Remove print bind * Update info * Remove deprecated parser import * Update Doc * Update Doc * Update Doc * Update Doc
We use
kind
to create a local Kubernetes cluster during a Gitlab-CI job. The cluster creation fails sporadically with a timeout.Attached is
kind
systemctl status kubelet
journalctl -xeu kubelet
The
systemctl
andjournalctl
commands where executed inside thekind-control-plane
Docker container. The output ofdocker ps -a
inside that container shows that no containers were created.The
journalctl
output shows a couple of errors. The most significant one seems 'no space left on device' when startingcAdvisor
.I wonder how the device space is constraint and if there are parameters that might remedy the situation.
kind
uses thedocker:dind
service that is configured for the Gitlab-CI job that useskind
. Both, the build container and dind container run in the same POD. Docker is addressed by kind viatcp://localhost:2375
. Thekind-control-plane
container in turn runs "inside" the dind container and brings its own Docker. Finally, that innermost Docker tries to runcAdvisor
.In my understandang there are 3 nested Docker containers:
kind-1.log
systemctl-status-kubelet.txt
journalctl-xeu-kubelet.txt
The text was updated successfully, but these errors were encountered: