You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is the issue report properly structured and detailed with version numbers?
Is this for Kubeflow development ?
Would you like to work on this issue?
You can join the CNCF Slack and access our meetings at the Kubeflow Community website. Our channel on the CNCF Slack is here #kubeflow-platform.
Version
master
Describe your issue
I encountered an error while deploying Kubeflow on a KIND cluster. The error logs indicate a failure in creating a containerd task due to an inability to start a new OS thread, suggesting that the maximum number of user processes might need to be increased (ulimit -u). Here are the specific error message:
Error: failed to create containerd task: failed to start shim: start failed: runtime: failed to create new OS thread (have 5 already; errno=11)
runtime: may need to increase max user processes (ulimit -u)
My system meets all the prerequisites, and I am running Fedora 40.
Here's the output of ulimit -a on my host system:
real-time non-blocking time (microseconds, -R) 200000
core file size (blocks, -c) unlimited
data seg size (kbytes, -d) unlimited
scheduling priority (-e) 0
file size (blocks, -f) unlimited
pending signals (-i) 251414
max locked memory (kbytes, -l) 8192
max memory size (kbytes, -m) unlimited
open files (-n) 8388608
pipe size (512 bytes, -p) 8
POSIX message queues (bytes, -q) 819200
real-time priority (-r) 0
stack size (kbytes, -s) 8192
cpu time (seconds, -t) unlimited
max user processes (-u) 8388608
virtual memory (kbytes, -v) unlimited
file locks (-x) unlimited
and the limits on the control-plane (of kind):
docker exec -it kubeflow-control-plane bash
root@kubeflow-control-plane:/# ulimit -a
real-time non-blocking time (microseconds, -R) 200000
core file size (blocks, -c) unlimited
data seg size (kbytes, -d) unlimited
scheduling priority (-e) 0
file size (blocks, -f) unlimited
pending signals (-i) 251414
max locked memory (kbytes, -l) 8192
max memory size (kbytes, -m) unlimited
open files (-n) 1048576
pipe size (512 bytes, -p) 8
POSIX message queues (bytes, -q) 819200
real-time priority (-r) 0
stack size (kbytes, -s) 8192
cpu time (seconds, -t) unlimited
max user processes (-u) 1048576
virtual memory (kbytes, -v) unlimited
file locks (-x) unlimited
Hi @Al-Pragliola ,
I installed Fedora 40 in a vm and tried to re-produce following the steps mentioned in the issue, but didn't get same results.
I used docker engine then , but here is reported problem with podman if that's what you are using
My environment:
kind version 0.24.0
Docker version 27.3.1, build ce12230
Validation Checklist
Version
master
Describe your issue
I encountered an error while deploying Kubeflow on a KIND cluster. The error logs indicate a failure in creating a containerd task due to an inability to start a new OS thread, suggesting that the maximum number of user processes might need to be increased (ulimit -u). Here are the specific error message:
My system meets all the prerequisites, and I am running Fedora 40.
Here's the output of
ulimit -a
on my host system:and the limits on the control-plane (of kind):
Workaround:
add two workers to kind
Steps to reproduce the issue
Put here any screenshots or videos (optional)
The text was updated successfully, but these errors were encountered: