-
Notifications
You must be signed in to change notification settings - Fork 19
nomad-driver-lxc compatibility with LXC 4.0 #30
Comments
From what i can see, nomad-lxc-driver does not release handles to the most recently created container in /sys/fs/cgroup//lxc.monitor... as per below
b1 has been created after b2. as you can see, b2 is not occupied by driver's process. to properly cleanup the container (remove all its artifacts from nomad client) i need to either restart nomad which restarts also driver's process, or create another container to release driver's handles to container which will be removed later. i am not programmer so cannot see how it can be fixed in the driver's code i used nomad 1.1.3 and 1.1.4. i recompiled the driver for both versions with 2 different pkg.in/go-lxc.v2 drivers (version from 2018 and 2021). no differences i guess the issue is in the driver's code and not in any dependecied library |
hey, is there anyone who has similar issue and found solution? |
@kwianeck are you using Centos/Redhat/VzLinux? I had encountered reoccurring issues with lxc/lxd and cgroups on Centos (using lxd snap package) but they have went away on Ubuntu 20.04. We are using LXD/LXC on Ubuntu 20.04, version 4.0.7 and during my testing I encountered issues that I couldn't resolve. The error complains about network type configuration, I am guessing that it should use the default lxc profile by default? I've included my default profile and other details. I hope it helps resolve issues with this plugin as I'd love to start using nomad for all my lxd/lxc containers!
Thank you. |
It looks like go-lxc only supports cgoups (not cgroups2). I found a few other incompatibilities and bugs while testing lxc 4 support. Will create a merge request and tag this issue "soon". |
Acording to https://discuss.linuxcontainers.org/t/lxc-4-0-lts-has-been-released/7182, cgroup specification for container and it's monitor have been separated.
My observation is that driver is not requesting to lxc to fully cleanup container. Instead, when stopping task, lxc removes cgroup for container but leaves cgroups for lxc.monitor under /sys/fs/cgroup//
in this way, within the time, we can expect hundreds of left objects for many lxc.monitors of containers which have been removed long time back
ex.
Output after alpine2 removal (nomad job stop alpine2)
as you can see, lxc.payload directory has gone (container's specific cgroups), however, lxc-monitor cgroups stay
The text was updated successfully, but these errors were encountered: