dnsmasq process in wrong cgroup #21

AlbanBedel · 2020-05-19T11:12:09Z

When a pod/container is started directly with the podman command, the dnsmasq process created by dnsname end up in the calling user cgroup:

$ sudo podman start hello
$ cat /proc/$(sudo cat /run/containers/cni/dnsname/podman/pidfile)/cgroup
12:pids:/user.slice/user-1000.slice/[email protected]
11:memory:/user.slice/user-1000.slice/[email protected]
...

That's probably not ideal as the dnsmasq process is then bound to the user slice.

But when systemd service file generated by podman are used, the dnsmasq process ends up in the service's cgroup:

$ cd /run/systemd/system
$ sudo podman generate systemd --files --name hello
$ sudo systemctl start container-hello.service
$ cat /proc/$(sudo cat /run/containers/cni/dnsname/podman/pidfile)/cgroup
12:pids:/system.slice/container-hello.service
11:memory:/system.slice/container-hello.service

This is problematic as the dnsmasq process and the container have totally different life cycles. I noticed this problem as I'm trying to start containers using transient units. Transient units are normally automatically removed when they are stopped, but if the dnsmasq process is still running because of another container, it prevent the transient unit it was started in from being destroyed.

I can probably workaround this problem in some way for my setup, but I think the dnsmasq process, or any other long running process related to a cni network, should be in a cgroup whose life cycle match the cni network life cycle.

The text was updated successfully, but these errors were encountered:

baude · 2020-05-19T13:59:58Z

@mheon WDYT?

mheon · 2020-05-19T14:04:27Z

We could try moving to the container's cgroups, but that's problematic because it should outlive any single container as long as another container in the network is started and running.

Best way is likely to make a scope exclusively for dnsmasq (podman-dnsmasq-$NETWORK.service maybe?) under Libpod's default cgroup parent. We have code to do this for cgroupfs and systemd in Podman (we use it for making pod cgroups, but it could easily be repurposed for this).

carbolymer · 2021-04-14T19:09:38Z

Is there any workaround for this? This makes using more than one network in podman impossible.

AlbanBedel · 2021-04-15T07:46:26Z

@mheon and what about #12? Each pod can have a unique combination of networks attached, to support that we would probably need a dnsmasq process per pod anyway. It's a larger change but that would solve both bugs at once.

On the other hand it would make sense to have a generic solution to handle the case where a cni plugin start a process that should outlive the pod it was started for.

mheon · 2021-04-15T13:23:07Z

@AlbanBedel We're presently discussing an extensive rearchitecture/rewrite of dnsname to resolve that, that should also resolve this. I'm just waiting for the OK to go and ahead and get started on it.

AlbanBedel mentioned this issue May 19, 2020

unexpected behavior when a container is attached to more than one network #12

Open

Luap99 mentioned this issue Feb 17, 2022

Problem when containers resolver each other by name containers/podman#13269

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dnsmasq process in wrong cgroup #21

dnsmasq process in wrong cgroup #21

AlbanBedel commented May 19, 2020

baude commented May 19, 2020

mheon commented May 19, 2020

carbolymer commented Apr 14, 2021

AlbanBedel commented Apr 15, 2021

mheon commented Apr 15, 2021

dnsmasq process in wrong cgroup #21

dnsmasq process in wrong cgroup #21

Comments

AlbanBedel commented May 19, 2020

baude commented May 19, 2020

mheon commented May 19, 2020

carbolymer commented Apr 14, 2021

AlbanBedel commented Apr 15, 2021

mheon commented Apr 15, 2021