Deadlock when container exits while killing by podman #15492

Closed
tyler92 opened this issue Aug 26, 2022 · 21 comments · Fixed by #15494
Labels
kind/bug · locked - please file new issue/PR

Comments

@tyler92
Contributor

tyler92 commented Aug 26, 2022

Is this a BUG REPORT or FEATURE REQUEST? (leave only one on its own line)

/kind bug

Description

There is a situation where podman tries to lock the same mutex twice and deadlocks. After that, all podman commands stop working because they wait forever for that mutex to be released. There are two ways to reproduce this issue - a realistic one and an artificial one. The following steps describe the artificial but simple way:

Steps to reproduce the issue:

  1. Create a bash script with the following content (be careful - it contains a kill command):
#!/bin/bash

set -o errexit

podman rm -f -i label-for-kill 

while true; do
	podman info > /dev/null
	podman run --rm --init --name label-for-kill nginx &
	sleep 3
	echo send kill for `pgrep -f label-for-kill`
	kill `pgrep -f label-for-kill`
	sleep 3
done
  2. Execute the script.

Describe the results you received:

After some time the script will hang.

Describe the results you expected:

The script never stops and keeps doing its work.

Additional information you deem important (e.g. issue happens only occasionally):

The issue happens only occasionally. In my case it is not reproducible on an amd64 machine, but it reproduces very often on my slow ARMv7 device. When the issue happens I get the following output:

# ./test.sh 
send kill for 946 1015
/docker-entrypoint.sh: /docker-entrypoint.d/ is not empty, will attempt to perform configuration
/docker-entrypoint.sh: Looking for shell scripts in /docker-entrypoint.d/
/docker-entrypoint.sh: Launching /docker-entrypoint.d/10-listen-on-ipv6-by-default.sh
send kill for 1270 1337
/docker-entrypoint.sh: /docker-entrypoint.d/ is not empty, will attempt to perform configuration
/docker-entrypoint.sh: Looking for shell scripts in /docker-entrypoint.d/
/docker-entrypoint.sh: Launching /docker-entrypoint.d/10-listen-on-ipv6-by-default.sh
2022-08-26T07:53:55.000449985Z: kill container: No such process

and the process list looks like:

 1270 S  `- podman run --rm --init --name label-for-kill nginx
 1356 Z      `- [3] <defunct>
 1337 S  `- /usr/bin/conmon <...>
 1355 S      `- /usr/bin/podman <...> container cleanup --rm <...>

Output of podman version:

Client:       Podman Engine
Version:      4.3.0-dev
API Version:  4.3.0-dev
Go Version:   go1.17.8
Built:        Thu Jan  1 00:00:00 1970
OS/Arch:      linux/arm

Output of podman info:

host:
  arch: arm
  buildahVersion: 1.27.0
  cgroupControllers:
  - cpuset
  - cpu
  - cpuacct
  - blkio
  - memory
  - devices
  - freezer
  - net_cls
  - perf_event
  - net_prio
  - pids
  cgroupManager: systemd
  cgroupVersion: v1
  conmon:
    package: Unknown
    path: /usr/bin/conmon
    version: 'conmon version 2.0.29, commit: unknown'
  cpuUtilization:
    idlePercent: 87.12
    systemPercent: 10.63
    userPercent: 2.24
  cpus: 4
  distribution:
    distribution: buildroot
    version: "2022.02"
  eventLogger: journald
  hostname: comm99-dev
  idMappings:
    gidmap: null
    uidmap: null
  kernel: 4.14.98
  linkmode: dynamic
  logDriver: k8s-file
  memFree: 180031488
  memTotal: 511803392
  networkBackend: cni
  ociRuntime:
    name: crun
    package: Unknown
    path: /usr/bin/crun
    version: |-
      crun version 1.4.3
      commit: 61c9600d1335127eba65632731e2d72bc3f0b9e8
      spec: 1.0.0
      +SYSTEMD +SELINUX +APPARMOR +CAP +SECCOMP +YAJL
  os: linux
  remoteSocket:
    exists: true
    path: /run/podman/podman.sock
  security:
    apparmorEnabled: false
    capabilities: CAP_CHOWN,CAP_DAC_OVERRIDE,CAP_FOWNER,CAP_FSETID,CAP_KILL,CAP_NET_BIND_SERVICE,CAP_SETFCAP,CAP_SETGID,CAP_SETPCAP,CAP_SETUID,CAP_SYS_CHROOT
    rootless: false
    seccompEnabled: true
    seccompProfilePath: ""
    selinuxEnabled: false
  serviceIsRemote: false
  slirp4netns:
    executable: ""
    package: ""
    version: ""
  swapFree: 0
  swapTotal: 0
  uptime: 0h 17m 39.00s
plugins:
  authorization: null
  log:
  - k8s-file
  - none
  - passthrough
  - journald
  network:
  - bridge
  - macvlan
  - ipvlan
  volume:
  - local
registries:
  search:
  - docker.io
  - registry.fedoraproject.org
  - registry.access.redhat.com
  - registry.centos.org
store:
  configFile: /etc/containers/storage.conf
  containerStore:
    number: 6
    paused: 0
    running: 0
    stopped: 6
  graphDriverName: overlay
  graphOptions:
    overlay.mountopt: nodev
  graphRoot: /opt/containers/storage
  graphRootAllocated: 29639618560
  graphRootUsed: 3874271232
  graphStatus:
    Backing Filesystem: extfs
    Native Overlay Diff: "true"
    Supports d_type: "true"
    Using metacopy: "false"
  imageCopyTmpDir: /var/tmp
  imageStore:
    number: 35
  runRoot: /run/containers/storage
  volumePath: /opt/containers/storage/volumes
version:
  APIVersion: 4.3.0-dev
  Built: 0
  BuiltTime: Thu Jan  1 00:00:00 1970
  GitCommit: ""
  GoVersion: go1.17.8
  Os: linux
  OsArch: linux/arm
  Version: 4.3.0-dev

Package info (e.g. output of rpm -q podman or apt list podman):

-

Have you tested with the latest version of Podman and have you checked the Podman Troubleshooting Guide? (https://github.com/containers/podman/blob/main/troubleshooting.md)

Yes

Additional environment details (AWS, VirtualBox, physical, etc.):

@openshift-ci openshift-ci bot added the kind/bug label Aug 26, 2022
@vrothberg
Member

Thanks for reaching out and for providing the reproducer, @tyler92!

@mheon, that smells like it is caused by the exit-code changes. I will take a look.

@tyler92
Contributor Author

tyler92 commented Aug 26, 2022

libpod.Kill locks the container's mutex.
ociRuntime.KillContainer in some cases calls libpod.Wait, which wants to lock the same mutex again.

I tried to create a PR with a fix but, to be honest, it looks a little bit difficult. The simplest way is to unlock the mutex at the beginning of the libpod.Kill function (as the Attach function does), but a race condition can occur in that case.
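
To make the double lock concrete, here is a minimal, self-contained Go sketch of the pattern described above. It is not libpod code: the container type, its methods, and the timings are invented for illustration, and a plain sync.Mutex stands in for libpod's process-shared SHM lock, which is equally non-reentrant.

package main

import (
	"fmt"
	"sync"
	"time"
)

// container stands in for libpod's Container: a single non-reentrant
// mutex guards its state.
type container struct {
	mu sync.Mutex
}

// wait mimics a Wait-style call that needs the container lock.
func (c *container) wait() {
	c.mu.Lock() // second Lock from the same call chain: blocks forever
	defer c.mu.Unlock()
}

// kill mimics a Kill-style call: it takes the lock, signals the
// container, and - in the problematic code path - ends up waiting for
// the exit code while still holding the lock.
func (c *container) kill() {
	c.mu.Lock()
	defer c.mu.Unlock()
	// ... deliver the signal via the OCI runtime; if the container has
	// already exited, the status-update path falls through to wait():
	c.wait()
}

func main() {
	c := &container{}
	done := make(chan struct{})
	go func() { c.kill(); close(done) }()

	select {
	case <-done:
		fmt.Println("kill returned")
	case <-time.After(2 * time.Second):
		fmt.Println("kill is stuck: the same mutex was locked twice")
	}
}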

@vrothberg
Member

@tyler92 could you send SIGHUP to the deadlocked process and paste the stacktrace here?

@tyler92
Contributor Author

tyler92 commented Aug 26, 2022

In real life this can happen when we launch podman run --rm ... via SSH and the TCP connection is closed. In this scenario (as I understand it) sshd sends SIGTERM to every process within the SSH session. This is the original scenario where I found this problem.

@vrothberg
Member

In real life this can happen when we launch podman run --rm ... via SSH and the TCP connection is closed. In this scenario (as I understand it) sshd sends SIGTERM to every process within the SSH session. This is the original scenario where I found this problem.

Can you elaborate on what's deadlocking after the session is closed?

@vrothberg
Member

ociRuntime.KillContainer in some cases calls libpod.Wait, which wants to lock the same mutex again.

Can you point out that code path? I don't think that's possible.

@vrothberg
Member

ociRuntime.KillContainer in some cases calls libpod.Wait, which wants to lock the same mutex again.

Can you point out that code path? I don't think that's possible.

OK, found it: https://github.com/containers/podman/blob/main/libpod/oci_conmon_common.go#L283

@tyler92
Contributor Author

tyler92 commented Aug 26, 2022

For some reason I can't get the stack trace through SIGHUP, but I reproduced it in the IDE:

lock.(*SHMLock).Lock (shm_lock_manager_linux.go:113) github.com/containers/podman/v4/libpod/lock
libpod.(*Container).WaitForExit.func1 (container_api.go:512) github.com/containers/podman/v4/libpod
libpod.(*Container).WaitForExit (container_api.go:565) github.com/containers/podman/v4/libpod
libpod.(*Container).Wait (container_api.go:495) github.com/containers/podman/v4/libpod
libpod.(*ConmonOCIRuntime).UpdateContainerStatus (oci_conmon_common.go:283) github.com/containers/podman/v4/libpod
libpod.(*ConmonOCIRuntime).KillContainer (oci_conmon_common.go:339) github.com/containers/podman/v4/libpod
libpod.(*Container).Kill (container_api.go:229) github.com/containers/podman/v4/libpod
terminal.ProxySignals.func1 (sigproxy_linux.go:41) github.com/containers/podman/v4/pkg/domain/infra/abi/terminal
runtime.goexit (asm_amd64.s:1594) runtime
 - Async Stack Trace
terminal.ProxySignals (sigproxy_linux.go:30) github.com/containers/podman/v4/pkg/domain/infra/abi/terminal
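
The top frames also show where the signal comes from: podman run proxies signals it receives (the SIGTERM from the reproducer script, or from sshd) to the container, and that proxy goroutine is what ends up calling Kill. Below is a rough Go sketch of such a proxy, for orientation only - it is not podman's actual ProxySignals implementation, and the kill callback is a made-up stand-in.

package main

import (
	"fmt"
	"os"
	"os/signal"
	"syscall"
	"time"
)

// proxySignals forwards SIGTERM/SIGINT received by this process to the
// container via the supplied kill callback, loosely mirroring the
// ProxySignals goroutine in the stack trace above.
func proxySignals(kill func(sig syscall.Signal) error) {
	sigs := make(chan os.Signal, 1)
	signal.Notify(sigs, syscall.SIGTERM, syscall.SIGINT)
	go func() {
		for s := range sigs {
			// This is the moment the deadlock was triggered in podman:
			// Kill runs while the container may be exiting concurrently.
			if err := kill(s.(syscall.Signal)); err != nil {
				fmt.Fprintln(os.Stderr, "error forwarding signal:", err)
			}
		}
	}()
}

func main() {
	proxySignals(func(sig syscall.Signal) error {
		fmt.Println("would forward", sig, "to the container")
		return nil
	})
	// Keep the process alive for a while; send SIGTERM or SIGINT to it
	// to watch the forwarding happen.
	time.Sleep(time.Minute)
}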

@vrothberg
Member

Excellent, thanks @tyler92. I am sure we'll find a solution.

@tyler92
Contributor Author

tyler92 commented Aug 26, 2022

Can you elaborate on what's deadlocking after the session is closed?

The same thing as in the description, but in the SSH case SIGTERM is sent by the sshd daemon. Steps to reproduce:

  1. Connect by ssh to some host with podman
  2. Launch podman run --rm ...
  3. Close terminal window (yes, by clicking 'close' button) - this will lead to TCP session close
  4. Observe podman state on the remote host.

But FYI: I tried to reproduce this on an amd64 machine and had no success, whereas on ARMv7 roughly 50% of attempts lead to the issue. My ARMv7 host is significantly slower - maybe that's why I ran into the deadlocks this time =)

vrothberg added a commit to vrothberg/libpod that referenced this issue Aug 26, 2022
Commit 30e7cbc accidentally added a deadlock as Podman was waiting
for the exit code to show up when the container transitioned to stopped.
Code paths that require the exit code to be written (by the cleanup
process) should already be using `(*Container).Wait()` in a deadlock
free way.

[NO NEW TESTS NEEDED] as I did not manage to find a reproducer that
would work in CI.  Ultimately, it's a race condition.

Fixes: containers#15492
Signed-off-by: Valentin Rothberg <[email protected]>
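
For context, the "deadlock free way" mentioned in the commit message can be pictured as a wait that never holds the container lock across the whole wait, only for each short poll, so a concurrent Kill (or any other lock holder) can always make progress. The following is a rough sketch under that assumption - it is not the actual (*Container).Wait()/WaitForExit() code, and the type and field names are invented.

package main

import (
	"errors"
	"fmt"
	"sync"
	"time"
)

// container is again a stand-in for libpod's Container type.
type container struct {
	mu       sync.Mutex
	exited   bool
	exitCode int32
}

// waitForExit takes and releases the lock on every poll instead of
// holding it for the duration of the wait, which is what keeps this
// pattern deadlock-free.
func (c *container) waitForExit(poll, timeout time.Duration) (int32, error) {
	deadline := time.Now().Add(timeout)
	for {
		c.mu.Lock()
		exited, code := c.exited, c.exitCode
		c.mu.Unlock()

		if exited {
			return code, nil
		}
		if time.Now().After(deadline) {
			return -1, errors.New("timed out waiting for container to exit")
		}
		time.Sleep(poll)
	}
}

func main() {
	c := &container{}

	// Simulate the cleanup process recording the exit code a bit later.
	go func() {
		time.Sleep(300 * time.Millisecond)
		c.mu.Lock()
		c.exited, c.exitCode = true, 0
		c.mu.Unlock()
	}()

	code, err := c.waitForExit(50*time.Millisecond, 5*time.Second)
	fmt.Println("exit code:", code, "err:", err)
}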
@vrothberg
Member

@tyler92 could you test #15494?

@tyler92
Contributor Author

tyler92 commented Aug 26, 2022

Sure, give me some time.

@tyler92
Contributor Author

tyler92 commented Aug 26, 2022

After several iterations I don't see deadlocks anymore, but things look a little bit strange.

  1. After some time I see "Error: error creating container storage: the container name "label-for-kill" is already in use by 951c6a6b159eeaf138ade41993bcdeee83956842a7eed16605e7a4e89e7e9e36. You have to remove that container to be able to reuse that name: that name is already in use". It looks like the --rm option didn't complete its work.
  2. There are processes in the process list that never exit:
17308 S  `- /usr/bin/crun <...> create --bundle <...>
17829 S  `- /usr/bin/crun <...> create --bundle <...>
19599 S  `- /usr/bin/crun <...> create --bundle <...>
19976 S  `- /usr/bin/conmon <...> --exit-command-arg cleanup <...>
19980 S      `- /usr/bin/crun  <...> create --bundle <...>

This process list keeps growing until the error from (1) appears. And even if I stop my script, these processes still exist.

@vrothberg
Member

I don't have a good explanation for that observation. @mheon WDYT?

@mheon
Member

mheon commented Aug 26, 2022

The hung crun instances are concerning - those are OCI runtimes that have escaped the supervision of a Conmon. They could be completely hung in their setup code, but crun create will also naturally stop just short of actually exec()ing the first process in the container, so my initial inclination is that each of these stopped naturally, waiting for Podman to do a final crun start that never happened - the equivalent of Podman stopping midway through podman run, possibly due to an error. This could explain why the container failed to clean up fully (the crun process is using files from the container's rootfs, so we can't unmount it, can't remove it from c/storage, and the name stays in use). I don't know what could actually be causing this, though. I would assume it's a failed podman run - are there any logs to that effect?
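
For readers unfamiliar with the OCI runtime lifecycle mheon describes, here is a rough Go sketch of the create/start split - purely illustrative, assuming crun is installed and that /tmp/label-for-kill is an already prepared OCI bundle; both the container ID and the bundle path are made up.

package main

import (
	"log"
	"os/exec"
)

func main() {
	const id = "label-for-kill"          // hypothetical container ID
	const bundle = "/tmp/label-for-kill" // hypothetical OCI bundle (config.json + rootfs)

	// "create" sets up namespaces, mounts, etc. and returns, leaving a
	// parked runtime process that has not yet exec()ed the workload.
	if out, err := exec.Command("crun", "create", "--bundle", bundle, id).CombinedOutput(); err != nil {
		log.Fatalf("crun create: %v: %s", err, out)
	}

	// If the managing process (Podman/conmon in the real flow) dies at
	// this point, that parked process lingers - matching the stuck
	// "crun ... create --bundle ..." entries in the process list above.

	// "start" releases the parked process so it exec()s the container's
	// first process.
	if out, err := exec.Command("crun", "start", id).CombinedOutput(); err != nil {
		log.Fatalf("crun start: %v: %s", err, out)
	}
}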

@tyler92
Contributor Author

tyler92 commented Aug 27, 2022

Ok, thanks for the fix. I'll investigate other problems and create separate issues with details if necessary.

@vrothberg
Member

Thanks a lot, @tyler92 !

vrothberg added a commit to vrothberg/libpod that referenced this issue Aug 28, 2022
Commit 30e7cbc accidentally added a deadlock as Podman was waiting
for the exit code to show up when the container transitioned to stopped.
Code paths that require the exit code to be written (by the cleanup
process) should already be using `(*Container).Wait()` in a deadlock
free way.

[NO NEW TESTS NEEDED] as I did not manage to find a reproducer that
would work in CI.  Ultimately, it's a race condition.

Backport-for: containers#15492
Signed-off-by: Valentin Rothberg <[email protected]>
@tyler92
Contributor Author

tyler92 commented Aug 29, 2022

I'm not ready to create a separate issue right now, but I gathered logs:

podman-run.log (here is one line from test script)
process-list.txt

It looks like podman run was killed right after the conmon process started. And conmon is not killed by SIGTERM (because its child crun is not killed?). What do you think - should I create an issue with this information? It's a somewhat artificial case, but it can leave useless processes behind.

@mheon
Member

mheon commented Aug 29, 2022

If Podman is ending up in a bad state (podman inspect output for the container in question would help), then this is definitely worth a bug.

@tyler92
Contributor Author

tyler92 commented Aug 29, 2022

podman-inspect.log

Does it look like a bad state?

@mheon
Member

mheon commented Aug 29, 2022

Yep. The container is stuck in Created state, when it should be in Initialized. It doesn't have a PID registered for Conmon. It looks like Podman died before it could register that Conmon had started.

vrothberg added a commit to vrothberg/libpod that referenced this issue Sep 12, 2022
Commit 30e7cbc accidentally added a deadlock as Podman was waiting
for the exit code to show up when the container transitioned to stopped.
Code paths that require the exit code to be written (by the cleanup
process) should already be using `(*Container).Wait()` in a deadlock
free way.

[NO NEW TESTS NEEDED] as I did not manage to find a reproducer that
would work in CI.  Ultimately, it's a race condition.

Backport-for: containers#15492
BZ: https://bugzilla.redhat.com/show_bug.cgi?id=2124716
BZ: https://bugzilla.redhat.com/show_bug.cgi?id=2125647
Signed-off-by: Valentin Rothberg <[email protected]>
@github-actions github-actions bot added the locked - please file new issue/PR label Sep 17, 2023
@github-actions github-actions bot locked as resolved and limited conversation to collaborators Sep 17, 2023