
healthcheck run: error: ctr does not exist in database: no such container #16075

Closed · edsantiago opened this issue Oct 6, 2022 · 6 comments · Fixed by #16129
Labels: flakes (Flakes from Continuous Integration), locked - please file new issue/PR

Comments

@edsantiago (Member) commented Oct 6, 2022

Seen just now in an in-flight PR:

[+1345s] not ok 292 podman create --health-on-failure=kill
...
# podman healthcheck run APUcS4ihNr
Error: container 43712cb6940abc3a35e883961fa79d4632674e40ca72394fec96535842fdf06c does not exist in database: no such container
[ rc=125 (** EXPECTED 0 **) ]

f36 local root aarch64. Also seen in my flake logs, but only once, so I hadn't filed it:

[sys] 288 podman create --health-on-failure=kill
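
For context, the failing test exercises roughly this flow. A minimal sketch, with an illustrative container name (hc-demo), health command, and image; this is not the actual test code:

```bash
# Sketch of the scenario under test; names and image are illustrative.
# A health command that always fails, combined with --health-on-failure=kill,
# makes podman kill the container when the healthcheck fails.
podman create --name hc-demo \
    --health-cmd /bin/false \
    --health-on-failure=kill \
    quay.io/libpod/alpine:latest top
podman start hc-demo

# This is the step that flaked in CI: in the test the container runs under
# a systemd service, so after the kill systemd recreates it. If that restart
# is still in flight, the old container ID is gone from the database and
# this command fails with "no such container".
podman healthcheck run hc-demo
```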

edsantiago added the flakes label on Oct 6, 2022
@edsantiago (Member, Author)

Another one on f36 aarch64 root.

@edsantiago (Member, Author)

And again.

@edsantiago (Member, Author)

This one is really blowing up: it has failed seven reruns (eight total) in nightly cron.

@rhatdan (Member) commented Oct 10, 2022

@vrothberg PTAL

@vrothberg (Member)

I'll take a look.

vrothberg added a commit to vrothberg/libpod that referenced this issue Oct 11, 2022
The on-failure=kill system tests turned out to be flaky.
Once the container has been killed, the test waits for
systemd to restart the service by running `container inspect`
for 10 seconds.  The subsequent `healthcheck run` was the
flake point, which suggests that the 10-second timeout is
not high enough, presumably when the CI nodes are under
pressure.

Fixes: containers#16075
Signed-off-by: Valentin Rothberg <[email protected]>
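
The commit message implies a test-side wait-and-retry before running the healthcheck. A minimal sketch of that kind of loop, assuming plain bash with an illustrative container name and timeout budget (the actual change lives in the referenced PR):

```bash
# Poll until the restarted container is back in the database, then run
# the healthcheck. The name and the 30-second budget are illustrative.
ctr=hc-demo
timeout=30
while [ "$timeout" -gt 0 ]; do
    if podman container inspect --format '{{.State.Status}}' "$ctr" >/dev/null 2>&1; then
        break
    fi
    sleep 1
    timeout=$((timeout - 1))
done
if [ "$timeout" -eq 0 ]; then
    echo "timed out waiting for systemd to restart the service" >&2
    exit 1
fi

# Only once inspect succeeds is it safe to run the flaking step:
podman healthcheck run "$ctr"
```

A longer budget trades test runtime for robustness on loaded CI nodes; polling inspect directly avoids a fixed sleep that would always pay the worst-case cost.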
@vrothberg (Member)

Opened #16112. Looks like a test-side fix to me.

github-actions bot added the locked - please file new issue/PR label and locked the conversation as resolved on Sep 13, 2023