
Repeat system pruning until there is nothing removed #8599

Merged (1 commit, Dec 9, 2020)

Conversation

rhatdan (Member) commented Dec 4, 2020

Signed-off-by: Daniel J Walsh [email protected]

@openshift-ci-robot openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Dec 4, 2020
rhatdan (Member, Author) commented Dec 4, 2020

Fixes: #7990

rhatdan (Member, Author) commented Dec 4, 2020

@srcshelton PTAL

srcshelton (Contributor):

Okay, let me patch this into podman-2.2.0 and I'll check it out.

I'll do a system rebuild without the patch and check that multiple runs are required, and then again with it to confirm it's fixed.

This may take a good number of hours…

(Although it does look as if it can't fail to resolve the problem!)

	if pruneOptions.Volume {
		fmt.Println("Deleted Volumes")
		err = utils.PrintVolumePruneResults(response.VolumePruneReport)
	for {
Member:

Recommendation: We should put a max iteration count on this - if we haven't removed everything in, say, 25 loops, we probably have a problem.

Contributor:

Just a quick note to agree, but suggest a larger number: I've seen 40-odd manual iterations before… so set the bar at 50?

Member:

Yeah, I'm not at all a fan of a naked for loop.

Member:

If each iteration takes a beat, we might want to add a message like "Working" that adds a dot after each iteration or after some number of iterations.

Member Author:

Each successful iteration will print the pruned items, so there's no need for a heartbeat.

@rhatdan rhatdan force-pushed the prune branch 2 times, most recently from a14a8f6 to ff0d964 on December 5, 2020 11:38
srcshelton (Contributor):

... a couple of questions:

  • The earlier patch to repeatedly remove stale images when using podman image prune was said to use the same code-path as the image-pruning stage of podman system prune - is this not actually the case?

  • Did the podman image prune patch also have a maximum number of iterations set - and, if so, do both commands now share a consistent limit?

srcshelton (Contributor):

Also, I just did another system prune which removed a large number of images, and the output froze for quite some time with only half of the last hash to be removed written to the terminal. Could output buffering be disabled for this operation? (I assume an output buffer was filled when half the hash had been printed, and the delay in further output was the buffer not flushing again until it filled.)

(It could also be a symptom of another issue, such as the host system locking up temporarily due to high I/O? I didn't see any other signs to suggest this, though: other interactive tasks running at the same time weren't affected, and it was a long pause, at least 30s or so once I'd noticed it, though it could have been much longer since I was doing other things and only checking the output occasionally.)

TomSweeneyRedHat (Member):

Code LGTM, a couple of small doc nits. Any chance to get a test geared towards this?

rhatdan (Member, Author) commented Dec 7, 2020

@srcshelton podman system prune calls the same internal function to prune images as podman image prune

As it does with podman volume prune, container prune, and pod prune, so there should not be a difference. I don't think the pause was caused by the output printing, but by some kind of locking in the storage layer. If some other operation was running on your system, there is a good chance that the removal of content was blocked on a lock.

srcshelton (Contributor):

Interesting - would a wait on a lock cause the output to pause half-way through printing an image hash, though?

(Even if it is a lock-wait, unbuffered output would at least allow full lines to be printed whilst waiting?)

Whilst there were other containers running when the pause in output occurred, these had been running for several days at that point, and there was no other podman activity: nothing was starting or stopping or creating/updating images.

My question was just trying to understand how image pruning was fixed when system pruning uses the same internal function, yet system pruning has needed fixing separately. And whether there's an iteration limit for image pruning and, if so, whether the two methods of pruning images use the same limit, or whether they differ (potentially causing future confusion!)?

rhatdan (Member, Author) commented Dec 7, 2020

All of the printing is being done with fmt.Println(). We could change this, but it does seem like a corner case.

I did not do anything for image pruning. I have just added a loop to system prune to try pruning again, since one pass of pruning can free up content for another pass.

@containers/podman-maintainers PTAL

@@ -16,7 +16,7 @@ By default, volumes are not removed to prevent important data from being deleted
## OPTIONS
#### **--all**, **-a**

-Remove all unused images not just dangling ones.
+Recursively remove all unused images, not just dangling ones. (Maximum 50 iterations.)
Member:

"all unused images" ... I think we remove more than just images

fmt.Println("Deleted Volumes")
err = utils.PrintVolumePruneResults(response.VolumePruneReport)

const MAX = 50
Member:

It seems we're now working around the fact that registry.ContainerEngine().SystemPrune(...) isn't doing its job correctly. Having the code scattered between cmd/podman and pkg/domain/... seems like a recipe for trouble.

Could we move all the logic to registry.ContainerEngine().SystemPrune(...)? I think it should remove all data, even if there are more than 50 iterations. It's not friendly to use. If I do a rm -rf * I don't want to ls afterwards to check if things were really removed.

vrothberg (Member) left a comment:

Code LGTM

saschagrunert (Member) left a comment:

LGTM

openshift-ci-robot (Collaborator):

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: rhatdan, saschagrunert

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:
  • OWNERS [rhatdan,saschagrunert]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

saschagrunert (Member):

Ah alright:

Inconsistent subcommand descriptions:
  podman-system-prune.1.md         = 'Remove all unused pod, container, image and volume data'
  podman-system.1.md               = 'Remove all unused container, image and volume data'
Please ensure that the NAME section of podman-system-prune.1.md
matches the subcommand description in podman-system.1.md

@rhatdan rhatdan added the lgtm Indicates that a PR is ready to be merged. label Dec 9, 2020
@openshift-merge-robot openshift-merge-robot merged commit b875c5c into containers:master Dec 9, 2020
@github-actions github-actions bot added the locked - please file new issue/PR Assist humans wanting to comment on an old issue or PR with locked comments. label Sep 24, 2023
@github-actions github-actions bot locked as resolved and limited conversation to collaborators Sep 24, 2023