Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

checkpoint/restore: implement --manage-cgroups-mode ignore #3546

Merged
merged 7 commits into from
Jan 27, 2023

Conversation

kolyshkin
Copy link
Contributor

@kolyshkin kolyshkin commented Aug 2, 2022

This patchset fixes a few issues and adds support for --manage-cgroups-mode ignore. This option allows to restore a container into a different cgroup than the original one. A test case is added to check that it works as expected.

See individual commits for details.

Loosely based on #3447 and my earlier work from 2021.

Closes: #3447

Also, this uses the new mode in lazy checkpoint/restore test, hopefully fixing this test flakiness.

Fixes: #2760
Fixes: #2924
Fixes: #2475

PS I am thinking whether this (--manage-cgroups-mode ignore should become a default in 1.2.0, because otherwise changing the cgroupsPath in container's config.json before restore doesn't have any effect)

@kolyshkin kolyshkin force-pushed the criu-add-ignore-cgroup branch 2 times, most recently from 21a635e to d61b160 Compare August 2, 2022 17:36
@kolyshkin kolyshkin marked this pull request as ready for review August 2, 2022 17:37
@kolyshkin kolyshkin force-pushed the criu-add-ignore-cgroup branch from d61b160 to 0dddc06 Compare August 4, 2022 19:13
@kolyshkin kolyshkin added this to the 1.2.0 milestone Aug 11, 2022
@kolyshkin
Copy link
Contributor Author

@opencontainers/runc-maintainers PTAL (this, among the other things, fixes a few flaky tests we had for a few years)

@kolyshkin
Copy link
Contributor Author

@opencontainers/runc-maintainers PTAL

@kolyshkin
Copy link
Contributor Author

@adrianreber PTAL

@kolyshkin
Copy link
Contributor Author

@opencontainers/runc-maintainers PTAL (this, among the other things, fixes a few flaky tests we had for a few years)

@kolyshkin
Copy link
Contributor Author

@adrianreber PTAL

@adrianreber
Copy link
Contributor

Looks good. Thanks!

@kolyshkin kolyshkin force-pushed the criu-add-ignore-cgroup branch from dfd5b65 to 8268e56 Compare October 27, 2022 18:31
@kolyshkin
Copy link
Contributor Author

@opencontainers/runc-maintainers PTAL 🙏🏻

@kolyshkin
Copy link
Contributor Author

@opencontainers/runc-maintainers PTAL 🙏🏻
This fixes a few real bugs on which I worked (on and off) for the last few years, and it's a pity to see this sitting 4 months without a review.

@kolyshkin kolyshkin force-pushed the criu-add-ignore-cgroup branch from 8268e56 to f5f17cd Compare November 2, 2022 22:02
test -d "$orig_path"

runc checkpoint --work-path ./work-dir --manage-cgroups-mode ignore test_busybox
grep -B 5 Error ./work-dir/dump.log || true
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think this is robust, but I don't have an alternative idea, so LGTM.

Eventually we need to have a robust error reporting system.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is just to print anything containing Error (with some context) in case the checkpointing failed. It's merely a way to debug criu failures, and does not affect the test itself (only its output in case of an error). The || true part here is so that the test won't fail in case there's no error; IOW, we ignore grep exit code.

Alternatively, we could create an after-test artefact containing all the files, but this works OK so far.

Thinking about it, we might work on making runc do something like what grep does here in case of an error. Currently, this is just criu writing the log file.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Filed #3711 to not forget about it

AkihiroSuda
AkihiroSuda previously approved these changes Nov 3, 2022
Merge the logic of setPageServer, setManageCgroupsMode, and
setEmptyNsMask into criuOptions. This does three things:

1. Fixes ignoring --manage-cgroups-mode on restore;
2. Simplifies the code in checkpoint.go and restore.go;
3. Ensures issues like 1 won't happen again.

Signed-off-by: Kir Kolyshkin <[email protected]>
 - add the new mode and document it;
 - slightly improve the --help output;
 - slightly simplify the parsing code.

Signed-off-by: Kir Kolyshkin <[email protected]>
When manage-cgroups-mode: ignore is used, criu still needs to know the
cgroup path to work properly (see [1]).

Revert "libct/criuApplyCgroups: don't set cgroup paths for v2"

This reverts commit d5c57dc.

[1]: checkpoint-restore/criu#1793 (comment)

Signed-off-by: Kir Kolyshkin <[email protected]>
I don't want to implement it now, because this might result in some
new issues, but this is definitely something that is worth implementing.

Signed-off-by: Kir Kolyshkin <[email protected]>
This test checks that the container is restored into a different cgroup.

To do so, a user should
 - use --manage-cgroups-mode ignore on both checkpoint and restore;
 - change the cgroupsPath value in config.json before restoring.

The test does some checks to ensure that its logic is correct, and that
after the restore the old (original) cgroup does not exist, the new one
exists, and the container's init is in that new cgroup.

Signed-off-by: Kir Kolyshkin <[email protected]>
When doing a lazy checkpoint/restore, we should not restore into the
same cgroup, otherwise there is a race which result in occasional
killing of the restored container (GH opencontainers#2760, opencontainers#2924).

The fix is to use --manage-cgroup-mode=ignore, which allows to restore
into a different cgroup.

Note that since cgroupsPath is not set in config.json, the cgroup is
derived from the container name, so calling set_cgroups_path is not
needed.

For the previous (unsuccessful) attempt to fix this, as well as detailed
(and apparently correct) analysis, see commit 36fe3cc.

Signed-off-by: Kir Kolyshkin <[email protected]>
@kolyshkin
Copy link
Contributor Author

Rebased to resolve a conflict caused by #3655.

@AkihiroSuda please re-LGTM

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
4 participants