feat: Allow to specify grace period for pod GC #5033

terrytangyuan · 2021-02-05T03:02:49Z

Use case: we need some grace period to allow other services to complete the pod information collection (e.g. log and db persistence), especially during high load where those services have certain amount of delays.

Checklist:

My organization is added to USERS.md.

terrytangyuan · 2021-02-05T03:09:11Z

cc @stefansedich who's looking for this feature as well.

stefansedich · 2021-02-05T03:11:31Z

cc @stefansedich who's looking for this feature as well.

Love your work! Question however will this do what we want? If a pod shuts down right away the grace period won't help right? I believe this handles the time between pod stop and force kill.

terrytangyuan · 2021-02-05T03:21:00Z

A pod is added to the podCleanupQueue when it meets the podGCStrategy. This grace period is the time to wait before the pod in the queue gets deleted.

alexec · 2021-02-09T19:14:38Z

config/config.go

@@ -92,6 +92,10 @@ type Config struct {
 	// PodSpecLogStrategy enables the logging of podspec on controller log.
 	PodSpecLogStrategy PodSpecLogStrategy `json:"podSpecLogStrategy,omitempty"`

+	// PodGCGracePeriodSeconds specifies the duration in seconds before the pods in the GC queue get deleted.
+	// Value must be non-negative integer. Defaults to zero, which indicates delete immediately.
+	PodGCGracePeriodSeconds int64 `json:"podGCGracePeriodSeconds,omitempty"`


uint64 allowed?

Updated this to be *int64 to be consistent with the type of DeleteOptions.GracePeriodSeconds.

alexec · 2021-02-09T19:15:24Z

workflow/controller/controller.go

-			err := pods.Delete(ctx, podName, metav1.DeleteOptions{PropagationPolicy: &propagation})
+			err := pods.Delete(ctx, podName, metav1.DeleteOptions{
+				PropagationPolicy:  &propagation,
+				GracePeriodSeconds: pointer.Int64Ptr(wfc.Config.PodGCGracePeriodSeconds)})


this is 30s by default, so presumably, you'll make this longer?

Yes, it's necessary to make this longer for certain scenarios.

@terrytangyuan @alexec I am still trying to understand how this change waits before deleing pods, this call here deletes the pod setting the grace-period-seconds, which as far as I understand what will happen:

SIGTERM is sent to container

SIGKILL is sent if container does not gracefully shutdown within the grace-period

If my container exits immediately after the SIGTERM or in this case is not even running as it is completed how is the grace period helping to delay it's deletion?

For posterity, the above was resolved and implemented in #6168

Signed-off-by: terrytangyuan <[email protected]>

alexec self-assigned this Feb 8, 2021

alexec reviewed Feb 9, 2021

View reviewed changes

terrytangyuan added 3 commits February 9, 2021 15:35

wip

3e5c9c8

Signed-off-by: terrytangyuan <[email protected]>

feat: Allow to specify grace period for pod GC

d0b18d4

Signed-off-by: terrytangyuan <[email protected]>

Update type

d87b908

Signed-off-by: terrytangyuan <[email protected]>

alexec approved these changes Feb 9, 2021

View reviewed changes

alexec merged commit daf1a71 into argoproj:master Feb 9, 2021

alexec added this to the v3.0 milestone Feb 9, 2021

terrytangyuan deleted the gc-strategy-graceperiod branch February 9, 2021 21:31

simster7 mentioned this pull request Feb 16, 2021

v2.12.9 cherry-pick #5119

Closed

33 tasks

stefansedich mentioned this pull request May 19, 2021

feat: Add support for deletion delay when using PodGC #5952

Closed

1 task

stefansedich mentioned this pull request Jun 17, 2021

feat(controller): Delay pod GC deletion by a configurable amount (default 5s) #6168

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Allow to specify grace period for pod GC #5033

feat: Allow to specify grace period for pod GC #5033

terrytangyuan commented Feb 5, 2021 •

edited

Loading

terrytangyuan commented Feb 5, 2021

stefansedich commented Feb 5, 2021 •

edited

Loading

terrytangyuan commented Feb 5, 2021 •

edited

Loading

alexec Feb 9, 2021

terrytangyuan Feb 9, 2021

alexec Feb 9, 2021

terrytangyuan Feb 9, 2021

stefansedich Feb 23, 2021

agilgur5 Oct 15, 2024 •

edited

Loading

feat: Allow to specify grace period for pod GC #5033

feat: Allow to specify grace period for pod GC #5033

Conversation

terrytangyuan commented Feb 5, 2021 • edited Loading

terrytangyuan commented Feb 5, 2021

stefansedich commented Feb 5, 2021 • edited Loading

terrytangyuan commented Feb 5, 2021 • edited Loading

alexec Feb 9, 2021

Choose a reason for hiding this comment

terrytangyuan Feb 9, 2021

Choose a reason for hiding this comment

alexec Feb 9, 2021

Choose a reason for hiding this comment

terrytangyuan Feb 9, 2021

Choose a reason for hiding this comment

stefansedich Feb 23, 2021

Choose a reason for hiding this comment

agilgur5 Oct 15, 2024 • edited Loading

Choose a reason for hiding this comment

terrytangyuan commented Feb 5, 2021 •

edited

Loading

stefansedich commented Feb 5, 2021 •

edited

Loading

terrytangyuan commented Feb 5, 2021 •

edited

Loading

agilgur5 Oct 15, 2024 •

edited

Loading