multiregion: all regions start in running if no max_parallel #8209

tgross · 2020-06-19T13:53:48Z

If max_parallel is not set, all regions should begin in a running state
rather than a pending state. Otherwise the first region is set to running
and then all the remaining regions once it enters blocked. That behavior is
technically correct in that we have at most max_parallel regions running,
but definitely not what a user expects.

If `max_parallel` is not set, all regions should begin in a `running` state rather than a `pending` state. Otherwise the first region is set to `running` and then all the remaining regions once it enters `blocked. That behavior is technically correct in that we have at most `max_parallel` regions running, but definitely not what a user expects.

tgross · 2020-06-19T13:54:43Z

scheduler/reconcile.go

@@ -198,7 +198,6 @@ func (a *allocReconciler) Compute() *reconcileResults {
 	// Detect if the deployment is paused
 	if a.deployment != nil {
 		a.deploymentPaused = a.deployment.Status == structs.DeploymentStatusPaused
-		//||			a.deployment.Status == structs.DeploymentStatusPending


notnoop

makes sense to me and should go into beta - but one question.

notnoop · 2020-06-19T13:58:46Z

scheduler/reconcile.go

+			// region starts in the running state
+			if a.job.IsMultiregion() &&
+				a.job.Multiregion.Strategy != nil &&
+				a.job.Multiregion.Strategy.MaxParallel != 0 &&


I missed some of the changes of PRs - do we also want to check MaxParallel against current region index (e.g. second region with MaxParallel=2); is that handled elsewhere?

Oh that's a good catch. That's the same sort of thing -- we'd safely have at most max_parallel going, but the operator probably expects us to start with max_parallel. Will fix.

The logic here is getting gnarly though so I'm going to pull it out into a function.

I've pulled this logic out to a function on Job in f10cc93

cgbaker

💯

In #8209 we fixed the max_parallel stanza for multiregion by introducing the IsMultiregionStarter check, but didn't apply it to the earlier place its required. The result is that deployments start but don't place allocations.

In #8209 we fixed the `max_parallel` stanza for multiregion by introducing the `IsMultiregionStarter` check, but didn't apply it to the earlier place its required. The result is that deployments start but don't place allocations.

github-actions · 2023-01-01T02:19:52Z

I'm going to lock this pull request because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active contributions.
If you have found a problem that seems related to this change, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

tgross commented Jun 19, 2020

View reviewed changes

tgross requested review from drewbailey, notnoop and cgbaker June 19, 2020 13:55

drewbailey approved these changes Jun 19, 2020

View reviewed changes

notnoop approved these changes Jun 19, 2020

View reviewed changes

pull out max_parallel check to function

f10cc93

cgbaker approved these changes Jun 19, 2020

View reviewed changes

tgross merged commit 5a068e6 into master Jun 19, 2020

tgross deleted the b-multiregion-maxparallel-unset branch June 19, 2020 15:17

tgross mentioned this pull request Jun 19, 2020

multiregion: initial deploymentPaused must match start condition #8215

Merged

tgross added this to the 0.12.0 milestone Jun 25, 2020

github-actions bot locked as resolved and limited conversation to collaborators Jan 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

multiregion: all regions start in running if no max_parallel #8209

multiregion: all regions start in running if no max_parallel #8209

tgross commented Jun 19, 2020 •

edited

Loading

tgross Jun 19, 2020

notnoop left a comment

notnoop Jun 19, 2020

tgross Jun 19, 2020

tgross Jun 19, 2020

tgross Jun 19, 2020

cgbaker left a comment

github-actions bot commented Jan 1, 2023

multiregion: all regions start in running if no max_parallel #8209

multiregion: all regions start in running if no max_parallel #8209

Conversation

tgross commented Jun 19, 2020 • edited Loading

tgross Jun 19, 2020

Choose a reason for hiding this comment

notnoop left a comment

Choose a reason for hiding this comment

notnoop Jun 19, 2020

Choose a reason for hiding this comment

tgross Jun 19, 2020

Choose a reason for hiding this comment

tgross Jun 19, 2020

Choose a reason for hiding this comment

tgross Jun 19, 2020

Choose a reason for hiding this comment

cgbaker left a comment

Choose a reason for hiding this comment

github-actions bot commented Jan 1, 2023

tgross commented Jun 19, 2020 •

edited

Loading