Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve robustness of salt-master and upgrade #3125

Merged
merged 5 commits into from
Feb 18, 2021

Conversation

TeddyAndrieux
Copy link
Collaborator

Component:

'salt', 'lifecycle'

Context:

Upgrade get a lot of flakies on environment that are a bit slow

Summary:

  • Backport 9ee0c53
  • Bump timeout waiting for containers after local container engine upgrade from 60s to 120s
  • Backport 73835be
  • Increase Salt master sock_pool_size (from 1 to 15) and worker_threads (from 5 to 10)

After kubelet upgrade all static pod may restart, so we need to wait for
some "useful" pods to be running before going further in the upgrade
process.
NOTE: That before this PR we already sleep 20 seconds, so this commit
only add a proper check after this sleep

(cherry picked from commit 9ee0c53)
In upgrade script, bump the default to wait for a container to be ready
from default 60s to 120s, to avoid some flakiness on environment that
are a bit slow during upgrade process
@TeddyAndrieux TeddyAndrieux added topic:lifecycle Issues related to upgrade or downgrade of MetalK8s complexity:medium Something that requires one or few days to fix topic:salt Everything related to SaltStack in our product labels Feb 17, 2021
@TeddyAndrieux TeddyAndrieux requested a review from a team February 17, 2021 16:18
@bert-e
Copy link
Contributor

bert-e commented Feb 17, 2021

Hello teddyandrieux,

My role is to assist you with the merge of this
pull request. Please type @bert-e help to get information
on this process, or consult the user documentation.

Status report is not available.

@bert-e
Copy link
Contributor

bert-e commented Feb 17, 2021

Conflict

A conflict has been raised during the creation of
integration branch w/2.8/improvement/improve-upgrade-robustness with contents from improvement/improve-upgrade-robustness
and development/2.8.

I have not created the integration branch.

Here are the steps to resolve this conflict:

 $ git fetch
 $ git checkout -B w/2.8/improvement/improve-upgrade-robustness origin/development/2.8
 $ git merge origin/improvement/improve-upgrade-robustness
 $ # <intense conflict resolution>
 $ git commit
 $ git push -u origin w/2.8/improvement/improve-upgrade-robustness

@TeddyAndrieux TeddyAndrieux force-pushed the improvement/improve-upgrade-robustness branch from 43c6ddf to a29fe17 Compare February 17, 2021 16:18
@bert-e
Copy link
Contributor

bert-e commented Feb 17, 2021

Waiting for approval

The following approvals are needed before I can proceed with the merge:

  • the author

  • one peer

Peer approvals must include at least 1 approval from the following list:

@TeddyAndrieux
Copy link
Collaborator Author

/approve

@bert-e
Copy link
Contributor

bert-e commented Feb 17, 2021

Waiting for approval

The following approvals are needed before I can proceed with the merge:

  • the author

  • one peer

Peer approvals must include at least 1 approval from the following list:

The following options are set: approve

@TeddyAndrieux TeddyAndrieux force-pushed the improvement/improve-upgrade-robustness branch from a29fe17 to 8bc1f3e Compare February 17, 2021 18:45
@bert-e
Copy link
Contributor

bert-e commented Feb 17, 2021

History mismatch

Merge commit #d562b23cde07622357e2bd752b87ca700c3d228f on the integration branch
w/2.8/improvement/improve-upgrade-robustness is merging a branch which is neither the current
branch improvement/improve-upgrade-robustness nor the development branch
development/2.8.

It is likely due to a rebase of the branch improvement/improve-upgrade-robustness and the
merge is not possible until all related w/* branches are deleted or updated.

Please use the reset command to have me reinitialize these branches.

The following options are set: approve

@TeddyAndrieux
Copy link
Collaborator Author

/reset

@bert-e
Copy link
Contributor

bert-e commented Feb 17, 2021

Reset complete

I have successfully deleted this pull request's integration branches.

The following options are set: approve

@bert-e
Copy link
Contributor

bert-e commented Feb 17, 2021

Conflict

A conflict has been raised during the creation of
integration branch w/2.8/improvement/improve-upgrade-robustness with contents from improvement/improve-upgrade-robustness
and development/2.8.

I have not created the integration branch.

Here are the steps to resolve this conflict:

 $ git fetch
 $ git checkout -B w/2.8/improvement/improve-upgrade-robustness origin/development/2.8
 $ git merge origin/improvement/improve-upgrade-robustness
 $ # <intense conflict resolution>
 $ git commit
 $ git push -u origin w/2.8/improvement/improve-upgrade-robustness

The following options are set: approve

Time to time, especially on really slow platform, we got failure because
salt state execution timeout. Increase salt-master default timeout to 20

(cherry picked from commit 73835be)
Time to time salt-master get overloaded because he receive to much
query, for example during upgrade and one environment a bit slow some
salt states may timeout and make the upgrade fail.
To avoid that kind of issue just bump the `sock_pool_size` on salt
master (from 1 to 15) to avoid blocking waiting for zeromq communications
and also bump the `worker_threads` on salt master (from 5 to 10) to
avoid some failure if you have too many communication with the salt
master (e.g.: because of upgrade + storage operator)

Sees: saltstack/salt#53147
@TeddyAndrieux TeddyAndrieux force-pushed the improvement/improve-upgrade-robustness branch from 8bc1f3e to 59d0969 Compare February 17, 2021 18:53
@bert-e
Copy link
Contributor

bert-e commented Feb 17, 2021

Waiting for approval

The following approvals are needed before I can proceed with the merge:

  • the author

  • one peer

Peer approvals must include at least 1 approval from the following list:

The following options are set: approve

Copy link
Contributor

@gdemonet gdemonet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code is OK, just an edit to make for the changelog entry (sorry 😇)

CHANGELOG.md Outdated Show resolved Hide resolved
@bert-e
Copy link
Contributor

bert-e commented Feb 17, 2021

Waiting for approval

The following approvals are needed before I can proceed with the merge:

  • the author

  • one peer

Peer approvals must include at least 1 approval from the following list:

The following reviewers are expecting changes from the author, or must review again:

The following options are set: approve

@TeddyAndrieux TeddyAndrieux force-pushed the improvement/improve-upgrade-robustness branch from 59d0969 to debe85e Compare February 17, 2021 19:00
@bert-e
Copy link
Contributor

bert-e commented Feb 17, 2021

History mismatch

Merge commit #59d096901b694558ab7a9bc40f9a77b3962f6c3c on the integration branch
w/2.8/improvement/improve-upgrade-robustness is merging a branch which is neither the current
branch improvement/improve-upgrade-robustness nor the development branch
development/2.8.

It is likely due to a rebase of the branch improvement/improve-upgrade-robustness and the
merge is not possible until all related w/* branches are deleted or updated.

Please use the reset command to have me reinitialize these branches.

The following options are set: approve

@TeddyAndrieux
Copy link
Collaborator Author

/reset

@bert-e
Copy link
Contributor

bert-e commented Feb 17, 2021

Reset complete

I have successfully deleted this pull request's integration branches.

The following options are set: approve

@bert-e
Copy link
Contributor

bert-e commented Feb 17, 2021

Conflict

A conflict has been raised during the creation of
integration branch w/2.8/improvement/improve-upgrade-robustness with contents from improvement/improve-upgrade-robustness
and development/2.8.

I have not created the integration branch.

Here are the steps to resolve this conflict:

 $ git fetch
 $ git checkout -B w/2.8/improvement/improve-upgrade-robustness origin/development/2.8
 $ git merge origin/improvement/improve-upgrade-robustness
 $ # <intense conflict resolution>
 $ git commit
 $ git push -u origin w/2.8/improvement/improve-upgrade-robustness

The following options are set: approve

Copy link
Contributor

@gdemonet gdemonet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:shipit:

@bert-e
Copy link
Contributor

bert-e commented Feb 17, 2021

Build failed

The build for commit did not succeed in branch w/2.8/improvement/improve-upgrade-robustness.

The following options are set: approve

@bert-e
Copy link
Contributor

bert-e commented Feb 18, 2021

In the queue

The changeset has received all authorizations and has been added to the
relevant queue(s). The queue(s) will be merged in the target development
branch(es) as soon as builds have passed.

The changeset will be merged in:

  • ✔️ development/2.7

  • ✔️ development/2.8

The following branches will NOT be impacted:

  • development/1.0
  • development/1.1
  • development/1.2
  • development/1.3
  • development/2.0
  • development/2.1
  • development/2.2
  • development/2.3
  • development/2.4
  • development/2.5
  • development/2.6

There is no action required on your side. You will be notified here once
the changeset has been merged. In the unlikely event that the changeset
fails permanently on the queue, a member of the admin team will
contact you to help resolve the matter.

IMPORTANT

Please do not attempt to modify this pull request.

  • Any commit you add on the source branch will trigger a new cycle after the
    current queue is merged.
  • Any commit you add on one of the integration branches will be lost.

If you need this pull request to be removed from the queue, please contact a
member of the admin team now.

The following options are set: approve

@bert-e
Copy link
Contributor

bert-e commented Feb 18, 2021

I have successfully merged the changeset of this pull request
into targetted development branches:

  • ✔️ development/2.7

  • ✔️ development/2.8

The following branches have NOT changed:

  • development/1.0
  • development/1.1
  • development/1.2
  • development/1.3
  • development/2.0
  • development/2.1
  • development/2.2
  • development/2.3
  • development/2.4
  • development/2.5
  • development/2.6

Please check the status of the associated issue None.

Goodbye teddyandrieux.

@bert-e bert-e merged commit debe85e into development/2.7 Feb 18, 2021
@bert-e bert-e deleted the improvement/improve-upgrade-robustness branch February 18, 2021 10:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
complexity:medium Something that requires one or few days to fix topic:lifecycle Issues related to upgrade or downgrade of MetalK8s topic:salt Everything related to SaltStack in our product
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants