Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Calico update and typha #7528

Merged
merged 3 commits into from
Sep 10, 2019

Conversation

mikesplain
Copy link
Contributor

@mikesplain mikesplain commented Sep 7, 2019

After #7517 I noticed much of our calico is out of sync with upstream so I thought it may be worth reformatting to match upstream adding back our customizations. I know @gjtempleton has been working on adding Typha however he was facing rebasing issues, so I pulled them in here while upgrading us. I'm doing more testing on this but I think we should target 1.15 for this and possibly 1.14 depending on testing time.

Key points on this pr:

  • Add Typha (thanks @gjtempleton for taking care of the hard part)
  • Updates to calico 3.8.2
  • Replaces our format of calico for the official calico one which should allow updating easier in the future.

Todo:

  • Test upgrade from master to this pr
  • Test upgrade from 1.13
  • Test upgrade from 1.14

/hold
Fixes #7181
Fixes #7158

@mikesplain mikesplain added this to the 1.15 milestone Sep 7, 2019
@k8s-ci-robot k8s-ci-robot added do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. approved Indicates a PR has been approved by an approver from all required OWNERS files. labels Sep 7, 2019
@mikesplain mikesplain modified the milestones: 1.15, 1.14 Sep 7, 2019
Copy link
Member

@gjtempleton gjtempleton left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for doing the hard work of picking this up and progressing it! The original got a bit forgotten about.

Just a couple of comments from me. W.r.t. the Typha enabling on a running cluster not automatically triggering a restart of the pod my inclination would be to leave the restarts to the user. I don't think we previously saw any issues with restarting pods where the only change was Typha config on our clusters, however this was ~Calico 3.2.

I'm away having some solid type 2 fun on a bike this week so I'm unfortunately not able to test my theories out until next weekend.

@mikesplain mikesplain force-pushed the calico_update_and_typha branch from 3c19662 to 8407565 Compare September 9, 2019 00:22
gjtempleton and others added 3 commits September 9, 2019 10:00
Adds the ability to configure the number of Typha replicas when using Calico CNI in 1.12+
to limit the impact of Calico on the APIServer and increase the scalability of the cluster.

Also adds the ability to configure Typha's Prometheus config.

Add Passing TyphaReplicas Validation Test
@mikesplain mikesplain force-pushed the calico_update_and_typha branch from 8407565 to b76d9b1 Compare September 9, 2019 14:00
@mikesplain
Copy link
Contributor Author

/retest

@mikesplain
Copy link
Contributor Author

Testing complete, works as expected at the moment.

@mikesplain
Copy link
Contributor Author

/hold cancel

@k8s-ci-robot k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Sep 9, 2019
@mikesplain
Copy link
Contributor Author

Once this is lgtmed, I'll cherry pick to the 1.15 branch and work on a backport to 1.14.

@justinsb
Copy link
Member

Thanks @mikesplain - looks great

/approve
/lgtm

Not sure if we should cherry-pick this or #7517 to 1.14 ... I'd like to try to get both 1.14 and 1.15 out soonish, and I'm just not sure how broken we are if we just taken #7517 vs this (more complete) fix?

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 10, 2019
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: justinsb, mikesplain

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:
  • OWNERS [justinsb,mikesplain]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot merged commit a7eb9a2 into kubernetes:master Sep 10, 2019
@mikesplain mikesplain deleted the calico_update_and_typha branch September 10, 2019 13:29
@mikesplain
Copy link
Contributor Author

Thanks @justinsb, I actually forgot I opened #7527 so we should be fine on 1.14. I opened up #7553 to get this into 1.15. Thanks!

k8s-ci-robot added a commit that referenced this pull request Sep 10, 2019
…28-origin-release-1.15

Automated cherry pick of #7528: Add ability to configure number of Typha Replicas
Comment on lines -626 to -637
# Set to enable the experimental Prometheus metrics server
- name: FELIX_PROMETHEUSMETRICSENABLED
value: "{{- or .Networking.Calico.PrometheusMetricsEnabled "false" }}"
# TCP port that the Prometheus metrics server should bind to
- name: FELIX_PROMETHEUSMETRICSPORT
value: "{{- or .Networking.Calico.PrometheusMetricsPort "9091" }}"
# Enable Prometheus Go runtime metrics collection
- name: FELIX_PROMETHEUSGOMETRICSENABLED
value: "{{- or .Networking.Calico.PrometheusGoMetricsEnabled "true" }}"
# Enable Prometheus process metrics collection
- name: FELIX_PROMETHEUSPROCESSMETRICSENABLED
value: "{{- or .Networking.Calico.PrometheusProcessMetricsEnabled "true" }}"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Was removing these intentional? They still seem to be valid according to the calico docs.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Ability to Configure Typha for Calico CNI
5 participants