Integration test for cni-repair-controller #316
Conversation
Force-pushed from `9fbaa2e` to `e4fb23b`
(Note this will fail until linkerd/linkerd2#11699 lands)

The `integration-cni-plugin.yml` workflow (formerly known as `cni-plugin-integration.yml`) has been expanded to run the new recipe `cni-repair-controller-integration`, which performs the following steps:

- Rebuilds the `linkerd-cni-repair-controller` crate and `cni-plugin`
- Creates a new cluster at version `v1.27.6-k3s1` (the version required for Calico to work)
- Triggers a new `./cni-repair-controller/integration/run.sh` script, which:
  - Installs Calico
  - Installs the latest linkerd-edge CLI
  - Installs `linkerd-cni` and waits for it to become ready
  - Installs the linkerd control plane in CNI mode
  - Installs a `pause` DaemonSet

The `linkerd-cni` instance has been configured to include an extra initContainer that delays its start by 15s. Since we waited for it to become ready, this doesn't affect the initial install. But then a new node is added to the cluster, and this delay allows the new `pause` DaemonSet replica to start before the full CNI config is ready, so we can observe its failure to come up. Once the new `linkerd-cni` replica becomes ready, we observe how the failed `pause` replica is replaced by a new healthy one. A rough sketch of this added-node phase appears below.
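For illustration, here is a minimal sketch of what that added-node phase of such a script might look like. The cluster name, node name, and label selectors are hypothetical, not taken from the actual `run.sh`, and it assumes a k3d cluster and a `pause` DaemonSet labeled `app=pause`:

```sh
#!/usr/bin/env bash
set -euo pipefail

cluster=cni-test  # hypothetical cluster name

# Add a second node; the 15s sleep initContainer in linkerd-cni means the
# pause pod scheduled here starts before the CNI config is fully written.
k3d node create node2 --cluster "$cluster"

# The new pause replica should fail to come up (no network sandbox yet).
kubectl wait pod --for=condition=Ready=false --timeout=60s \
  --selector app=pause --field-selector spec.nodeName=k3d-node2-0

# Once the new linkerd-cni replica is ready, the repair controller should
# delete the broken pod so the DaemonSet replaces it with a healthy one.
kubectl wait pod --namespace linkerd-cni --for=condition=Ready --timeout=120s \
  --selector k8s-app=linkerd-cni
kubectl wait pod --for=condition=Ready --timeout=120s \
  --selector app=pause --field-selector spec.nodeName=k3d-node2-0
```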
Force-pushed from `e4fb23b` to `89c3415`
.dockerignore (Outdated)

```diff
@@ -1,2 +1 @@
 rust-toolchain
-target/
```
Was this change accidental? Or do we need the target dir in the context when building an image?
Good catch. This was a leftover from an iteration where the binary wasn't built inside the same Dockerfile.
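For context, building the binary inside the Dockerfile typically means a multi-stage build along these lines, so the host's `target/` directory doesn't need to be in the build context at all. This is a sketch, not the actual Dockerfile in this PR; the base images and paths are assumptions:

```dockerfile
# Build stage: compile the crate inside the image, so the host's target/
# directory is irrelevant to the build context.
FROM rust:1.74 AS build
WORKDIR /src
COPY . .
RUN cargo build --release --package linkerd-cni-repair-controller

# Runtime stage: carry over only the resulting binary.
FROM debian:bookworm-slim
COPY --from=build /src/target/release/linkerd-cni-repair-controller /usr/local/bin/
ENTRYPOINT ["/usr/local/bin/linkerd-cni-repair-controller"]
```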
```yaml
# the full CNI config is ready and enter a failure mode
extraInitContainers:
- name: sleep
  image: busybox
```
on the nitpicky side, the CNI plugin runs alpine, can we re-use the same image so we don't pull busybox in tests?
Good thinking 👍
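The agreed change would presumably look something like this in the test values. This is a sketch: the image tag and the `command` field are assumptions, since the diff excerpt above does not show them:

```yaml
extraInitContainers:
- name: sleep
  # Reuse the alpine image the CNI plugin already runs on,
  # rather than pulling busybox just for the test.
  image: alpine:3.18
  command: ["sleep", "15"]
```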