Make encryption-related connectivity tests more reliable #2035

giorio94 · 2023-10-12T14:36:59Z

Please refer to the individual commit messages for additional details.

giorio94 · 2023-10-13T17:00:50Z

Closed and reopened to retrigger tests.

giorio94 · 2023-10-16T11:40:57Z

EKS (tunnel) failure is addressed by #2045
Marking ready for review again

giorio94 · 2023-10-17T09:20:36Z

Converting back to draft while investigating one failure surfaced in https://github.com/cilium/cilium/actions/runs/6544264527/job/17770427183

giorio94 · 2023-10-17T14:22:25Z

E2E and ipsec tests completed correctly in cilium/cilium#28635

giorio94 · 2023-10-17T14:23:20Z

Back in review state

brb

Nice! Small nit.

connectivity/tests/encryption.go

brb · 2023-10-18T08:14:42Z

The EKS failure looks legit - https://github.com/cilium/cilium-cli/actions/runs/6550230402/job/17788873914?pr=2035.

giorio94 · 2023-10-18T08:16:40Z

The EKS failure looks legit - https://github.com/cilium/cilium-cli/actions/runs/6550230402/job/17788873914?pr=2035.

Yes, it is addressed by #2045 (that's why this PR is still blocked). Essentially node to pod connectivity is broken due to the leftover iptables chains, and apparently we are not testing that path anywhere else.

Currently, we always create the host-netns daemonset in a single cluster only, even when clustermesh is enabled. Yet, this can cause issues when connectivity tests depend on its presence, such as node-to-node encryption. Hence, let's adapt the logic to create it in both clusters. Signed-off-by: Marco Iorio <[email protected]>

When running in tunnel mode, we currently validate that pod to pod connectivity does not leak any unencrypted packet (if either IPSec or WireGuard are enabled) by constructing a tcpdump filter that uses as source IP address the one of the interface the traffic is routed to, instead of the source IP of the client. According to the commit message of f512df4, this was required to correctly test node to node connectivity, which leverages the same logic. Yet, this breaks the check for pod to pod traffic, which does not get masquerated. Indeed, the current check would not fail when forcefully run on a cluster with encryption disabled. Let's instead modify the filter, so that we 'OR' together both the original client IP and the one of the outgoing interface, accounting for both scenarios. Signed-off-by: Marco Iorio <[email protected]>

When running in ENI mode, the outgoing interface of pod originating traffic is different from the one that would be used by host originating traffic towards the same destination. This breaks the current pod-to-pod encryption validation, as the source interface for the tcpdump filter is determined based on the routes towards the given destination only. Let's update the source interface determination to additionally consider the source address. This approach had been initially suggested by Paul Chaignon in cilium#1241, but then reverted in [1] because `ip route get` returns and error in case the `from` address is not assigned to any local interface. We can work around this by specifying an input interface: let's use lo as it should be always present. [1]: 2fc0835 ("connectivity: Fix iface derivation in encrypt tests") Signed-off-by: Marco Iorio <[email protected]>

When both tunneling and host firewall is enabled, traffic from a pod to a remote node gets forwarded through the tunnel to preserve the source identity. This breaks the current node-to-node encryption tests, though, which expect the packets to be seen on the native device. Let's adapt the source interface determination to override the detected device in this situation, and always sniff on the tunnel interface. Signed-off-by: Marco Iorio <[email protected]>

Currently, encryption checks are run only when encryption (either IPSec or WireGuard) is enabled. While this approach is sensible, it also led to cases in which the tests were broken in certain circumstances, and we didn't notice [1,2,3,4]. Let's instead always run these tests, asserting that unencrypted packets are either seen or not based on the feature set. Hence, we can immediately notice if there's a regression, and no unencrypted packets are detected even if encryption is not enabled. [1]: f512df4 ("connectivity: fix node-to-node encryption tests") [2]: HEAD~3 ("connectivity: fix pod-to-pod encryption validation") [3]: HEAD~2 ("connectivity: fix encryption validation when running in ENI mode") [4]: HEAD~1 ("connectivity: fix encryption validation when host firewall is enabled") Signed-off-by: Marco Iorio <[email protected]>

Currently, the encryption tests validate that no unencrypted packets are leaked by capturing them on the source only. Since we can capture only egressing packets (as the XFRM stack recirculates the ingressing ones during decryption), this means that we are checking only one direction. Let's extend this check to validate both directions, starting a tcpdump capture both on the source and destination hosts. We leverage the bidirectional validation for the pod-to-pod encryption case though, as with the node-to-node one it is particularly tricky to construct the correct filter without additional information (as packets might be masquerated at the source, and in that case they should be sniffed on a different interface). Signed-off-by: Marco Iorio <[email protected]>

Disable the encryption tests when running the connectivity tests on a single node cluster, as they require that source and destination are hosted on different nodes. Signed-off-by: Marco Iorio <[email protected]>

giorio94 · 2023-10-19T07:36:34Z

Kind / installation-and-connectivity (helm) hit #1894

giorio94 requested review from a team as code owners October 12, 2023 14:37

giorio94 requested review from brb and nebril October 12, 2023 14:37

giorio94 temporarily deployed to ci October 12, 2023 14:37 — with GitHub Actions Inactive

giorio94 marked this pull request as draft October 12, 2023 14:53

giorio94 force-pushed the mio/connectivity-encryption branch from 27108e5 to 2ee4044 Compare October 12, 2023 16:19

giorio94 temporarily deployed to ci October 12, 2023 16:19 — with GitHub Actions Inactive

giorio94 force-pushed the mio/connectivity-encryption branch from 2ee4044 to 16e2425 Compare October 13, 2023 12:46

giorio94 temporarily deployed to ci October 13, 2023 12:46 — with GitHub Actions Inactive

giorio94 temporarily deployed to ci October 13, 2023 13:27 — with GitHub Actions Inactive

giorio94 had a problem deploying to ci October 13, 2023 16:10 — with GitHub Actions Failure

giorio94 force-pushed the mio/connectivity-encryption branch from 5bb2293 to 51d7a88 Compare October 13, 2023 16:11

giorio94 had a problem deploying to ci October 13, 2023 16:11 — with GitHub Actions Failure

giorio94 had a problem deploying to ci October 13, 2023 16:30 — with GitHub Actions Failure

giorio94 temporarily deployed to ci October 13, 2023 16:38 — with GitHub Actions Inactive

giorio94 closed this Oct 13, 2023

giorio94 reopened this Oct 13, 2023

giorio94 temporarily deployed to ci October 13, 2023 17:00 — with GitHub Actions Inactive

giorio94 mentioned this pull request Oct 16, 2023

gha: cleanup stale AWS-CNI iptables rules in EKS tunnel workflow #2045

Merged

giorio94 force-pushed the mio/connectivity-encryption branch from 51d7a88 to 33b2591 Compare October 16, 2023 06:28

giorio94 temporarily deployed to ci October 16, 2023 06:28 — with GitHub Actions Inactive

giorio94 force-pushed the mio/connectivity-encryption branch from 33b2591 to 98147cf Compare October 16, 2023 06:49

giorio94 temporarily deployed to ci October 16, 2023 06:49 — with GitHub Actions Inactive

giorio94 marked this pull request as ready for review October 16, 2023 11:40

giorio94 added the dont-merge/blocked Another PR must be merged before this one. label Oct 16, 2023

giorio94 force-pushed the mio/connectivity-encryption branch from 98147cf to 48a0d80 Compare October 17, 2023 10:50

giorio94 temporarily deployed to ci October 17, 2023 10:50 — with GitHub Actions Inactive

nebril approved these changes Oct 17, 2023

View reviewed changes

brb approved these changes Oct 17, 2023

View reviewed changes

connectivity/tests/encryption.go Outdated Show resolved Hide resolved

giorio94 force-pushed the mio/connectivity-encryption branch from 48a0d80 to 472531f Compare October 17, 2023 16:39

giorio94 temporarily deployed to ci October 17, 2023 16:39 — with GitHub Actions Inactive

giorio94 added 7 commits October 18, 2023 19:38

connectivity: don't run encryption tests on single node

e2431dc

Disable the encryption tests when running the connectivity tests on a single node cluster, as they require that source and destination are hosted on different nodes. Signed-off-by: Marco Iorio <[email protected]>

giorio94 force-pushed the mio/connectivity-encryption branch from 472531f to e2431dc Compare October 18, 2023 17:38

giorio94 temporarily deployed to ci October 18, 2023 17:38 — with GitHub Actions Inactive

giorio94 removed the dont-merge/blocked Another PR must be merged before this one. label Oct 18, 2023

brb merged commit 75c850c into cilium:main Oct 19, 2023
19 checks passed

giorio94 deleted the mio/connectivity-encryption branch November 16, 2023 10:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make encryption-related connectivity tests more reliable #2035

Make encryption-related connectivity tests more reliable #2035

giorio94 commented Oct 12, 2023

giorio94 commented Oct 13, 2023

giorio94 commented Oct 16, 2023

giorio94 commented Oct 17, 2023

giorio94 commented Oct 17, 2023

giorio94 commented Oct 17, 2023

brb left a comment

brb commented Oct 18, 2023

giorio94 commented Oct 18, 2023 •

edited

Loading

giorio94 commented Oct 19, 2023

Make encryption-related connectivity tests more reliable #2035

Make encryption-related connectivity tests more reliable #2035

Conversation

giorio94 commented Oct 12, 2023

giorio94 commented Oct 13, 2023

giorio94 commented Oct 16, 2023

giorio94 commented Oct 17, 2023

giorio94 commented Oct 17, 2023

giorio94 commented Oct 17, 2023

brb left a comment

Choose a reason for hiding this comment

brb commented Oct 18, 2023

giorio94 commented Oct 18, 2023 • edited Loading

giorio94 commented Oct 19, 2023

giorio94 commented Oct 18, 2023 •

edited

Loading