Add API spec for FQDN selector #200
Conversation
Gave this a quick pass and I'll give it another one this week. Thanks @rahulkjoshi!
/ok-to-test
PR needs rebase.
Force-pushed from 658a1bb to 33b7833.
LGTM
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: Dyanngg, rahulkjoshi.
Just a few minor questions, otherwise great job @rahulkjoshi!
record](https://kubernetes.io/docs/concepts/services-networking/dns-pod-service/#services)
because the generated DNS records contain a list of all the Pod IPs that
back the service.
* ❌ Not Supported:
Just to add another gotcha: we had a user who used DNS load balancing to load balance to a set of external IPs, each in a separate k8s cluster. They then tried to use DNS policy with that; it would work when a pod connected to another cluster but not the local cluster because kube-proxy was short-circuiting the load balancer and handling the external IP in-cluster.
User thought they were using an external load balancer but that load balancer used external IPs under the covers. Very confusing!
I love these stories :D
So did the problem happen because DNS matching happened after service load balancing? In other words, would matching network policy before service resolution solve this problem?
Perhaps, but then it'd be very hard to interleave normal and DNS policy in anything that resembles iptables. The DNS policy would happen "first" in order to see the pre-DNAT IPs but what if a higher priority normal policy should have dropped that traffic? BPF dataplanes that do policy and NAT can keep the pre-DNAT information around to match on at the policy stage.
This is similar to a discussion we had about the egress CIDRs @tssurya
I forget if we documented the decision, but I think we concluded that trying to select traffic heading to service VIPs or service LoadBalancer IPs is not guaranteed to work. I think that same restriction can be applied here.
// "*" matches 0 or more DNS valid characters (except for "."), and may | ||
// occur anywhere in the pattern. | ||
// Examples: | ||
// - `*.kubernetes.io` matches subdomains of kubernetes.io at that level. |
I asked my colleagues on our solutions team for examples of matching multiple domain segments; we do have customers that use `*` to match multiple parts of a prefix. Some examples they were able to share:

- `<team>.<platform>.example.com` matched with `*.example.com`
- `<sessionID>.<session>.<random>.<region>.cache.amazonaws.com` matched with `*.cache.amazonaws.com`

I think the AWS one would be `*.*.*.*.cache.amazonaws.com` in the proposed spec, which feels a bit messy.
Very nice examples, thank you for gathering them!

I am getting more convinced every day that `*` should match any number of labels, unless someone can point out security/performance/etc. concerns with doing that.
I'm ok with that -- I can see CDNs getting pretty deeply nested, so having `*` match multiple labels should be ok.

I don't imagine this should have any performance implications. Security-wise, you sacrifice some explicitness. But on the other hand, when you allow `*.domain.com`, you've already indicated you trust the owner of `domain.com`.
Perhaps this can be useful, if only for some additional perspectives.

In cilium/cilium#22081, I've been discussing this topic with maintainers of Cilium (with regard to `CiliumNetworkPolicy`). Though still not decided or implemented, we've been leaning towards `*` for single-level subdomain matching and `**` for multi-level subdomain matching.

Personally I don't see any problem with `*` matching multiple subdomains, as you have proposed here. It's more intuitive 👍
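To make the two interpretations concrete, here is a small Go sketch (an editor's illustration, not part of the proposal) that compiles a wildcard pattern into a regular expression either way: `*` matching exactly one label versus `*` matching one or more labels.

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// compilePattern turns a domain pattern such as "*.cache.amazonaws.com" into
// a regexp. If multiLabel is true, "*" matches one or more labels; otherwise
// it matches exactly one label. Illustrative only, not the proposed semantics.
func compilePattern(pattern string, multiLabel bool) *regexp.Regexp {
	label := `[a-z0-9]([-a-z0-9]*[a-z0-9])?`
	wildcard := label
	if multiLabel {
		wildcard = label + `(\.` + label + `)*`
	}
	parts := strings.Split(pattern, ".")
	for i, p := range parts {
		if p == "*" {
			parts[i] = wildcard
		} else {
			parts[i] = regexp.QuoteMeta(p)
		}
	}
	return regexp.MustCompile(`^` + strings.Join(parts, `\.`) + `$`)
}

func main() {
	name := "sess123.abc.xyz.us-east-1.cache.amazonaws.com"
	single := compilePattern("*.cache.amazonaws.com", false)
	multi := compilePattern("*.cache.amazonaws.com", true)
	fmt.Println(single.MatchString(name)) // false: "*" covers only one label
	fmt.Println(multi.MatchString(name))  // true: "*" spans multiple labels
}
```

With the multi-label reading, `*.cache.amazonaws.com` covers the deeply nested AWS example above without resorting to `*.*.*.*.cache.amazonaws.com`.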
Force-pushed from f28f400 to 54f2331.
// +optional
// +listType=set
// +kubebuilder:validation:MinItems=1
Domains []Domain `json:"domains,omitempty"`
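For context, here is a minimal sketch of how the quoted field might sit in a surrounding peer type. The `FQDNPeer` struct name, the `Domain` type definition, and the validation pattern below are illustrative assumptions, not the proposed spec.

```go
// Domain is a DNS name or wildcard pattern. The validation regex here is an
// assumed placeholder, not the one from the proposal.
// +kubebuilder:validation:Pattern=`^(\*|[a-z0-9]([-a-z0-9]*[a-z0-9])?)(\.(\*|[a-z0-9]([-a-z0-9]*[a-z0-9])?))+$`
type Domain string

// FQDNPeer (hypothetical name) selects egress traffic by the domain names
// the destination IPs were resolved from.
type FQDNPeer struct {
	// Domains is the list of DNS names or wildcard patterns to match.
	// +optional
	// +listType=set
	// +kubebuilder:validation:MinItems=1
	Domains []Domain `json:"domains,omitempty"`
}
```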
In the meeting @npinaeva proposed `DomainName` here; regardless, Antonio suggested we always reference an RFC for decisions like this.
https://datatracker.ietf.org/doc/html/rfc1034 S3.1 is the relevant one:

> A domain is identified by a domain name, and consists of that part of
> the domain name space that is at or below the domain name which
> specifies the domain.

So a Domain is a subtree of the domain name space, whereas a Domain Name is the textual name of one particular node in that tree. Not sure that gives a clear answer to our naming dilemma though! `*` patterns represent subtrees, while FQDNs represent single nodes in the tree. Perhaps `DomainNamePatterns` conveys exactly what our field represents (but I think `Domains` and `DomainNames` are both defensible/attackable depending on how closely you read the RFC 😆)?

The "." suffix is also explained in that section. A "." at the end of a domain name means it is absolute, whereas no "." means it is relative to the local domain and the DNS resolver should add suffixes as needed.
I think a domain name with `*` is still considered a domain name. I read the RFC as "a domain name is a string; a domain is an internal structure of the domain name space".

If we read closer to the wildcards definition in S4.3.3, it says:

> In the previous algorithm, special treatment was given to RRs with owner
> names starting with the label "*".

and S3.6 defines:

> When we talk about a specific RR, we assume it has the following:
> owner - which is the domain name where the RR is found.

So the owner name is the domain name, and the owner name may start with `*`, which makes it a valid domain name?

Reading RFCs is fun :D
They do say that the owner is a domain (!) but they also say:

> If any match, copy them into the answer
> section, but set the owner of the RR to be QNAME, and
> not the node with the "*" label.

i.e. don't send `*` on the wire, and:

> The owner name of the wildcard RRs is of
> the form "*.<anydomain>", where <anydomain> is any domain name.
> <anydomain> should not contain other * labels

which implicitly says that a wildcard is not a domain, so it's a bit contradictory!

I think the wildcard comes from implementation concerns: they're really talking about the configuration for a DNS server (and probably a particular one at that) and saying "you'll need wildcards in your implementation to handle mail, and they should work like this to avoid these problems".
1. Each implementation will provide guidance on which DNS name-server is
   considered authoritative for resolving domain names. This could be the
   `kube-dns` Service or potentially some other DNS provider specified in the
   implementation's configuration.
Does it mean that if implementation X considers kube-dns authoritative, but the Pod uses a different nameserver, the policy will not apply?
As long as the two nameservers return the same IPs for domains, it's not a problem. But if kube-dns says the IP is `1.2.3.4`, the implementation only allows `1.2.3.4`. If the pod's nameserver then says `5.6.7.8`, then traffic will be denied.
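A minimal sketch of what that implies for an implementation, assuming purely for illustration that it resolves through a fixed kube-dns address (the ClusterIP below is a made-up example) and only allows the answers it sees:

```go
package main

import (
	"context"
	"fmt"
	"net"
	"time"
)

func main() {
	// Resolve only through the nameserver the implementation treats as
	// authoritative. The kube-dns ClusterIP below is a made-up example.
	kubeDNS := "10.96.0.10:53"
	r := &net.Resolver{
		PreferGo: true,
		Dial: func(ctx context.Context, network, _ string) (net.Conn, error) {
			d := net.Dialer{Timeout: 2 * time.Second}
			return d.DialContext(ctx, network, kubeDNS)
		},
	}

	ips, err := r.LookupHost(context.Background(), "kubernetes.io")
	if err != nil {
		panic(err)
	}

	// Only these answers would be programmed into the dataplane. If the pod's
	// own resolver hands back different addresses, that traffic stays denied.
	allowed := map[string]bool{}
	for _, ip := range ips {
		allowed[ip] = true
	}
	fmt.Println(allowed)
}
```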
1. FQDN policies should not affect the ability of workloads to resolve domains,
   only their ability to communicate with the IP backing them.
   * For example, if a policy allows traffic to `kubernetes.io`, any selected
     Pods can still resolve `wikipedia.org` or
     `my-services.default.svc.cluster.local`, but can not send traffic to them
     unless allowed by a different rule.
I kind of infer that this is based on some implementation details, but it's hard to get from the text why this will happen... Does "ability of workloads to resolve domains" mean filtering DNS traffic or DNS packets? Why should the policy filter things it is not supposed to? How is this different from any other traffic?
I'm trying to clarify that this spec is explicitly not doing DNS filtering. So even though the policy says "allow traffic to kubernetes.io", you can still resolve any domains you like.
I can add a clarification explicitly saying that FQDN policies will not perform DNS Filtering.
* Pods are expected to make a DNS query for a domain before sending traffic
  to it. If the Pod fails to send a DNS request and instead just sends
  traffic to the IP (either because of caching or a static config), traffic
  is not guaranteed to flow.
Is this implementable? This means the implementation has to cache all possible IPs of all the possible domains, no?
* Pods should respect the TTL of DNS records they receive. Trying to
  establish new connection using DNS records that are expired is not
  guaranteed to work.
  * When the TTL for a DNS record expires, the implementor should stop
What TTL is this referring to here? The TTL of the DNS response that the Pod has received?

Does it imply that the implementation always has to sniff the traffic or access the nameserver logs?
OK, I see the table below. Good work with that table, really intuitive.
Yes, we've pretty much agreed that the only way to correctly implement this is via a proxy that listens to DNS responses.
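A rough sketch of the bookkeeping such a proxy might do once it has parsed an answer out of a snooped DNS response; the `record` struct and all values here are stand-ins, not types from any real DNS library.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// record is a stand-in for one A-record answer parsed out of a snooped DNS
// response; it is not a type from any real DNS library.
type record struct {
	Name string
	IP   string
	TTL  time.Duration
}

// allowSet tracks which IPs are currently allowed for a policy's domains,
// expiring each entry when the TTL observed in the DNS response runs out.
type allowSet struct {
	mu      sync.Mutex
	entries map[string]time.Time // IP -> expiry
}

func newAllowSet() *allowSet {
	return &allowSet{entries: map[string]time.Time{}}
}

// observe is called for every answer the proxy sees that matches a policy
// domain; it (re)arms the expiry for that IP.
func (a *allowSet) observe(r record) {
	a.mu.Lock()
	defer a.mu.Unlock()
	a.entries[r.IP] = time.Now().Add(r.TTL)
}

// allowed reports whether traffic to ip should currently be permitted.
func (a *allowSet) allowed(ip string) bool {
	a.mu.Lock()
	defer a.mu.Unlock()
	expiry, ok := a.entries[ip]
	return ok && time.Now().Before(expiry)
}

func main() {
	s := newAllowSet()
	s.observe(record{Name: "kubernetes.io", IP: "1.2.3.4", TTL: 30 * time.Second})
	fmt.Println(s.allowed("1.2.3.4")) // true while the observed TTL is live
	fmt.Println(s.allowed("5.6.7.8")) // false: never seen in a DNS response
}
```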
The implementer can still deny traffic to `1.2.3.4` because no single
response contained the full chain required to resolve the domain.

## Alternatives
Another alternative is to implement DNS filtering, as Cisco Umbrella (https://docs.umbrella.com/deployment-umbrella/docs/point-your-dns-to-cisco) or DNSFilter do...
/lgtm
Issue #133
Provides the proposed API spec as well as some expected behaviors for both application Pods and ANP implementors to follow.