-
Notifications
You must be signed in to change notification settings - Fork 748
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CNI not removing network built on a node after IP is lost externally and IPAMD reconciles this state #2834
Comments
@AbeOwlu what is this "external event" that reclaims an IP on an ENI? Only the IPAM daemon should be assigning and unassigning IPs to an ENI. Before calling the EC2 API to unassign IPs, it removes those IPs from the datastore. That precondition is required to avoid this exact scenario |
There's an automation pipeline that's incorrectly, (I might add) seeing a drift in the VPC network and unassigns an IP from an EC2 instance at the moment.
Will update with more details and logs... |
I think I hit this issue too. Let me circle back with some more info |
We had this issue. aws/amazon-vpc-resource-controller-k8s#412 which deleted branch ENI from pods. CNI didn't do anything about the missing network interface or lost IP address |
@AbeOwlu - CNI will not remove any interface that doesn't manage. For any external changes introduced to the interfaces that CNI manages, if they are not in use, it will garbage collect them. If it didn't happen, and you can reproduce this as bug, let us know. Otherwise, we can close this ticket. |
This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 14 days |
Issue closed due to inactivity. |
IPAM reconciliation:
Scenario;
10.0.2.99
10.0.2.99
remains unchanged on the local node however, other node peers are no longer able to reach this pod on10.0.2.99
of its host nodes, it is reachable from this local host and kubernetes liveness probes are succeeding - keeping an unhealthy pod in the clusterWhat you expected to happen:
"UnAssignPodIPAddress: Unassign IP 10.0.2.99 from sandbox aws-cni/7f9240...
the CNI is triggered to tear down the network route with this IP, and liveness probe may eventually fail and attempt to heal this pod.How to reproduce it (as minimally and precisely as possible):
Anything else we need to know?:
nodeIPPoolReconcile
process, should the CNI be invoked to updateHostNetwork for the removed IPs?Environment:
kubectl version
):cat /etc/os-release
):uname -a
):The text was updated successfully, but these errors were encountered: