-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CI flake: podman run with new:pod and static-ip: Error adding network: failed to set bridge addr #7583
Comments
Problem: if either of the two "podman network create" tests fail, all subsequent retries will also fail because the created network has not been cleaned up (so "network create" will fail with EEXIST). Solution: run "podman network rm" as deferred cleanup instead of in each test. This is NOT a fix for containers#7583 - it is just a way to allow ginkgo to retry a failing test. Signed-off-by: Ed Santiago <[email protected]>
I will take a look. |
OK, I don't thing there is anything wrong with the "new:pod and static-ip" command itself. I ran this test for hours on a slow and fast computer without any problems. Looking at the error it seems that podman is trying to set a ip on an existing bridge interface (cni-podman1) which already has a different ip address. You could force this error by doing something like this:
Don't delete this network to force this error.
So the real problem could be that |
Similar in my Podman running on WSL CentOS 8.1 when I try to run Theia container: Which IP the Podman bridge listens? - Must be 0.0.0.0 because WSL 2 V-switch implements NIC only. |
@PavelSosin-320 What command are you using? |
I use on WSL2 Centos 8.1 image. |
@PavelSosin-320 I'm confused as to the question about what IP the CNI bridge listens on - bridges don't listen on any IP, because they're a layer 2 construct. There's an interface on the host in the bridge and another interface for each container to allow communications to the gateway, and then we configure iptables in masquerade mode to NAT the subnet in use on the bridge. It seems like we're not even getting that far, as it's complaining that it's unable to configure the bridge? |
Yes, it is true! I don't see how to configure the bridge. There is the long-lasting discussion about the bridge vs NIC in the WSL 2 VM. The is neither a bridge nor even a fully-functional V-switch in the WSL2. The eth0 in the VM is created on-the-fly and IP address is generated every time when Windows LXSS service is started. The VM itself is a singleton and I see cni-podman0 interface from the CentOS7 distro running Docker side. And vise-verse, I see Docker0 bridge from the CentOS8.1 distro's side running Podman. |
A friendly reminder that this issue had no activity for 30 days. |
@edsantiago @Luap99 Any movement on this issue? |
Still happening, although it looks like the cleanup has worked so it's not triple-failing any more (and hence not causing complete CI-run failures). Logs below are November and October only, I choose to skip the many late-September instances:
|
A friendly reminder that this issue had no activity for 30 days. |
@edsantiago Does this still happen, or was this fixed by #7943 ? |
Looks like #7943 merged on October 7. In addition to the October/November incidents mentioned in a comment above, we have the following since then:
|
A friendly reminder that this issue had no activity for 30 days. |
Only one instance in the last month, and it was a one-off (i.e. didn't cause a CI failure). Guess I'll close and hope for the best.
|
The symptom is:
These are all three-time failures, causing the entire CI job to fail, probably because the first failure leaves things in a bad state for the test reruns -- subsequent attempts fail with:
So, at a minimum, I would suggest fixing the test so as to clean up after itself.
Flake history:
make nixpkgs
#7408The text was updated successfully, but these errors were encountered: