Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

tf-elastx_ubuntu18-calico continuously is failed #7811

Closed
oomichi opened this issue Jul 19, 2021 · 5 comments
Closed

tf-elastx_ubuntu18-calico continuously is failed #7811

oomichi opened this issue Jul 19, 2021 · 5 comments
Labels
kind/failing-test Categorizes issue or PR as related to a consistently or frequently failing test.

Comments

@oomichi
Copy link
Contributor

oomichi commented Jul 19, 2021

Which jobs are failing:

tf-elastx_ubuntu18-calico

Which test(s) are failing:

https://gitlab.com/kargo-ci/kubernetes-sigs-kubespray/-/jobs/1436259424

Apply complete! Resources: 21 added, 0 changed, 0 destroyed.
Outputs:
bastion_fips = []
floating_network_id = "600b8501-78cb-4155-9c9f-23dfcba88828"
k8s_master_fips = tolist([
  "185.141.30.185",
])
k8s_node_fips = tolist([
  "185.141.30.18",
])
private_subnet_id = "7d1f958d-7f6c-466d-9294-ac5e1ce863d6"
router_id = " ab95917c-41fb-4881-b507-3a6dfe9403df "
/builds/kargo-ci/kubernetes-sigs-kubespray
+ cd -
+ ansible-playbook tests/cloud_playbooks/wait-for-ssh.yml
[WARNING]: Invalid characters were found in group names but not replaced, use
-vvvv to see details
PLAY [all] *********************************************************************
TASK [Wait until SSH is available] *********************************************
Monday 19 July 2021  20:38:10 +0000 (0:00:00.028)       0:00:00.028 *********** 
fatal: [1436259424-k8s-node-1]: FAILED! => {"changed": false, "elapsed": 240, "msg": "Timeout when waiting for 185.141.30.18:22"}
fatal: [1436259424-k8s-master-1]: FAILED! => {"changed": false, "elapsed": 240, "msg": "Timeout when waiting for 185.141.30.185:22"}

Since when has it been failing:

Testgrid link:

Reason for failure:

Anything else we need to know:

@oomichi oomichi added the kind/failing-test Categorizes issue or PR as related to a consistently or frequently failing test. label Jul 19, 2021
@oomichi
Copy link
Contributor Author

oomichi commented Jul 19, 2021

The test job creates a control-plane node and a worker node on OpenStack cloud with terraform:

$ cd tests
$ make create-tf -s
$ cd ..
$ terraform apply -auto-approve -parallelism=1 contrib/terraform/openstack

then checks ssh-connectivity with tests/cloud_playbooks/wait-for-ssh.yml
Then a timeout happened on the ssh-connectivity check.

@floryut
Copy link
Member

floryut commented Jul 20, 2021

Elastx can sometimes be pretty flaky.. but not sure if we can do anything except ping guys

@oomichi
Copy link
Contributor Author

oomichi commented Jul 20, 2021

Elastx can sometimes be pretty flaky.. but not sure if we can do anything except ping guys

Yeah, that is true.
How about disabling the test job temporary?

@oomichi
Copy link
Contributor Author

oomichi commented Jul 27, 2021

Now we don't face this issue.
Let's close this.

/close

@k8s-ci-robot
Copy link
Contributor

@oomichi: Closing this issue.

In response to this:

Now we don't face this issue.
Let's close this.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/failing-test Categorizes issue or PR as related to a consistently or frequently failing test.
Projects
None yet
Development

No branches or pull requests

3 participants