Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Retriable errors must not populate fatal error message in status #285

Open
prajnutanix opened this issue May 17, 2023 · 2 comments
Open

Comments

@prajnutanix
Copy link

/kind bug

What steps did you take and what happened:

This is during normal capx cluster deployment flow. I saw nutanixcluster object's status contain a fatal error for a retriable error scenario:
" Get "https://10.45.92.5:9440/api/nutanix/v3/users/me": dial tcp 10.45.92.5:9440: i/o timeout" . This went away after few minutes, and cluster deployment was a success.

What did you expect to happen:
Errors which upon retry also cannot be fixed (like password incorrect or connecting with PC endpoint returns http 404 etc), must populate FatalError message in status. Other retriable errors must not populate this field.

Anything else you would like to add:

Miscellaneous information that will assist in solving the issue.

Environment:

  • Cluster-api-provider-nutanix version: v1.2.1

  • Kubernetes version: (use kubectl version):
    Client Version: v1.25.3
    Kustomize Version: v4.5.7
    Server Version: v1.25.3

  • OS (e.g. from /etc/os-release): CentOS 7

@nutanix-cn-prow-bot
Copy link

@prajnutanix: The label(s) kind/bug cannot be applied, because the repository doesn't have them.

In response to this:

/kind bug

What steps did you take and what happened:

This is during normal capx cluster deployment flow. I saw nutanixcluster object's status contain a fatal error for a retriable error scenario:
" Get "https://10.45.92.5:9440/api/nutanix/v3/users/me": dial tcp 10.45.92.5:9440: i/o timeout" . This went away after few minutes, and cluster deployment was a success.

What did you expect to happen:
Errors which upon retry also cannot be fixed (like password incorrect or connecting with PC endpoint returns http 404 etc), must populate FatalError message in status. Other retriable errors must not populate this field.

Anything else you would like to add:

Miscellaneous information that will assist in solving the issue.

Environment:

  • Cluster-api-provider-nutanix version: v1.2.1

  • Kubernetes version: (use kubectl version):
    Client Version: v1.25.3
    Kustomize Version: v4.5.7
    Server Version: v1.25.3

  • OS (e.g. from /etc/os-release): CentOS 7

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@tuxtof
Copy link
Contributor

tuxtof commented Aug 28, 2023

@prajnutanix normally fix in recent release
can you retest and confirm
thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants