Unable to deploy driver - Failed getting project and zone #490

Closed

prashantokochavara opened this issue Apr 17, 2020 · 23 comments
Labels: lifecycle/rotten (denotes an issue or PR that has aged beyond stale and will be auto-closed)

@prashantokochavara

I've double-checked all credentials, but I'm not sure why I keep hitting this issue with any version I deploy, stable or alpha.
[screenshot of the driver pod logs: 2020-04-17_12-33-44]

Any idea what could be going wrong here?

@msau42 (Contributor) commented Apr 17, 2020

Which version of the driver are you using?

Also, the error message is cut off; could you paste the entire "Failed to get cloud provider" error?

@msau42 (Contributor) commented Apr 17, 2020

I see you are using v0.7.0.

@prashantokochavara (Author) commented Apr 17, 2020

Correct, I'm installing the alpha version for snapshot feature support.
It happens with stable, dev, and alpha, for all versions 0.5.0 through 0.7.0, though.

Full output:

I0417 16:33:29.986931 1 main.go:67] Driver vendor version v0.7.0-gke.0
I0417 16:33:29.987006 1 gce.go:80] Using GCE provider config
I0417 16:33:29.987166 1 gce.go:125] GOOGLE_APPLICATION_CREDENTIALS env var set /etc/cloud-sa/cloud-sa.json
I0417 16:33:29.987176 1 gce.go:129] Using DefaultTokenSource &oauth2.reuseTokenSource{new:jwt.jwtSource{ctx:(*context.cancelCtx)(0xc000296300), conf:(*jwt.Config)(0xc000118780)}, mu:sync.Mutex{state:0, sema:0x0}, t:(*oauth2.Token)(nil)}
F0417 16:33:31.212427 1 main.go:83] Failed to get cloud provider: Failed getting Project and Zone: Get http://169.254.169.254/computeMetadata/v1/instance/zone: dial tcp 169.254.169.254:80: connect: connection refused
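
The call that fails is the GCE instance metadata endpoint. As a quick check (a sketch only; it assumes curl is available inside the driver container, which it may not be), reachability from within the pod can be tested with:

curl -s -H "Metadata-Flavor: Google" http://169.254.169.254/computeMetadata/v1/instance/zone

A "connection refused" here matches the failure above; a zone path such as projects/&lt;project-number&gt;/zones/&lt;zone&gt; means the metadata server is reachable.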

@msau42 (Contributor) commented Apr 17, 2020

Do you have hostNetwork: true set in the DaemonSet spec?

@prashantokochavara (Author)

Nope, I do not. Do I need to add that to the spec somewhere?

@msau42 (Contributor) commented Apr 17, 2020

Yes, see:
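
For orientation, a minimal sketch of where the field goes in a node DaemonSet spec (the names and image tag here are illustrative placeholders, not necessarily the driver's actual manifest):

apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: csi-gce-pd-node
spec:
  selector:
    matchLabels:
      app: gcp-compute-persistent-disk-csi-driver
  template:
    metadata:
      labels:
        app: gcp-compute-persistent-disk-csi-driver
    spec:
      hostNetwork: true   # lets the driver reach the GCE metadata server at 169.254.169.254
      containers:
        - name: gce-pd-driver
          image: gke.gcr.io/gcp-compute-persistent-disk-csi-driver:v0.7.0-gke.0   # placeholder tag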

@prashantokochavara (Author)

I tried that and am still facing the same issue.
This is an OpenShift environment that I am working with: 4.3 with Kubernetes 1.16. Are there any known supportability issues?

@msau42 (Contributor) commented Apr 17, 2020

@gnufied @jsafrane are you aware of any configuration that needs to be done in OpenShift to access the GCP metadata server?

@jsafrane (Contributor)

I'm not sure what the GCP metadata server is... Is it the link-local address used to get VM metadata? The DaemonSet pods must use hostNetwork: true.

OpenShift does not allow random pods to reach the VM metadata; we used to put some sensitive material there (I don't remember exactly what, some certificates?).

@jsafrane (Contributor)

There is nothing else OpenShift-specific...

@msau42 (Contributor) commented Apr 20, 2020

Yes, I mean the link-local address used to get VM metadata: 169.254.169.254:80: connect: connection refused

@prashantokochavara (Author)

@msau42 have we been able to confirm that non-OCP environments are not having this issue?

@msau42 (Contributor) commented Apr 20, 2020

Yes, we have CI running successfully in the kubetest GCP environment:
https://k8s-testgrid.appspot.com/provider-gcp-compute-persistent-disk-csi-driver#Kubernetes%20Master%20Driver%20Latest

@gnufied @jsafrane are you able to run the PD driver in your OCP environment?

@jsafrane (Contributor)

Yes, I am able to run e2e tests with the manifests from https://github.com/kubernetes/kubernetes/tree/master/test/e2e/testing-manifests/storage-csi/gce-pd on GCP.

@prashantokochavara (Author)

I'm reading another issue, from an AWS project, where similar problems are being hit:
aws/aws-node-termination-handler#21

I am also hitting similar metadata issues when using EC2 with OCP and the EBS CSI driver.
Is there a common code path between the two CSI drivers, by any chance?

@prashantokochavara (Author)

@msau42
kubernetes-sigs/aws-ebs-csi-driver#474 (comment)
Could the GCP driver be hitting the same thing?

@msau42 (Contributor) commented Apr 27, 2020

Are you trying to run the controller on a node that doesn't have access to the metadata service?

@msau42 (Contributor) commented Apr 27, 2020

There is no common code path between the two drivers, but the ideas are similar: they both require access to the metadata service in order to get the project/zone information of the cluster they are running in.

There has been work in both drivers to remove this requirement and allow the controllers to run outside of the Kubernetes cluster, but it requires additional arguments to be passed to the driver and is not the normal case.
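
For illustration only (this is a sketch using the public cloud.google.com/go/compute/metadata package, not the drivers' actual code), the startup lookup both drivers rely on looks roughly like this; without host networking or equivalent access to the link-local metadata address, these calls fail with the "connection refused" error shown above:

package main

import (
	"fmt"
	"log"

	"cloud.google.com/go/compute/metadata" // public GCE metadata client
)

func main() {
	// Both calls go to the link-local metadata address (169.254.169.254).
	// If the pod cannot reach it (for example, no hostNetwork), they fail
	// with "connect: connection refused", as in the log above.
	project, err := metadata.ProjectID()
	if err != nil {
		log.Fatalf("failed getting project: %v", err)
	}
	zone, err := metadata.Zone()
	if err != nil {
		log.Fatalf("failed getting zone: %v", err)
	}
	fmt.Printf("project=%s zone=%s\n", project, zone)
}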

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label (denotes an issue or PR has remained open with no activity and has become stale) on Jul 26, 2020

@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot added the lifecycle/rotten label and removed the lifecycle/stale label on Aug 25, 2020

@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

@k8s-ci-robot (Contributor)

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
