
Failure during planning phase when node pool resources are exhausted #4116

Closed
coryodaniel opened this issue Jul 26, 2019 · 3 comments
Labels: bug · forward/review (In review; remove label to forward) · service/container

Comments

@coryodaniel

When a machine type is exhausted in a zone, terraform plan fails, making it impossible to destroy, taint, or change the node pool.

To continue, you have to delete the node pool in the GCP UI or CLI and then remove it from the Terraform state.
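The manual cleanup described above can be sketched roughly as follows; the pool, cluster, and resource names are illustrative placeholders, not values confirmed by this issue, and the commands act on a live GCP project:

```sh
# Delete the stuck node pool out-of-band (placeholder names).
gcloud container node-pools delete pool-worker-preemptible \
  --cluster CLUSTER_NAME --zone asia-northeast1-a

# Then drop it from Terraform state so plan stops trying to read it.
terraform state rm google_container_node_pool.worker_preemptible
```

Adjust the names and zone to match your own configuration before running anything.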

Community Note

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment
  • If an issue is assigned to the "modular-magician" user, it is either in the process of being autogenerated, or is planned to be autogenerated soon. If an issue is assigned to a user, that user is claiming responsibility for the issue. If an issue is assigned to "hashibot", a community member has claimed the issue already.

Terraform Version

Affected Resource(s)

  • google_container_node_pool

Terraform Configuration Files

Applies to any GCP machine type that is exhausted.
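A minimal configuration with the same shape as the reporter's setup might look like the sketch below; the cluster reference and names are assumptions for illustration, not taken from the original report:

```hcl
resource "google_container_node_pool" "worker_preemptible" {
  name     = "pool-worker-preemptible"
  cluster  = google_container_cluster.primary.name
  location = "asia-northeast1-a"

  node_count = 1

  node_config {
    # n1-standard-96 preemptible VMs are easy to exhaust in a single zone.
    machine_type = "n1-standard-96"
    preemptible  = true
  }
}
```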

Debug Output

Error: Error reading NodePool "pool-worker-preemptible" from cluster "CLUSTER_NAME": Nodepool "pool-worker-preemptible" has status "RUNNING_WITH_ERROR" with message "asia-northeast1-a: Deploy error: Not all instances running in IGM after 4m4.752299662s. Expect 1. Current errors: [ZONE_RESOURCE_POOL_EXHAUSTED_WITH_DETAILS]: Instance 'INSTANCE_NAME' creation failed: The zone 'projects/ox-delivery-prod/zones/asia-northeast1-a' does not have enough resources available to fulfill the request. '(resource type:compute)'."

Expected Behavior

The plan reads the current state and does not fail.

Actual Behavior

The plan encounters an error while reading the resource and fails.

Steps to Reproduce

  1. terraform apply
  2. Wait for the machine type to be exhausted in a region
  3. terraform plan # this will crash

Important Factoids

The machine type is exhausted. We are currently requesting n1-standard-96 preemptible VMs; these are easy to exhaust (and I know we shouldn't be using them), but the bug exists for other machine types as well.

References

@ghost ghost added the bug label Jul 26, 2019
@paddycarver paddycarver self-assigned this Aug 1, 2019
@paddycarver (Contributor)

I think this is definitely related to #3304. The crux of the problem is that we need to detect the error for when resources are exhausted, but we also need to distinguish it from other errors, and I think that's trickier than it would appear at first blush.

@rileykarson (Collaborator)

Closing as a duplicate of #3304.

@ghost

ghost commented Mar 28, 2020

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.

If you feel this issue should be reopened, we encourage creating a new issue linking back to this one for added context. If you feel I made an error 🤖 🙉 , please reach out to my human friends 👉 [email protected]. Thanks!

@ghost ghost locked and limited conversation to collaborators Mar 28, 2020
@github-actions github-actions bot added service/container forward/review In review; remove label to forward labels Jan 15, 2025