
Terraform destroy of helm_release resources. #593

Closed
Ragib95 opened this issue Sep 29, 2020 · 13 comments

@Ragib95

Ragib95 commented Sep 29, 2020

Terraform Version and Provider Version

  • Terraform v0.12.26
  • provider.aws v2.65.0
  • provider.helm v1.3.0
  • provider.kubernetes v1.11.3

Provider Version

  • provider.helm v1.3.0

Affected Resource(s)

  • helm_release

Terraform Configuration Files


provider "helm" {
  kubernetes {
    load_config_file = false
    host             = "${aws_eks_cluster.aws_eks.endpoint}"

    cluster_ca_certificate = "${base64decode(aws_eks_cluster.aws_eks.certificate_authority.0.data)}"

    token = data.aws_eks_cluster_auth.main.token

  }
}

resource "helm_release" "nginx-ingress" {
  name             = "nginx-ingress"
  chart            = "/nginx-ingress/"
  namespace        = "opsera"
  create_namespace = true
  timeout          = 600

  values = [
    "${file("value.yaml")}"
  ]

  depends_on = [
    "aws_eks_node_group.node",
    "helm_release.cluster-autoscaler",
    "aws_acm_certificate.public_cert"
  ]
}

Debug Output

helm_release.nginx-ingress: Destroying... [id=nginx-ingress]
helm_release.nginx-ingress: Destruction complete after 8s
aws_eks_node_group.node: Destroying... [id=*****node]

Expected Behavior

Destruction of the helm_release should wait for all of its resources (pods, services, and ingress) to be fully destroyed before Terraform reports it as "Destruction complete".

Actual Behavior

The release reaches the "Destruction complete" state within 7-8 seconds, before its pods and services are fully destroyed. As a result, destruction of the EKS nodes starts while the ELB is still attached to the service.

Reason: before Helm has finished removing the pods and services, Terraform starts deleting the node group and cluster, leaving the pods stuck in the Terminating state.


Steps to Reproduce

  1. Create an EKS cluster with an Nginx ingress.
  2. Destroy the resources using terraform destroy.
  3. The destroy times out because the ELB attached to the Nginx service is never destroyed.

@Ragib95 Ragib95 added the bug label Sep 29, 2020
@alexsomesan
Member

@Ragib95 This is expected behaviour due to a limitation in Terraform that causes it to not recognise the implicit dependency between the Helm resource and the EKS cluster resource. Terraform tries to parallelise the destroy operations when no dependency is known between the resources. This can lead to the EKS cluster being destroyed before the Helm release itself.

I'd suggest setting an explicit dependency on the EKS cluster resource in the helm_release resource, like this:

depends_on = [
  aws_eks_cluster.aws_eks,
]
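
Applied to the configuration above, that would look roughly like the following (a sketch only, reusing the resource names from the reporter's config rather than a verified fix):

resource "helm_release" "nginx-ingress" {
  name             = "nginx-ingress"
  chart            = "/nginx-ingress/"
  namespace        = "opsera"
  create_namespace = true
  timeout          = 600

  values = [
    file("value.yaml")
  ]

  # Explicit dependency on the EKS cluster (and node group) so Terraform only
  # starts tearing down the cluster after the release has been destroyed.
  depends_on = [
    aws_eks_cluster.aws_eks,
    aws_eks_node_group.node,
  ]
}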

@aareet
Contributor

aareet commented Jan 6, 2021

We currently don't have a way to know what resources are created. We will have to wait for helm/helm#2378 to be implemented.

@aareet aareet added the upstream-helm Issue is in Helm not the provider label Jan 6, 2021
@devurandom

I am unable to terraform destroy -target=... a helm_release resource:

Error: uninstall: Release not loaded: metrics-server: release: not found

Is this another manifestation of this issue, or should I open a separate one?

@jocutajar

We currently don't have a way to know what resources are created. We will have to wait for helm/helm#2378 to be implemented.

Issue closed, but not fixed.

@visla-xugeng

I got the same error when I tried to destroy resources with Terraform. The Helm release was deleted, but the pods were left in "Terminating" status. I found that all of our Helm chart resources have this issue.

My Terraform structure:

  • dev: calls modules
  • prod: calls modules
  • modules: all resources (including Helm charts) are defined in the module directory

Any solutions or ideas?

@FearlessHyena

We currently don't have a way to know what resources are created. We will have to wait for helm/helm#2378 to be implemented.

Issue closed, but not fixed.

It looks like the referenced Helm issue has been fixed by helm/helm#9702.
Would that make it easier to solve this issue?

@avinashpancham

@alexsomesan as mentioned earlier in this thread helm/helm#9702 seems to solve this issue from within Helm.

I think it could then be solved in the Terraform Helm provider by adding a new wait_for_destroy argument that is passed through to the Helm uninstall command.

I don't know exactly how to do it, but if you could point me in the right direction I could give it a try.
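
For illustration only, a rough sketch of how such an argument might be used if it were added to the provider (wait_for_destroy is the hypothetical name proposed above; it does not exist in the provider today):

resource "helm_release" "metrics_server" {
  name  = "metrics-server"
  chart = "metrics-server"

  # Hypothetical argument: ask the provider to wait for the release's resources
  # to be removed during uninstall, building on the wait support added in helm/helm#9702.
  wait_for_destroy = true
}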

@ClenchPaign

Any update on the Terraform side for helm/helm#9702?

@jferris

jferris commented Feb 26, 2022

I believe this was resolved by #786. After upgrading the Helm provider to 2.4, the 'wait' attribute of the helm_release is respected during terraform destroy.
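
For anyone trying this, a minimal sketch of the relevant pieces (assuming the hashicorp/helm source address; wait already defaults to true and is shown explicitly only for clarity):

terraform {
  required_providers {
    helm = {
      source  = "hashicorp/helm"
      version = ">= 2.4.0" # per the comment above, wait is respected on destroy from 2.4
    }
  }
}

resource "helm_release" "nginx-ingress" {
  name  = "nginx-ingress"
  chart = "/nginx-ingress/"

  wait = true
}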

@RicoToothless

I believe this was resolved by #786. After upgrading the Helm provider to 2.4, the 'wait' attribute of the helm_release is respected during terraform destroy.

I don't think so.

I tried 2.4.1, 2.5.0 and 2.5.1.

wait didn't fix the issue for me (and its default value is already true, by the way).

@jocutajar

Hi, #786 is an impressive MR (to say the least)! I'm not brave enough to go dig into it. Do we need a test scenario for the wait on destroy?

@WillerWasTaken

Our current workaround, which isn't great, but... yeah...

resource "helm_release" "nginx_ingress_controller" {
  name       = local.service_name_ingress-nginx
  namespace  = var.namespace
  repository = "https://kubernetes.github.io/ingress-nginx"
  chart      = "ingress-nginx"
  version    = "4.2.1"

  values = [
    yamlencode(local.helm_chart_ingress-nginx_values)
  ]
  
  max_history = 3
  depends_on = [
    helm_release.aws_load_balancer_controller,
    time_sleep.wait_nginx_termination
  ]
}

# Helm chart destruction will return immediately, we need to wait until the pods are fully evicted
# https://github.com/hashicorp/terraform-provider-helm/issues/593
resource "time_sleep" "wait_nginx_termination" {
  destroy_duration = "${local.ingress_nginx_terminationGracePeriodSeconds}s"
}

A fixed sleep timer does the job; it waits longer than necessary, but it works for now :/

@github-actions

github-actions bot commented Sep 3, 2023

Marking this issue as stale due to inactivity. If this issue receives no comments in the next 30 days it will automatically be closed. If this issue was automatically closed and you feel this issue should be reopened, we encourage creating a new issue linking back to this one for added context. This helps our maintainers find and focus on the active issues. Maintainers may also remove the stale label at their discretion. Thank you!

@github-actions github-actions bot added the stale label Sep 3, 2023
@github-actions github-actions bot closed this as not planned (won't fix, can't repro, duplicate, stale) Oct 4, 2023
@github-actions github-actions bot locked as resolved and limited conversation to collaborators Oct 4, 2024