
data sources that generate temporary credentials should not persist values in plan tfstate #24886

Open
llamahunter opened this issue May 7, 2020 · 13 comments


Terraform Version

Terraform v0.12.24

Terraform Configuration Files

data "aws_eks_cluster" "example" {
  name = "example"
}

data "aws_eks_cluster_auth" "example" {
  name = "example"
}

provider "kubernetes" {
  host                   = data.aws_eks_cluster.example.endpoint
  cluster_ca_certificate = base64decode(data.aws_eks_cluster.example.certificate_authority[0].data)
  token                  = data.aws_eks_cluster_auth.example.token
  load_config_file       = false
}

Debug Output

From terraform apply:

Error: Unauthorized

  on .terraform/modules/prometheus_operator/modules/prometheus-operator/main.tf line 36, in resource "kubernetes_namespace" "this":
  36: resource "kubernetes_namespace" "this" {

From EKS authorization log:

time="2020-05-06T05:17:59Z" level=warning msg="access denied" client="127.0.0.1:55512" error="input token was not properly formatted: X-Amz-Date parameter is expired (15 minute expiration) 2020-05-06 01:09:00 +0000 UTC" method=POST path=/authenticate

Note the very stale X-Amz-Date token (2020-05-06 01:09:00Z) relative to the current time (2020-05-06T05:17:59Z). The token date corresponds to the time at which the terraform plan was run, several hours earlier.

Expected Behavior

data.aws_eks_cluster_auth.example.token should be refreshed on apply. Authentication tokens should not be cached as part of the plan.

Actual Behavior

data.aws_eks_cluster_auth.example.token is cached in the plan and reused later on apply, but EKS tokens are only valid for 15 minutes.

Steps to Reproduce

  1. terraform plan and save plan output
  2. wait more than 15 minutes
  3. terraform apply existing cached plan

Additional Context

EKS clusters obtain their Kubernetes authentication tokens via AWS IAM: the current IAM identity is used to generate a token that a service in the EKS cluster accepts for only 15 minutes. The aws_eks_cluster_auth data source is the way to get this token in Terraform. It should not be cached as part of the plan, but regenerated on every terraform invocation.

References

I filed the below issue on the terraform-provider-aws project, but they said this is actually a shortcoming of Terraform Core.

@llamahunter llamahunter changed the title data sources that generate credentials should not persist values in plan tfstate data sources that generate temporary credentials should not persist values in plan tfstate May 7, 2020
@gblues

gblues commented Jul 28, 2020

This is biting us in the butt (also AWS EKS): the Kubernetes auth token is generated at the very start of the planning process, so when planning takes more than 15 minutes, the apply fails.

@mnowrot

mnowrot commented Sep 24, 2020

Is there any way to influence this 15-minute timeout? (Perhaps from the AWS EKS side?)

@gblues

gblues commented Sep 24, 2020

There isn't.

However, what you can do (and what we have done to work around this) is wrap all Kubernetes operations in Helm charts: the Helm provider can refresh credentials at execution time, so the plan can be applied at any point (assuming no state changes occur in the meantime that would invalidate it).

Thanks to @bear454 for demonstrating this workaround in the SUSE/cap-terraform PR that Github has helpfully linked above.
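
A minimal sketch of what that workaround looks like, assuming the Helm provider is configured against the same cluster; the release name, repository, and chart below are illustrative, not taken from our actual setup:

```hcl
# Instead of kubernetes_namespace / kubernetes_* resources, deploy a chart.
# The Helm provider authenticates when the release is actually applied,
# so no short-lived token ends up baked into the saved plan.
resource "helm_release" "prometheus_operator" {
  name             = "prometheus-operator"  # illustrative names/URLs
  repository       = "https://prometheus-community.github.io/helm-charts"
  chart            = "kube-prometheus-stack"
  namespace        = "monitoring"
  create_namespace = true
}
```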

@llamahunter

That kind of defeats the purpose of using the kubernetes terraform provider to track resources.

@bear454

bear454 commented Sep 25, 2020

Well, if it's broken, sometimes you have to work around it.

@dschunack

Is anyone working on this?

@llamahunter

Is anyone addressing this? It's a pain for AWS EKS Kubernetes clusters.

@stevehipwell

/bump

@rotilho

rotilho commented Mar 30, 2022

@danieldreier you labeled it as an enhancement but it sounds more like a bug. It's impossible at the moment to apply a saved plan due to this issue.

Can you relabel it as a bug?

@flixx

flixx commented Feb 10, 2023

Hello,
this limitation makes it hard to use Atlantis to manage Kubernetes resources.
With Atlantis, a terraform plan is created and saved as soon as a pull request is opened.

Then, after the pull request has been reviewed, the plan can be applied with a comment.
An error occurs if the PR was opened more than 15 minutes ago.

There is an issue for this in the atlantis project here:
runatlantis/atlantis#800

Would be really nice to have this fixed.

Maybe as an idea how this could be solved:

Terraform could always re-create the plan when running terraform apply, even when a saved plan is given as input.
If a saved plan was given, Terraform could validate that the regenerated plan still matches the input plan, and error out otherwise. To avoid backwards-compatibility issues, this behavior could be controlled by a flag (--refresh-plan).

@joemiller

joemiller commented Apr 26, 2023

Is there a known workaround here? This is a significant problem for some CI workflows.

@SirineBenbrahim

+1 on this issue. This makes it impossible to automate workflows with auth tokens.

@Unichron

Unichron commented Sep 8, 2023

For the kubernetes provider I solved it with the help of exec plugins: https://registry.terraform.io/providers/hashicorp/kubernetes/latest/docs#exec-plugins

Although it depends on some binary being available, it should be relatively easy to achieve in most CI/CD workflows.

Example for GKE:

provider "kubernetes" {
  host                   = "https://${data.google_container_cluster.this.endpoint}"
  cluster_ca_certificate = base64decode(data.google_container_cluster.this.master_auth[0].cluster_ca_certificate)
  exec {
    api_version = "client.authentication.k8s.io/v1beta1"
    command     = "gke-gcloud-auth-plugin"
  }
}
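
For EKS (the original report), the equivalent exec configuration would presumably look like the following, assuming the aws CLI (a version that provides eks get-token) is available on the machine running terraform apply, and reusing the cluster name from the config at the top of this issue:

```hcl
provider "kubernetes" {
  host                   = data.aws_eks_cluster.example.endpoint
  cluster_ca_certificate = base64decode(data.aws_eks_cluster.example.certificate_authority[0].data)

  # The token is fetched when the provider actually connects, so a plan
  # saved hours earlier still applies with fresh credentials instead of
  # the expired one a cached aws_eks_cluster_auth value would carry.
  exec {
    api_version = "client.authentication.k8s.io/v1beta1"
    command     = "aws"
    args        = ["eks", "get-token", "--cluster-name", "example"]
  }
}
```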
