"error":"Internal error occurred: failed calling webhook \"validation.webhook.karpenter.sh\" #6879
Comments
Could you please share what your current Karpenter deployment looks like?
@rschalo Please find the v0.37.2 and v1.0.1 deployment files below.
version: v0.37.2
Source: karpenter/templates/poddisruptionbudget.yaml
apiVersion: policy/v1
It's super difficult to interpret this unstructured data dump. Can you instead create and link a public gist with a single YAML file?
I don't think it's spelled out in the upgrade documentation, but apparently the issue goes away if you upgrade to 1.0 and then delete the validating and mutating webhook registrations that Karpenter 0.37 created.
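For anyone landing here, a minimal sketch of that cleanup is below. It is not taken from the upgrade guide. The only registration name known from this thread is validation.webhook.karpenter.sh (from the error message); any other leftover names are install-specific and have to be read off the listing. The run wrapper and the cleanup_stale_webhooks function are hypothetical helpers added here so the script prints the commands by default instead of deleting anything.

```shell
#!/bin/sh
# Dry-run wrapper (hypothetical helper): prints each command unless DRY_RUN=0 is set.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

cleanup_stale_webhooks() {
  # List every webhook registration so the Karpenter-owned ones can be identified.
  run kubectl get validatingwebhookconfigurations,mutatingwebhookconfigurations

  # Delete the validating registration named in the error message.
  # Other leftover names may differ per install; add them after checking the list.
  run kubectl delete validatingwebhookconfiguration validation.webhook.karpenter.sh
}

cleanup_stale_webhooks
```

Run it once as-is to review the commands, then rerun with DRY_RUN=0 against the intended kube context. Deleting cluster-scoped webhook registrations affects the whole cluster, so it is worth confirming the context first.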
Thank you for this @gnadaban! This upgrade has been .... quite the process.
Description
I attempted to upgrade Karpenter from version v0.37.0 to v1.0.1. Following the documentation, I first upgraded to the minor version v0.37.2, enabling webhooks and patching NodePools, NodeClaims, and EC2NodeClass, which worked fine.
Later, I upgraded to version v1.0.1. However, after the upgrade, the Karpenter controller did not function as expected, and I encountered the following error:
"{"level":"ERROR","time":"2024-08-27T07:49:42.502Z","logger":"controller","message":"Reconciler error","commit":"62a726c","controller":"nodeclaim.tagging","controllerGroup":"karpenter.sh","controllerKind":"NodeClaim","NodeClaim":{"name":"nats-nmjlx"},"namespace":"","name":"nats-nmjlx","reconcileID":"efb1972f-570f-4bad-b952-130449b299c7","error":"Internal error occurred: failed calling webhook "validation.webhook.karpenter.sh": failed to call webhook: the server rejected our request for an unknown reason"}"
I applied the NodePool, EC2NodeClass, and NodeClaim CRDs using the following commands:
kubectl apply -f "https://raw.githubusercontent.com/aws/karpenter-provider-aws/v1.0.1/pkg/apis/crds/karpenter.sh_nodepools.yaml"
kubectl apply -f "https://raw.githubusercontent.com/aws/karpenter-provider-aws/v1.0.1/pkg/apis/crds/karpenter.k8s.aws_ec2nodeclasses.yaml"
kubectl apply -f "https://raw.githubusercontent.com/aws/karpenter-provider-aws/v1.0.1/pkg/apis/crds/karpenter.sh_nodeclaims.yaml"
I then generated and applied the Karpenter YAML file as follows:
helm template karpenter oci://public.ecr.aws/karpenter/karpenter --version "1.0.1" --namespace "kube-system" \
  --set "settings.clusterName=nonprod-eks" \
  --set "settings.interruptionQueue=nonprod-eks" \
  --set "serviceAccount.annotations.eks\.amazonaws\.com/role-arn=arn:aws:iam::852911611684:role/KarpenterControllerRole-nonprod-eks" \
  --set controller.resources.requests.cpu=1 \
  --set controller.resources.requests.memory=1Gi \
  --set controller.resources.limits.cpu=1 \
  --set controller.resources.limits.memory=1Gi > karpenter.yaml
Please advise on how to resolve this issue.