Error: failed to fetch endpoints from etcd cluster member list: context deadline exceeded #31674

milan-dikkumburage · 2022-02-09T14:52:49Z

What happened?

I tried to Set up a High Availability etcd Cluster with kubeadm.
I followed the official guide https://kubernetes.io/docs/setup/production-environment/tools/kubeadm/setup-ha-etcd-with-kubeadm/
after all the steps execute when i check cluster health its giving below error message

[root@etcd-01 ~]# docker run --rm -it \

--net host
-v /etc/kubernetes:/etc/kubernetes k8s.gcr.io/etcd:3.5.1-0 etcdctl
--cert /etc/kubernetes/pki/etcd/peer.crt
--key /etc/kubernetes/pki/etcd/peer.key
--cacert /etc/kubernetes/pki/etcd/ca.crt
--endpoints https://137.184.157.161:2379 endpoint health --cluster
{"level":"warn","ts":"2022-02-09T08:12:22.497Z","logger":"etcd-client","caller":"v3/retry_interceptor.go:62","msg":"retrying of unary invoker failed","target":"etcd-endpoints://0xc00045e540/137.184.157.161:2379","attempt":0,"error":"rpc error: code = DeadlineExceeded desc = latest balancer error: last connection error: connection error: desc = "transport: Error while dialing dial tcp 137.184.157.161:2379: connect: connection refused""}
Error: failed to fetch endpoints from etcd cluster member list: context deadline exceeded

What you expected to happen?

as per the official guide etcd cluster should be healthy state.

How to reproduce it (as minimally and precisely as possible)?

You can follow the official guide with https://kubernetes.io/docs/setup/production-environment/tools/kubeadm/setup-ha-etcd-with-kubeadm/

Anything else we need to know?

Versions

kubeadm version (use v1.23.3):

Environment:

Kubernetes version (use v1.23.3):
Cloud provider or hardware configuration:

Master Nodes -03
Worker Nodes -03
etcd -03
HA load balancer -01

Digital Ocean cloud
all instance - 4 HB Ram ,2 CPU ,80 GB disk

OS (e.g. from /etc/os-release):
CentOS 8

[root@etcd-01 ~]# cat /etc/os-release
NAME="CentOS Stream"
VERSION="8"
ID="centos"
ID_LIKE="rhel fedora"
VERSION_ID="8"
PLATFORM_ID="platform:el8"
PRETTY_NAME="CentOS Stream 8"
ANSI_COLOR="0;31"
CPE_NAME="cpe:/o:centos:centos:8"
HOME_URL="https://centos.org/"
BUG_REPORT_URL="https://bugzilla.redhat.com/"
REDHAT_SUPPORT_PRODUCT="Red Hat Enterprise Linux 8"
REDHAT_SUPPORT_PRODUCT_VERSION="CentOS Stream"

Kernel (e.g. uname -a):

[root@etcd-01 ~]# uname -a
Linux etcd-01 4.18.0-277.el8.x86_64 #1 SMP Wed Feb 3 20:35:19 UTC 2021 x86_64 x86_64 x86_64 GNU/Linux

Container runtime (CRI) (e.g. containerd, cri-o):
Docker
Container networking plugin (CNI) (e.g. Calico, Cilium):
wavenet
Others:

The text was updated successfully, but these errors were encountered:

k8s-ci-robot · 2022-02-09T14:52:56Z

@sanjaz10: This issue is currently awaiting triage.

SIG Docs takes a lead on issue triage for this website, but any Kubernetes member can accept issues by applying the triage/accepted label.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

milan-dikkumburage · 2022-02-10T06:03:26Z

Issue fixed my self. In the Step 01 Configure the kubelet to be a service manager for etcd remove the this line "
--container-runtime=remote --container-runtime-endpoint=unix:///var/run/containerd/containerd.sock " . Please use below command in the steps 01.
cat << EOF > /etc/systemd/system/kubelet.service.d/20-etcd-service-manager.conf
[Service]
ExecStart=
ExecStart=/usr/bin/kubelet --address=127.0.0.1 --pod-manifest-path=/etc/kubernetes/manifests --cgroup-driver=systemd
Restart=always
EOF

neolit123 · 2022-02-10T08:15:11Z

See this if you are using docker:
https://kubernetes.io/blog/2022/01/07/kubernetes-is-moving-on-from-dockershim/

milan-dikkumburage · 2022-02-10T12:46:57Z

@neolit123 yea. that's why i already removed and checked .

k8s-ci-robot added the needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. label Feb 9, 2022

milan-dikkumburage closed this as completed Feb 10, 2022

milan-dikkumburage mentioned this issue Feb 10, 2022

ETCD cluster install failed :- Error: failed to fetch endpoints from etcd cluster member list: context deadline exceeded kubernetes/kubeadm#2651

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error: failed to fetch endpoints from etcd cluster member list: context deadline exceeded #31674

Error: failed to fetch endpoints from etcd cluster member list: context deadline exceeded #31674

milan-dikkumburage commented Feb 9, 2022

k8s-ci-robot commented Feb 9, 2022

milan-dikkumburage commented Feb 10, 2022 •

edited

Loading

neolit123 commented Feb 10, 2022

milan-dikkumburage commented Feb 10, 2022

Error: failed to fetch endpoints from etcd cluster member list: context deadline exceeded #31674

Error: failed to fetch endpoints from etcd cluster member list: context deadline exceeded #31674

Comments

milan-dikkumburage commented Feb 9, 2022

What happened?

What you expected to happen?

How to reproduce it (as minimally and precisely as possible)?

Anything else we need to know?

Versions

k8s-ci-robot commented Feb 9, 2022

milan-dikkumburage commented Feb 10, 2022 • edited Loading

neolit123 commented Feb 10, 2022

milan-dikkumburage commented Feb 10, 2022

milan-dikkumburage commented Feb 10, 2022 •

edited

Loading