Documentation: add FAQs on membership operation #7028
@@ -62,6 +62,22 @@ With longer latencies, the default etcd configuration may cause frequent electio

etcdctl provides a `snapshot` command to create backups. See [backup][backup] for more details.

#### Always remove first when replacing member?
How to replace an etcd member?
Also, I wonder whether it would be a good idea to link the related issue with each question in the FAQ. That would give the reader some context. What do you think?
Why would this need context? It should be self-contained...
#### Always remove first when replacing member?
#### Should I add a member before removing an unhealthy member?
#### Why so strict about membership change?

etcd sets `strict-reconfig-check` in order to reject reconfiguration requests that would cause quorum loss. Abandoning quorum is really risky (especially when the cluster is already in a bad way). We're aware that losing quorum is painful, but disabling quorum on membership could lead to full fledged cluster inconsistency and that would be even worse in many applications ("disk geometry corruption" being a candidate for most terrifying).
s/ in a bad way/unhealthy
s/We're aware.../Although it may be tempting to disable quorum checking if there's quorum loss to add a new member, this could lead to full fledged cluster inconsistency. For many applications, this will make the problem even worse ("disk geometry corruption" being a candidate for most terrifying).
On the other hand, if the downed member is removed from cluster membership first, the number of members becomes 2 and the quorum remains at 2. Following that removal by adding a new member will also keep the quorum steady at 2. So, even if the new node can't be brought up, it's still possible to remove the new member through quorum on the remaining live members.
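
The quorum arithmetic in the paragraph above can be checked with a small sketch. It is only an illustration, not etcd's actual reconfiguration check: it assumes a 3-member cluster with one member down, takes quorum to be a simple majority, and uses the hypothetical helpers `quorum` and `has_quorum`.

```python
def quorum(members: int) -> int:
    """Majority size for a cluster with `members` voting members."""
    return members // 2 + 1


def has_quorum(members: int, live: int) -> bool:
    """True if the `live` members can still form a majority."""
    return live >= quorum(members)


# Assumed scenario: 3 members, 1 of them down, so 2 are live.
members, live = 3, 2

# Order A: remove the downed member first, then add its replacement.
after_remove = members - 1      # 2 members, quorum stays at 2
after_readd = after_remove + 1  # 3 members, quorum still 2
remove_first_ok = has_quorum(after_remove, live) and has_quorum(after_readd, live)

# Order B: add the replacement first, before it has come up.
after_add = members + 1         # 4 members, quorum rises to 3
add_first_ok = has_quorum(after_add, live)

print("remove first keeps quorum:", remove_first_ok)
print("add first keeps quorum:", add_first_ok)
```

Under these assumptions, the remove-first order never needs more than the 2 live members, while the add-first order needs 3 votes and therefore depends on the new member starting successfully.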

#### Why so strict about membership change?
Why won't etcd accept my membership changes?
#### Always remove first when replacing member?

When replacing an etcd node, we recommend to remove the member first and then add its replacement.
s/we recommend/it's important
Copy Anthony's answer from: etcd-io#6103 etcd-io#6114
All fixed. PTAL. Thanks!
lgtm
Copying @heyitsanthony's answers from: