Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Re-enable stats-based replica rebalancing #2051

Closed
jseldess opened this issue Oct 18, 2017 · 3 comments
Closed

Re-enable stats-based replica rebalancing #2051

jseldess opened this issue Oct 18, 2017 · 3 comments
Assignees
Milestone

Comments

@jseldess
Copy link
Contributor

jseldess commented Oct 18, 2017

This means turning on both load-based replica rebalancing and size-based replica rebalancing. So in addition to some how-to docs, we'll need to update our architecture docs.

Background: https://airtable.com/tblD3oZPLJgGhCmch/viwINAgXA5on9pr2T/recWJ095g7CriLJ88
PM: @awoods187
Eng Partner: @a-robinson

@jseldess
Copy link
Contributor Author

jseldess commented Aug 28, 2018

PR: cockroachdb/cockroach#28340

From release notes:

Range leases are now automatically rebalanced throughout the cluster to even out the amount of QPS being handled by each node. [#28340][#28340] {% comment %}doc{% endcomment %}

@a-robinson, is this work now done and ready to be documented?

@a-robinson
Copy link
Contributor

Not quite, I'd suggest waiting until next week.

@jseldess jseldess removed the ready label Aug 28, 2018
@jseldess
Copy link
Contributor Author

jseldess commented Sep 5, 2018

From forum:

  • The decisions are based on the number of queries processed by a node made over the last half hour or so using an exponentially weighted moving average to more heavily weight recent load. They only happen if load on the cluster is above a certain threshold, the difference in load between nodes is substantial (greater than 25% difference by default), and there are leases that could be transferred that will meaningfully even out the load in the cluster.
  • Only leases are transferred based on load as of today. Very little I/O is involved in lease transfers, since the data itself doesn’t have to be moved from the nodes it’s currently on in order to change the leaseholder.

rmloveland added a commit that referenced this issue Oct 25, 2018
Fixes #2051.

Summary of changes:

- Add a paragraph to *Architecture > Replication Layer* describing that
  as of v2.1, in addition to the rebalancing that occurs when nodes are
  added or removed, we also rebalance leases and replicas based on load.
  Also added links to relevant cluster setting and zone config docs for
  those who want more info.
rmloveland added a commit that referenced this issue Oct 25, 2018
Fixes #2051.

Summary of changes:

- Add a paragraph to *Architecture > Replication Layer* describing that
  as of v2.1, in addition to the rebalancing that occurs when nodes are
  added or removed, we also rebalance leases and replicas based on load.
  Also added links to relevant cluster settings and zone config docs for
  those who want more info.
rmloveland added a commit that referenced this issue Oct 26, 2018
Fixes #2051.

Summary of changes:

- Add a paragraph to *Architecture > Replication Layer* describing that
  as of v2.1, in addition to the rebalancing that occurs when nodes are
  added or removed, we also rebalance leases and replicas based on load.
  Also added links to relevant cluster settings and zone config docs for
  those who want more info.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants