Ensure only primary sender drives slot ownership updates #754

PingXie · 2024-07-07T05:02:40Z

Fixes a regression introduced in PR #445, which allowed a message from a replica
to update the slot ownership of its primary. The regression results in a
replicaof cycle, causing server crashes due to the cycle detection assert. The
fix restores the previous behavior where only primary senders can trigger
clusterUpdateSlotsConfigWith.

Additional changes:

Handling of primaries without slots is obsoleted by new handling of when a
sender that was a replica announces that it is now a primary.
Replication loop detection code is unchanged but shifted downwards.
Some variables are renamed for better readability and some are introduced to
avoid repeated memcmp() calls.

Fixes #753.

codecov · 2024-07-07T05:13:47Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 70.28%. Comparing base (1a8bd04) to head (a1f0bc0).

Additional details and impacted files

@@             Coverage Diff              @@
##           unstable     #754      +/-   ##
============================================
+ Coverage     70.25%   70.28%   +0.03%     
============================================
  Files           112      112              
  Lines         60590    60587       -3     
============================================
+ Hits          42567    42586      +19     
+ Misses        18023    18001      -22

Files	Coverage Δ
src/cluster_legacy.c	`86.00% <100.00%> (+0.35%)`	⬆️

... and 11 files with indirect coverage changes

hpatro

Question from the issue (repeating here):

Is it any problematic to handle topology update from replica with higher config epoch?

src/cluster_legacy.c

zuiderkwast

The change seems to make sense but I don't fully understand the fix and how it solves the problem. The essence of the problem can be seen in the diff screenshot in #753? Please update the PR description to independently describe the change, rather than only referring to the issue.

It's harder to review the fix when there's refactoring in the same commit. It's easier if the refacoring and the acutal fix are at least in two separate commits.

(I imagine Oran Agra would have rejected the renaming of variables, to avoid any kind of unnecessary changes.)

src/cluster_legacy.c

PingXie · 2024-07-10T20:38:24Z

Is it any problematic to handle topology update from replica with higher config epoch?

Yeah I think it will work but I feel that it might involve a bigger change. This change on the other hand is more scoped, relatively speaking. We can definitely explore this idea separately/incrementally.

PingXie · 2024-07-10T20:57:54Z

The change seems to make sense but I don't fully understand the fix and how it solves the problem. The essence of the problem can be seen in the diff screenshot in #753? Please update the PR description to independently describe the change, rather than only referring to the issue.

Yes - replicas updating the slot ownership on behalf of their primaries is the regression (or behavior change) introduced by #445.

It's harder to review the fix when there's refactoring in the same commit. It's easier if the refacoring and the acutal fix are at least in two separate commits.

(I imagine Oran Agra would have rejected the renaming of variables, to avoid any kind of unnecessary changes.)

I don't agree with the "unnecessary changes". All these changes are either a continuation/fine-tune of #445 or for a good reason of improving readability. I have gone through this code path more times than I can count but I still get lost easily because the naming is just so confusing. A blank statement of "unnecessary changes" is just saying "I don't know what I am doing" - sorry for the unnecessarily strong response :)

zuiderkwast · 2024-07-10T23:32:54Z

It's harder to review the fix when there's refactoring in the same commit. It's easier if the refacoring and the acutal fix are at least in two separate commits.
(I imagine Oran Agra would have rejected the renaming of variables, to avoid any kind of unnecessary changes.)

I don't agree with the "unnecessary changes". All these changes are either a continuation/fine-tune of #445 or for a good reason of improving readability. I have gone through this code path more times than I can count but I still get lost easily because the naming is just so confusing.

Sorry for causing frustration! I know you've very familiar with this code. More so than the rest of us.

With "unnecessary changes", I just meant changes that don't change the logic, i.e. refactoring, not strictly necessary to fix the bug. I didn't say I'm against it. Only that I'd prefer them in separate commits, since it would have made it easier for me to read the diff and spot the actual behaviour change. The diff is +68 −82, while the actual fix is ~5 lines.

Even the title "ensure only primary sender drives..." seemed a bit cryptic to me at first, but of course it's obvious to anyone who has this code fresh in memory. (My though processes involved "What's a primary sender again... Hm.. right, it's when we're receiving a packet on the cluster bus from a primary.")

I'm trying to make reviewing easier for myself by requiring more from the contributors. Not sure if it's sane but I want to be able to handle more PRs faster without being burnt out. I should know you're in the same position.

PingXie · 2024-07-11T03:58:48Z

Sorry for causing frustration! I know you've very familiar with this code. More so than the rest of us.

No worries - we are all doing our jobs.

With "unnecessary changes", I just meant changes that don't change the logic, i.e. refactoring, not strictly necessary to fix the bug. I didn't say I'm against it. Only that I'd prefer them in separate commits, since it would have made it easier for me to read the diff and spot the actual behaviour change. The diff is +68 −82, while the actual fix is ~5 lines.

I disagree. I don't consider this a refactoring, which I agree is better handled in a separate PR. There is a reason why I introduced the regression in the first place. Our discussions on the names show the exact problem of this code not having established clear mental concepts and this was partly the reason why I introduced the regression in the first place. Making these changes in a separate PR loses the exact context of why these changes are needed. And refactoring PRs don't normally get the right amount of scrutinization, which is what I want for this PR. I am all for any nitpick that could help improve the code quality in such a critical part of the code base.

Even the title "ensure only primary sender drives..." seemed a bit cryptic to me at first, but of course it's obvious to anyone who has this code fresh in memory. (My though processes involved "What's a primary sender again... Hm.. right, it's when we're receiving a packet on the cluster bus from a primary.")

I'm trying to make reviewing easier for myself by requiring more from the contributors. Not sure if it's sane but I want to be able to handle more PRs faster without being burnt out. I should know you're in the same position.

I believe I have explained the issue to my best ability in #753, which I linked to this PR as well. Have you got a chance to check it out? I have a tendency to split the bug and PR. Will make sure to carry the context over for this (and future) PR.

Signed-off-by: Ping Xie <[email protected]>

PingXie · 2024-07-11T05:04:46Z

I have updated the names per recommendation. I think they look clearer now. @zuiderkwast PTAL.

Also I think there is another (0-day) bug. We never check the sender's config epoch before accepting its claim of being a primary. I will address it in a separate PR :).

https://github.com/valkey-io/valkey/blob/unstable/src/cluster_legacy.c#L3119

zuiderkwast

Great! These "sender-claimed" names definitely make things more clear.

Also I think there is another (0-day) bug. We never check the sender's config epoch before accepting its claim of being a primary. I will address it in a separate PR :).

OK. I wouldn't mind having it in the same PR (just a separate commit within the PR for easier reading). But a separate PR is also good to if we want to backport it or something.

I don't consider this a refactoring, which I agree is better handled in a separate PR.

I never said separate PR. That would be excessive. I just meant separate commits within the same PR. It might have made it easier to review, though this may just have been me trying to find excuses for not understanding the fix.

I believe I have explained the issue to my best ability in #753, which I linked to this PR as well. Have you got a chance to check it out? I have a tendency to split the bug and PR. Will make sure to carry the context over for this (and future) PR.

Yes, the issue is very good. The problem description and the description of the change are not the same things though. In the PR I think it's good to describe the actual change. The problem description doesn't need to be repeated in detail in the PR when it's explained in the issue.

For the PR description, I'd be happy just what you wrote under "Fix:", and possibly mentioning the additional changes to make it easier to understand the diff. No need to update it now, but I'm copy-pasting some sentences here just to show what I mean, what I think is enough, yet useful for reviewing and also enough to have in the commit log after merging:

Fixes a regression introduced in PR #445, which allowed a message from a replica
to update the slot ownership of its primary. The regression results in a
replicaof cycle, causing server crashes due to the cycle detection assert. The
fix restores the previous behavior where only primary senders can trigger
clusterUpdateSlotsConfigWith.

Additional changes:

* Handling of primaries without slots is obsoleted by new handling of when a
  sender that was a replica announces that it is now a primary.
* Replication loop detection code is unchanged but shifted downwards.
* Some variables are renamed for better readability and some are introduced to
  avoid repeated memcmp() calls.

PingXie · 2024-07-12T07:36:06Z

Thanks @zuiderkwast!

OK. I wouldn't mind having it in the same PR (just a separate commit within the PR for easier reading). But a separate PR is also good to if we want to backport it or something.

Let me start a new PR on this one. It is probably easier that way.

I never said separate PR. That would be excessive. I just meant separate commits within the same PR. It might have made it easier to review, though this may just have been me trying to find excuses for not understanding the fix.

Got it. My bad. I misunderstood. Reviewing commits is a good point.

The problem description and the description of the change are not the same things though. In the PR I think it's good to describe the actual change. The problem description doesn't need to be repeated in detail in the PR when it's explained in the issue.

For the PR description, I'd be happy just what you wrote under "Fix:", and possibly mentioning the additional changes to make it easier to understand the diff. No need to update it now, but I'm copy-pasting some sentences here just to show what I mean, what I think is enough, yet useful for reviewing and also enough to have in the commit log after merging

Make sense. I can do that too.

madolson · 2024-07-12T16:52:22Z

OK. I wouldn't mind having it in the same PR (just a separate commit within the PR for easier reading). But a separate PR is also good to if we want to backport it or something.

I want it in a separate PR. I'm going to fairly strongly disagree with Viktor that I think in this specific case the refactor made the change harder for me to follow. I think that is because I'm more familiar with the code, outside of the newly introduced variables I found everything harder to follow.

madolson · 2024-07-12T16:56:13Z

Yes - replicas updating the slot ownership on behalf of their primaries is the regression (or behavior change) introduced by #445.

Yeah, allowing replicas to update slot ownership on behalf of their primaries is still the change that makes me feel uncomfortable. In the past the algorithm was to just trust primaries as much as possible.

PingXie · 2024-07-12T16:59:26Z

I think that is because I'm more familiar with the code, outside of the newly introduced variables I found everything harder to follow

This is likely proximity bias :). I would suggest also reviewing the function without diff'ing line by line, and then back to the diff. This is how I come to being at peace with it. Let me know what you think afterwards.

madolson · 2024-07-12T17:12:27Z

This is likely proximity bias

Everyone is biased.

madolson · 2024-07-12T17:17:07Z

Let me know what you think afterwards.

I'll stand behind my original statement that the refactoring made evaluating the technical changes more difficult, and would still prefer it in a separate PR. I also think that is a better broad decision to take as a team. The problem with a separate commit is that we squash and merge, and lose the commit.

In some cases you are introducing new concepts, and for that the new names make sense. But in other it just seems like you are renaming variables.

madolson

I think the change makes sense. We probably should avoid taking input from the replica about three state of its primaries as much as possible.

src/cluster_legacy.c

hpatro

Mostly LGTM.

@PingXie @madolson Could we converge and close the PR out? I will further merge unstable to #573 and get it ready.

…eplicaof-cycle

PingXie · 2024-07-15T21:57:42Z

I'll stand behind my original statement that the refactoring made evaluating the technical changes more difficult, and would still prefer it in a separate PR. I also think that is a better broad decision to take as a team. The problem with a separate commit is that we squash and merge, and lose the commit.

Sure. I can do a separate PR in the future.

Could we converge and close the PR out?

Will clean this PR up in a bit.

Signed-off-by: Ping Xie <[email protected]>

src/cluster_legacy.c

Signed-off-by: Ping Xie <[email protected]>

PingXie self-assigned this Jul 7, 2024

PingXie requested review from zuiderkwast, hpatro and madolson July 7, 2024 22:13

hpatro reviewed Jul 10, 2024

View reviewed changes

src/cluster_legacy.c Outdated Show resolved Hide resolved

src/cluster_legacy.c Outdated Show resolved Hide resolved

src/cluster_legacy.c Show resolved Hide resolved

zuiderkwast reviewed Jul 10, 2024

View reviewed changes

src/cluster_legacy.c Outdated Show resolved Hide resolved

src/cluster_legacy.c Outdated Show resolved Hide resolved

src/cluster_legacy.c Show resolved Hide resolved

src/cluster_legacy.c Show resolved Hide resolved

PingXie added 2 commits July 10, 2024 21:17

Ensure only primary sender drives slot ownership updates

d79516a

Signed-off-by: Ping Xie <[email protected]>

Update names

8b9ca7d

Signed-off-by: Ping Xie <[email protected]>

PingXie force-pushed the replicaof-cycle branch from 5223ecb to 8b9ca7d Compare July 11, 2024 04:47

PingXie added 4 commits July 10, 2024 21:51

Revert sender_node to sender

ef8970a

Signed-off-by: Ping Xie <[email protected]>

Revert sender_message to hdr

bb5f48f

Signed-off-by: Ping Xie <[email protected]>

Fix clang-format

b4ae582

Signed-off-by: Ping Xie <[email protected]>

Rename sender_claimed_primary_node to sender_claimed_primary

b533450

Signed-off-by: Ping Xie <[email protected]>

zuiderkwast approved these changes Jul 11, 2024

View reviewed changes

madolson reviewed Jul 12, 2024

View reviewed changes

src/cluster_legacy.c Outdated Show resolved Hide resolved

src/cluster_legacy.c Outdated Show resolved Hide resolved

hpatro approved these changes Jul 15, 2024

View reviewed changes

Merge branch 'unstable' of https://github.com/valkey-io/valkey into r…

6ddc119

…eplicaof-cycle

Incorporate review feedback

a8d87e9

Signed-off-by: Ping Xie <[email protected]>

hpatro reviewed Jul 16, 2024

View reviewed changes

src/cluster_legacy.c Outdated Show resolved Hide resolved

Rename n back to node

a1f0bc0

Signed-off-by: Ping Xie <[email protected]>

madolson approved these changes Jul 16, 2024

View reviewed changes

PingXie merged commit 66d0f7d into valkey-io:unstable Jul 16, 2024
20 checks passed

PingXie deleted the replicaof-cycle branch July 16, 2024 20:05

hpatro mentioned this pull request Jul 17, 2024

Avoid shard id update of replica if not matching with primary shard id #573

Open

PingXie mentioned this pull request Jul 17, 2024

Missing Check for Sender's Config Epoch Before Accepting Primary Claim #798

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ensure only primary sender drives slot ownership updates #754

Ensure only primary sender drives slot ownership updates #754

PingXie commented Jul 7, 2024 •

edited by zuiderkwast

Loading

codecov bot commented Jul 7, 2024 •

edited

Loading

hpatro left a comment

zuiderkwast left a comment

PingXie commented Jul 10, 2024

PingXie commented Jul 10, 2024

zuiderkwast commented Jul 10, 2024

PingXie commented Jul 11, 2024 •

edited

Loading

PingXie commented Jul 11, 2024

zuiderkwast left a comment

PingXie commented Jul 12, 2024

madolson commented Jul 12, 2024

madolson commented Jul 12, 2024 •

edited

Loading

PingXie commented Jul 12, 2024 •

edited

Loading

madolson commented Jul 12, 2024

madolson commented Jul 12, 2024

madolson left a comment

hpatro left a comment

PingXie commented Jul 15, 2024

Ensure only primary sender drives slot ownership updates #754

Ensure only primary sender drives slot ownership updates #754

Conversation

PingXie commented Jul 7, 2024 • edited by zuiderkwast Loading

codecov bot commented Jul 7, 2024 • edited Loading

Codecov Report

hpatro left a comment

Choose a reason for hiding this comment

zuiderkwast left a comment

Choose a reason for hiding this comment

PingXie commented Jul 10, 2024

PingXie commented Jul 10, 2024

zuiderkwast commented Jul 10, 2024

PingXie commented Jul 11, 2024 • edited Loading

PingXie commented Jul 11, 2024

zuiderkwast left a comment

Choose a reason for hiding this comment

PingXie commented Jul 12, 2024

madolson commented Jul 12, 2024

madolson commented Jul 12, 2024 • edited Loading

PingXie commented Jul 12, 2024 • edited Loading

madolson commented Jul 12, 2024

madolson commented Jul 12, 2024

madolson left a comment

Choose a reason for hiding this comment

hpatro left a comment

Choose a reason for hiding this comment

PingXie commented Jul 15, 2024

PingXie commented Jul 7, 2024 •

edited by zuiderkwast

Loading

codecov bot commented Jul 7, 2024 •

edited

Loading

PingXie commented Jul 11, 2024 •

edited

Loading

madolson commented Jul 12, 2024 •

edited

Loading

PingXie commented Jul 12, 2024 •

edited

Loading