Enhance Leader Rotation Logic to Address Edge Cases in Leader Selection #4798

GheisMohammadi · 2024-11-12T02:59:29Z

Issue

This PR refines the leader rotation logic in NthNextValidator to handle previously unaddressed edge cases that could lead to unexpected behavior or repeated selection of the same leader. In the refactored function for leader rotation, several corner cases were addressed to ensure robustness in the selection of the next validator.

One critical improvement was handling cases where next is greater than zero. In the original code, if the list wraps around from the last validator back to the beginning, there was a risk that the first validator could be skipped entirely. This is especially problematic when the validator list is circular, as it could lead to unfair distribution in leader selection. The refactored function avoids this by introducing attempts as a counter that methodically cycles through all nodes in publicKeys, ensuring no validator is skipped due to wrap-around.

Another addressed case involves scenarios where next is zero, yet the pubKey of the current leader isn't found within publicKeys. In the original code, if the pubKey index was -1 (indicating it wasn’t present), this situation could cause a crash when trying to access an invalid index. The refactored version improves safety by first logging this absence as an error and, if next is zero, returning the first public key in the list as a fallback. This default ensures stability, allowing the function to handle missing keys gracefully without interrupting leader selection.

Finally, the revised code prevents repeatedly selecting the same leader when a different validator isn’t readily available. In the original function, if the loop failed to find a unique validator, it would repeatedly return the same pubKey, potentially leading to repetitive selection cycles. The refactored version adds a check to ensure that only a distinct validator is returned. Additionally, a limit on attempts guarantees that the function terminates if no unique validator is found after a full cycle, logging a warning as an indication of this fallback behavior. This effectively prevents inefficient leader selection and ensures smoother rotations.

This PR introduces changes to the leader rotation logic that require a hard fork (HF) for full network-wide consistency. Because this update impacts how validators are selected, implementing it without a hard fork would lead to differing leader rotations among nodes running different versions of the code, potentially causing consensus issues.

For now, the hard fork epoch has been set to "TBD" (to be determined) to allow further discussion and coordination among stakeholders. This placeholder will be updated once the team agrees on an appropriate epoch for deployment.

This PR addresses 3 corner cases mentioned in issue #4796

sophoah · 2024-11-19T07:57:14Z

@GheisMohammadi please check the travis failure

…on version 2

Frozen · 2024-11-21T01:54:07Z

New functionality contains 0 tests coverage.

sophoah · 2024-11-21T03:44:07Z

New functionality contains 0 tests coverage.

Good point though it's not a new feature. Is it possible to write test cases here cc @GheisMohammadi

…ow it skips some validators

GheisMohammadi · 2024-11-22T00:29:01Z

New functionality contains 0 tests coverage.

Good point though it's not a new feature. Is it possible to write test cases here cc @GheisMohammadi

I added tests for three edge cases to demonstrate how Leader Rotation v1 fails to handle them. Additionally, I created the same tests for Leader Rotation v2 to validate that it successfully addresses all three edge cases.

sophoah

Approving this one but we'll need a different iteration of NthNextValidatorV2 as discussed in the protocol team meeting

sophoah · 2024-11-22T06:49:04Z

@Frozen please review/comment/approve

GheisMohammadi self-assigned this Nov 12, 2024

sophoah requested a review from Frozen November 19, 2024 07:48

GheisMohammadi added 3 commits November 20, 2024 12:50

add new version of NthNextValidator to find next leader

db7f17c

add NthNextValidatorV2 to consensus and adjust hf epochs

5253e39

add NthNextValidatorV2 to decider, fix epoch number for leader rotati…

d5d8358

…on version 2

GheisMohammadi force-pushed the hf/leader_rotation_v2 branch from 83cb640 to d5d8358 Compare November 20, 2024 04:51

sophoah mentioned this pull request Nov 21, 2024

View change: Support for checking if validators belongs to the same key. #4802

Open

GheisMohammadi added 7 commits November 21, 2024 22:51

add test for NthNextValidatorV2

c22db0f

add a test for a failed edge case for NthNextValidator

e09262e

add a new test for a failed edge cases for NthNextValidator

781bae5

add a new test for a failed edge cases for NthNextValidator to show h…

d1ef361

…ow it skips some validators

Add test for NthNextValidatorV2 to validate handling of edge case 1

d498d36

Add test for NthNextValidatorV2 to validate handling of edge case 2

0263b49

Add test for NthNextValidatorV2 to validate handling of edge case 3

918e55e

fix quorom test format

d5f6f9a

sophoah approved these changes Nov 22, 2024

View reviewed changes

Frozen approved these changes Nov 27, 2024

View reviewed changes

sophoah merged commit 6e7b891 into dev Nov 27, 2024
4 checks passed

sophoah deleted the hf/leader_rotation_v2 branch November 27, 2024 04:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhance Leader Rotation Logic to Address Edge Cases in Leader Selection #4798

Enhance Leader Rotation Logic to Address Edge Cases in Leader Selection #4798

GheisMohammadi commented Nov 12, 2024 •

edited

Loading

sophoah commented Nov 19, 2024

Frozen commented Nov 21, 2024

sophoah commented Nov 21, 2024

GheisMohammadi commented Nov 22, 2024

sophoah left a comment

sophoah commented Nov 22, 2024

Enhance Leader Rotation Logic to Address Edge Cases in Leader Selection #4798

Enhance Leader Rotation Logic to Address Edge Cases in Leader Selection #4798

Conversation

GheisMohammadi commented Nov 12, 2024 • edited Loading

Issue

sophoah commented Nov 19, 2024

Frozen commented Nov 21, 2024

sophoah commented Nov 21, 2024

GheisMohammadi commented Nov 22, 2024

sophoah left a comment

Choose a reason for hiding this comment

sophoah commented Nov 22, 2024

GheisMohammadi commented Nov 12, 2024 •

edited

Loading