
Miner changes for time-based tenure extends #5493

Open · wants to merge 43 commits into base: feat/time-based-tenure-extend
Conversation

@obycode obycode commented Nov 21, 2024

See #5468

This currently also includes #5452, since it has changes that would conflict. Once that is merged into develop, I will merge develop into feat/time-based-tenure-extend and then clean this branch up.

With this change, the signer will accept a tenure extend from miner N-1
when miner N wins a sortition but commits to the wrong parent tenure.
This is useful when checking the behavior during forking. The previous
design, which used a global singleton, caused trouble in testing when
multiple miners run in different threads of the same process.

The `SignerDBListener` struct runs in a new thread that continuously
processes StackerDB messages from the signers during a mining tenure.
`SignerCoordinator` is the interface that the miner uses with the
`SignerDBListener` to propose a block and wait for signatures.

hstove commented Nov 22, 2024

Should the `signerdb_listener` module be called `stackerdb_listener` instead?


obycode commented Nov 22, 2024

> Should the `signerdb_listener` module be called `stackerdb_listener` instead?

Oh, right. It's signer messages in the stacker db. I guess the signerdb is another thing. Oops. Thanks!

@obycode obycode marked this pull request as ready for review November 26, 2024 02:43
@obycode obycode requested review from a team as code owners November 26, 2024 02:43

obycode commented Nov 26, 2024

This is marked "ready for review" since it should now 🤞 pass all of the existing integration tests, so it's ready for a first look. The last piece is to enable the actual usage of the tenure extend now that all the pieces are in place.


obycode commented Nov 26, 2024

Forgot to add a comment, but with the last commits, the tenure extend is functional! Still needs more testing and a review of failures in existing tests.

/// Tracks signatures for blocks
/// - key: Sha512Trunc256Sum (signer signature hash)
/// - value: BlockStatus
blocks: Arc<(Mutex<HashMap<Sha512Trunc256Sum, BlockStatus>>, Condvar)>,
@jcnelson (Member) commented Nov 26, 2024


It seems like you have a low-lift opportunity for a future-proofed refactoring here.

In other places, if struct Foo needs to share its state with other threads, it has a corresponding FooComms struct which contains all of these Arc<Mutex<..>>-wrapped data. The FooComms struct then exposes getters and setters for the wrapped data, so the caller doesn't need to know how to deal with whatever concurrency primitives are used to access or mutate the Foo instance's data (e.g. locking the mutex or dereferencing the Arc the right way). The Foo implementation has a method for instantiating a FooComms.

It seems like you could have a StackerDBSessionComms struct which contained blocks and signer_idle_timestamps, and gets instantiated from the StackerDBSession.
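A minimal sketch of the "Comms" pattern described above, assuming hypothetical names (`ListenerComms`, `BlockStatus`, `set_status`, `get_status`) rather than the PR's actual types: the `Arc<(Mutex<..>, Condvar)>` stays private, and callers only see business-level getters and setters.

```rust
use std::collections::HashMap;
use std::sync::{Arc, Condvar, Mutex};

// Illustrative stand-in for the PR's block-status enum.
#[derive(Clone, Debug, PartialEq)]
pub enum BlockStatus {
    Pending,
    Accepted,
    Rejected,
}

// The "Comms" struct: owns the shared state, hides the primitives.
#[derive(Clone)]
pub struct ListenerComms {
    blocks: Arc<(Mutex<HashMap<String, BlockStatus>>, Condvar)>,
}

impl ListenerComms {
    pub fn new() -> Self {
        Self {
            blocks: Arc::new((Mutex::new(HashMap::new()), Condvar::new())),
        }
    }

    // Setter: record a status change and wake any waiting threads.
    pub fn set_status(&self, block_hash: &str, status: BlockStatus) {
        let (lock, cvar) = &*self.blocks;
        let mut blocks = lock.lock().expect("FATAL: poisoned lock");
        blocks.insert(block_hash.to_string(), status);
        cvar.notify_all();
    }

    // Getter: the caller never touches the mutex or condvar directly.
    pub fn get_status(&self, block_hash: &str) -> Option<BlockStatus> {
        let (lock, _cvar) = &*self.blocks;
        let blocks = lock.lock().expect("FATAL: poisoned lock");
        blocks.get(block_hash).cloned()
    }
}

fn main() {
    let comms = ListenerComms::new();
    comms.set_status("abc123", BlockStatus::Pending);
    comms.set_status("abc123", BlockStatus::Accepted);
    assert_eq!(comms.get_status("abc123"), Some(BlockStatus::Accepted));
    println!("ok");
}
```

Because the struct derives `Clone` and the state is `Arc`-wrapped, the owning thread can hand out cheap clones to other threads while all of them observe the same map.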

ChainstateError::MinerAborted
})?;

sc.listener_thread = Some(listener_thread);

One of these days, we need to centralize all the singleton thread-creation that happens in the node.

) -> Result<Vec<MessageSignature>, NakamotoNodeError> {
// Add this block to the block status map.
// Create a scope to drop the lock on the block status map.
{

This scoped locking trick is something we should help callers avoid.
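For illustration, here is the scoped-locking trick in isolation and the wrapped alternative the comment argues for. `Registry` and its methods are hypothetical names, not code from this PR.

```rust
use std::sync::Mutex;

// Hypothetical shared collection guarded by a mutex.
struct Registry {
    items: Mutex<Vec<u64>>,
}

impl Registry {
    // The caller-side trick: an explicit block scope so the MutexGuard
    // is dropped (and the lock released) before the slow work below.
    fn insert_then_work_inline(&self, v: u64) {
        {
            let mut items = self.items.lock().expect("FATAL: poisoned");
            items.push(v);
        } // guard dropped here; lock released
        // ... long-running work that must not hold the lock ...
    }

    // The wrapped alternative: the guard's lifetime is confined to this
    // method, so callers never need to reason about lock scope at all.
    fn insert(&self, v: u64) {
        self.items.lock().expect("FATAL: poisoned").push(v);
    }

    fn len(&self) -> usize {
        self.items.lock().expect("FATAL: poisoned").len()
    }
}

fn main() {
    let r = Registry { items: Mutex::new(Vec::new()) };
    r.insert_then_work_inline(1);
    r.insert(2);
    assert_eq!(r.len(), 2);
    println!("ok");
}
```

Both forms are correct; the second moves the lock-scope decision out of the business logic and into one place.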

let mut blocks = lock.lock().expect("FATAL: failed to lock block status");

loop {
let (guard, timeout_result) = cvar

This is another example of something we should help callers avoid. Low-level synchronization like waiting on a condition variable can be hidden within a Comms struct.
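A sketch of what hiding the condition-variable wait inside a Comms struct could look like, under assumed names (`Comms`, `wait_for_signal`, `signal`): the caller states intent, while the lock/`wait_timeout` loop and spurious-wakeup handling stay internal.

```rust
use std::sync::{Arc, Condvar, Mutex};
use std::thread;
use std::time::Duration;

// Hypothetical Comms struct wrapping a boolean "done" flag.
#[derive(Clone)]
struct Comms {
    inner: Arc<(Mutex<bool>, Condvar)>,
}

impl Comms {
    fn new() -> Self {
        Self {
            inner: Arc::new((Mutex::new(false), Condvar::new())),
        }
    }

    // Business-level API: block until signaled or the timeout elapses.
    // Returns whether the signal arrived. The wait loop guards against
    // spurious wakeups by re-checking the predicate.
    fn wait_for_signal(&self, timeout: Duration) -> bool {
        let (lock, cvar) = &*self.inner;
        let mut done = lock.lock().expect("FATAL: poisoned");
        while !*done {
            let (guard, result) = cvar
                .wait_timeout(done, timeout)
                .expect("FATAL: poisoned");
            done = guard;
            if result.timed_out() {
                return *done;
            }
        }
        true
    }

    // Called from another thread to wake any waiters.
    fn signal(&self) {
        let (lock, cvar) = &*self.inner;
        *lock.lock().expect("FATAL: poisoned") = true;
        cvar.notify_all();
    }
}

fn main() {
    let comms = Comms::new();
    let c2 = comms.clone();
    let handle = thread::spawn(move || {
        thread::sleep(Duration::from_millis(50));
        c2.signal();
    });
    assert!(comms.wait_for_signal(Duration::from_secs(5)));
    handle.join().unwrap();
    println!("ok");
}
```

With this shape, the signature-gathering code would call something like `wait_for_signal` and never see the condvar at all.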

} = accepted;
let tenure_extend_timestamp = response_data.tenure_extend_timestamp;

let (lock, cvar) = &*self.blocks;

This is something a Comms struct could hide from the caller.

);
}
SignerMessageV0::BlockResponse(BlockResponse::Rejected(rejected_data)) => {
let (lock, cvar) = &*self.blocks;

Same here -- this needlessly marries the synchronization implementation (something that can be hidden) to the business logic.

@jcnelson jcnelson left a comment


My biggest feedback here is that the code mixes a lot of low-level thread synchronization code into the business logic, which I think is something we should strive to avoid. The synchronization logic may change down the road depending on what other threads need to interact with signature-gathering, so we should do what we do elsewhere and wrap all of the thread synchronization / state-sharing behind a "Comms" struct, and provide methods there that better reflect the business logic's needs.


obycode commented Nov 27, 2024

Thanks for that suggestion @jcnelson. I did that refactoring in cef0dd4. Let me know if that seems like the right level of abstraction now.


obycode commented Nov 27, 2024

Note to reviewers - there is an integration test for this behavior in #5471.

4 participants