Core time: Core Management (request_core_count) #2211

eskimor · 2023-11-07T17:56:06Z

The core time chain is managing cores for the relay chain. We have four variables to consider:

Total number of cores derived from (bulk + on-demand + legacy auction)
Number of legacy cores
Number of bulk cores
Number of on-demand cores

Any bulk core can become an on-demand core, simply by placing an indefinite (no end_hint) pool assignment on it. Thus it makes sense to unify the two: we will have a number of bulk cores, and the core time chain decides how many of them should be on-demand by sending the appropriate assignments. This reduces the above list to:

Total number of cores derived from (bulk + on-demand + legacy auction)
Number of legacy cores
Number of bulk cores

Restrictions

The total number of cores determines things like backing groups and is not supposed to change within a session. The relay chain needs to stay in control of when a change in the total number of cores happens.

Current State

Currently there is no bulk, just on-demand and legacy. The number of cores of each type is managed via relay chain configuration.

Desired State

Legacy cores, which are being phased out anyway, will continue to be managed by the relay chain. Thus we will have two types of cores: bulk + legacy. The number of legacy cores is transparent to the core time chain.

E.g. if we have 40 legacy cores, then bulk cores will start at core index 40 from the perspective of the relay chain. The core time chain does not need to be concerned with this at all: if there are, say, 10 bulk cores to start with, they will be indexed [0..9] from the perspective of the core time chain. The relay chain will do the offset calculation, adding the number of legacy cores to get the "correct" core number for assignments received from the core time chain.
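A minimal sketch of that offset calculation, assuming core indices are plain u16 values. The names `CoreIndex` and `to_relay_core_index` are made up for illustration and are not taken from the actual runtime code:

```rust
/// Core index type, assumed to be a u16 for this sketch.
type CoreIndex = u16;

/// Translate a core index as seen by the core time chain (bulk cores start at 0)
/// into the relay chain's numbering (bulk cores start after the legacy cores).
/// Returns `None` if the sum does not fit into the index type.
fn to_relay_core_index(
    coretime_index: CoreIndex,
    num_legacy_cores: CoreIndex,
) -> Option<CoreIndex> {
    // Hypothetical helper: relay index = number of legacy cores + core time index.
    num_legacy_cores.checked_add(coretime_index)
}
```

With 40 legacy cores, core time index 0 maps to relay core 40 and index 9 maps to relay core 49.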

On-demand/Instantaneous

We suggest that the core time chain has a configuration setting for the number of desired on-demand cores (the old configuration in the relay chain will thus be removed). The core time chain will then issue a normal assign_core message whenever that configuration changes. This allows for maximum flexibility: e.g. eventually this might not be a configuration, but be determined automatically based on demand for normal bulk cores.

Total number of (bulk) cores

As explained above, the total number of cores is only allowed to change at session boundaries. With this restriction, the interface as described in RFC-5:

fn request_core_count(
    count: u16,
)

and

fn notify_core_count(
    count: u16,
)

works, but the dependency on session buffering is hidden. It is important to realize that the response notify_core_count might take the relay chain up to almost two sessions to send back. The core time chain should be able to handle this long delay gracefully.
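To make the hidden session buffering concrete, here is a rough sketch of how the relay side could behave. It assumes that a requested change is scheduled for the session after next (current session + 2), which is what produces the delay of almost two sessions. The type `RelayCoreCount` and its methods are hypothetical and only model the message flow:

```rust
/// Hypothetical relay-side state, for illustration only.
struct RelayCoreCount {
    /// Bulk core count in effect for the current session.
    active: u16,
    /// Pending change: (session index at which it becomes active, new count).
    pending: Option<(u32, u16)>,
    /// Index of the current session.
    current_session: u32,
}

impl RelayCoreCount {
    fn new(active: u16) -> Self {
        Self { active, pending: None, current_session: 0 }
    }

    /// Handle a `request_core_count` message received during the current session.
    /// Assumption: the next session's parameters are already fixed, so the
    /// earliest session the change can apply to is the one after next.
    fn request_core_count(&mut self, count: u16) {
        self.pending = Some((self.current_session + 2, count));
    }

    /// Called at each session change. Returns `Some(count)` when a
    /// `notify_core_count(count)` message should be sent back.
    fn on_new_session(&mut self) -> Option<u16> {
        self.current_session += 1;
        match self.pending {
            Some((at, count)) if at <= self.current_session => {
                self.pending = None;
                if count != self.active {
                    self.active = count;
                    Some(count)
                } else {
                    // Nothing changed, so no notification is needed.
                    None
                }
            }
            _ => None,
        }
    }
}
```

In this model a request arriving early in session N only yields a notification at the start of session N + 2, so the core time chain must not assume a quick reply.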

It is worth mentioning that the core count discussed here is the number of bulk cores (including instantaneous cores designated via assignments).

Due to the asynchronicity of message passing, whenever the count changes, the number of cores available as seen by the core time chain and by the relay chain will be out of sync for a while. Consequences should be minor though, if some care is taken on reduction:

Reducing the number of cores

This is the more dangerous change. It is recommended to have native (provided by the system, not by assignment from a buyer) pool/instantaneous cores at the top (highest core numbers), as reducing those cores will have no negative impact on buyers.
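A small sketch of this layout, assuming the native pool cores occupy the highest indices of the bulk range. The helper names are invented for this example:

```rust
use std::ops::Range;

/// Indices (core time chain view) of the native pool cores, assuming they are
/// placed at the top of the bulk core range.
fn native_pool_cores(bulk_core_count: u16, native_pool_count: u16) -> Range<u16> {
    bulk_core_count.saturating_sub(native_pool_count)..bulk_core_count
}

/// Returns true if shrinking from `old_count` to `new_count` bulk cores only
/// removes native pool cores, i.e. no buyer-assigned core is affected.
fn reduction_affects_only_pool(old_count: u16, new_count: u16, native_pool_count: u16) -> bool {
    let pool = native_pool_cores(old_count, native_pool_count);
    // Cores `new_count..old_count` disappear; all of them must be pool cores.
    new_count >= pool.start
}
```

For example, with 10 bulk cores of which 3 are native pool cores (indices 7 to 9), reducing to 8 cores is harmless for buyers, while reducing to 6 would remove a core that may have been sold.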

Due to the above mentioned asynchronicity, it is theoretically possible for the relay chain (already operating at the reduced core count) to drop assignments it receives from the core time chain (still operating at the larger core count). This should not be a real concern though, as those assignments would have become void shortly afterwards anyway. To avoid negative side effects, it would be recommended to stop selling core assignments for a core long before removing it.

Increasing the number of cores

This is non-problematic, as the relay chain will always have updated its core count before the core time (broker) chain. Thus, in this case the core count on the relay chain will always be equal to or larger than the core time chain's view. Hence there is no risk of sent assignments being invalid due to asynchronicity.
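The two cases (reduction and increase) can be illustrated with a simple validity check on the relay side. This is only a sketch: the function names are hypothetical, and indices are in the core time chain's numbering, before any legacy offset is applied:

```rust
/// Assumed relay-side rule: an assignment is accepted only if its core index
/// is below the relay chain's current bulk core count.
fn relay_accepts_assignment(coretime_core_index: u16, relay_bulk_core_count: u16) -> bool {
    coretime_core_index < relay_bulk_core_count
}

/// Core indices the core time chain still considers valid, but which the
/// relay chain would drop while the two views are out of sync.
fn dropped_assignments(coretime_view: u16, relay_view: u16) -> Vec<u16> {
    (0..coretime_view)
        .filter(|&index| !relay_accepts_assignment(index, relay_view))
        .collect()
}
```

During a reduction from 10 to 8 cores, the relay chain already operates with 8 while the core time chain may still send assignments for cores 8 and 9, which get dropped. During an increase from 10 to 12, the relay chain's count is the larger one, so nothing is dropped.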

Implementation

core time: Have overall bulk core number configuration on the core time chain
core time: Send request_core_count messages to the relay chain whenever that configuration is changed.
relay: Buffer requested core count in session buffered configuration. On session change, send notify_core_count back whenever the core count changes. That configuration should not be possible to change other than via a request_core_count message coming from the core time chain.

Implementation Phase 2:

relay: Phase out legacy parachains (reducing core count on the relay chain)
core time: Increase bulk core count on the core time chain accordingly
relay: Once no legacy chains exist anymore, remove code and configuration. Only existing assignments will be "bulk" now.


joepetrowski · 2023-11-12T14:29:03Z

Due to the above mentioned asynchronicity, it is theoretically possible for the relay chain (already operating at the reduced core count) to drop assignments it receives from the core time chain (still operating at the larger core count). This should not be a real concern though, as those assignments would have become void shortly afterwards anyway. To avoid negative side effects, it would be recommended to stop selling core assignments for a core long before removing it.

Similar to how assets get trapped in XCM, perhaps when a core assignment is dropped, the Relay Chain could store a ticket (or send one back to the Coretime chain in a message). The chain that didn't get its block executed could then claim it for a new core assignment.

The price may have changed in the meantime, but this should be a pretty rare occurrence, and the buyer was willing to pay what they previously did.

BradleyOlson64 · 2023-11-13T20:18:09Z

It is important to realize that the response notify_core_count might take the relay chain up to almost two sessions to send back.

I see why it would take until at least the start of the next session, but why nearly two sessions?

eskimor · 2023-11-14T10:13:08Z

The next session must already be known in the previous session, for determinism. E.g. imagine that the parameters of the next session were free to change until the very last block of the previous session. If there is then a re-org/reversion, we might end up with two sessions for a given session index which actually differ. This is not sound. See also: #633

eskimor · 2024-04-03T09:50:08Z

Done.


eskimor added this to parachains team board Nov 7, 2023

eskimor converted this from a draft issue Nov 7, 2023

eskimor changed the title ~~Core Count Management~~ Core Management Nov 9, 2023

eskimor mentioned this issue Nov 9, 2023

Migrations to coretime and initial settings & system chains #2255

Closed

6 tasks

eskimor changed the title ~~Core Management~~ Core time: Core Management (request_core_count) Nov 15, 2023

eskimor moved this from Backlog to In Progress in parachains team board Dec 1, 2023

eskimor self-assigned this Dec 5, 2023

eskimor moved this from In Progress to Review in progress in parachains team board Dec 21, 2023

eskimor moved this from Review in progress to Completed in parachains team board Mar 19, 2024

eskimor closed this as completed Apr 3, 2024

bkontur pushed a commit that referenced this issue May 15, 2024

prune messages from confirmation tx, not from the on_idle (#2211)

d39881d

