for global shard 43.
And for shard 43 of the Status app (which has allocated index 16):

`subscribe("/waku2/xxx", 16, 43)`
What would the actual pubsub topic string look like?
Added in df6c44a
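For readers skimming this thread, here is a minimal sketch of how such a pubsub topic string could be composed from a cluster index and a shard number. The `/waku/2/rs/<cluster>/<shard>` format and the helper name are assumptions for illustration only; the authoritative string is whatever df6c44a added to the RFC.

```python
def static_shard_pubsub_topic(cluster_index: int, shard: int) -> str:
    """Build a pubsub topic string for a static shard (hypothetical format)."""
    if not (0 <= cluster_index < 2**16 and 0 <= shard < 64):
        raise ValueError("cluster index must fit in 16 bits, shard in 0..63")
    # Assumed layout: /waku/2/rs/<cluster index>/<shard number>
    return f"/waku/2/rs/{cluster_index}/{shard}"

# Global shard 43 (cluster index 0) and shard 43 of the Status cluster (index 16):
print(static_shard_pubsub_topic(0, 43))   # /waku/2/rs/0/43
print(static_shard_pubsub_topic(16, 43))  # /waku/2/rs/16/43
```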
content/docs/rfcs/51/README.md (Outdated)

| key | value |
|---|---|
| `relay-shard-0` | `0x0000100000000000` |
Being conscious of the size limit of the ENR, wouldn't `shard-0` be enough to describe the information?

| `relay-shard-0` | `0x0000100000000000` |
| `shard-0` | `0x0000100000000000` |
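As a side note on what a value like the one above encodes: below is a minimal sketch of packing a node's shard indices (0..63) into the 8-byte bitfield carried by such an ENR entry. The bit ordering (LSB = shard 0) and the key naming are assumptions for illustration; the RFC defines the actual encoding.

```python
# Sketch only: pack/unpack a set of shard indices into an 8-byte ENR bitfield.
def encode_shard_bitfield(shards: set[int]) -> bytes:
    bits = 0
    for shard in shards:
        if not 0 <= shard < 64:
            raise ValueError("shard index must be in 0..63")
        bits |= 1 << shard          # assumed ordering: LSB corresponds to shard 0
    return bits.to_bytes(8, "big")

def decode_shard_bitfield(field: bytes) -> set[int]:
    bits = int.from_bytes(field, "big")
    return {i for i in range(64) if bits & (1 << i)}

# Example: a node participating in shards 1 and 43 of one shard cluster.
field = encode_shard_bitfield({1, 43})
print(field.hex())                   # 0000080000000002
print(decode_shard_bitfield(field))  # {1, 43}
```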
Agreed. I used `relay-shard` because there might be other kinds of shards.
What about `rshard`?
Moved it to `rshard` in all occurrences in 4e4bc56.
Wdyt?
I wonder about the other types of shards. Relay is the backbone of the network, so a relay shard impacts all other protocols (store, light push, filter). That makes a relay shard the most impactful type of shard there can be in Waku.
However, `rshard` sounds good as a more cautious and future-proof name.
I thought about potential New Message Dissemination Methods
(currently the last item on vacp2p/research#154).
These would be an alternative to Waku Relay and would come with other trade-offs, e.g. better suited for 1:1 communication, or lower latency but weaker anonymity guarantees. These dissemination networks could be sharded, too.
(But this is in the farther future.)
Yes! Thanks. This LGTM as a raw spec and static sharding makes sense to me as an initial strategy for scaling.
Co-authored-by: Hanno Cornelius <[email protected]>
6066e8f to 123ea35
LGTM!
Nice! Left some comments.
A more generic question: what is the intention behind having both static sharding and automatic sharding? Do you plan for the network to support both, or is static sharding the most immediate scaling solution and automatic sharding the evolution of it?
Wondering if we should just have named sharding (which we currently support without modifying the code) and then aim directly for automatic sharding.
Thanks!
which allow application protocols to scale in the number of content topics.
This document also covers discovery of topic shards.

# Named Sharding
Wondering if "named sharding" is already covered here.
Yes. The document mentions this (in this section), along with the option (in the note) to merge RFC 23 here.
I put this into this RFC to consolidate sharding strategies into one RFC, and to categorize this approach as "named sharding", distinguishing it from the other strategies.
We could leave RFC 23 as an informational RFC discussing naming strategies, or merge it here and deprecate 23.
| 13 | reserved | |
| 14 | reserved | |
| 15 | reserved | |
| 16 | Status | Status main net |
Since Waku is permissionless, how do you enforce this? I mean, this could be an internal recommendation, but any app can send messages to the Status mainnet shard. So I am wondering about the impact it will have if people don't respect this.
Similar to the IANA process, there would be no enforcement.
Apps could, however, make their shards permissioned on the app layer.
An attacker who controls enough nodes can still overtake the shards,
or significantly increase load.
Part of this will be addressed by DoS mitigation research.
A shard cluster is either globally available to all apps (like the default pubsub topic),
specific for an app protocol,
or reserved for automatic sharding (see next section).
In total, there are $2^{16} \times 64 = 4194304$ shards for which Waku manages discovery.
Wondering if there is any rationale behind these numbers?
How is the mapping to pubsub topics done? Reading what's below, is it 1 shard per topic? Will we have 4194304 gossipsub topics?
64 shards per shard cluster is chosen to match the Eth ENR shard representation.
2^16 for the index is the next byte boundary after 2^8, which seemed too low and would not save significant space in the ENR.
(Also, 2^16 is the whole IANA port range, and ranges in this RFC match the IANA ranges.)
If there are strong arguments for other numbers, we can of course adjust while in the raw phase.
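To make the arithmetic explicit (my own recap of the numbers quoted above; the 8-byte figure is inferred from the example ENR value shown earlier in this thread):

```latex
% Total shards Waku manages discovery for, plus the per-cluster ENR bitfield size.
\[
  \underbrace{2^{16}}_{\text{shard clusters}} \times \underbrace{64}_{\text{shards per cluster}}
  = 65536 \times 64 = 4194304 \text{ shards},
  \qquad
  64~\text{bits} = 8~\text{bytes per cluster bitfield}.
\]
```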
> How is the mapping to pubsub topics done?

For static sharding: up to the app layer. The document states this.

> Reading what's below, is it 1 shard per topic?

One shard per pubsub topic, yes.

> Will we have 4194304 gossipsub topics?

Yes.
> Will we have 4194304 gossipsub topics?
>
> Yes.

Can gossipsub scale to this amount of topics?
(For a long time at least,) most topics/shards would not be used.
For a very large number of pubsub topics, we might have to adjust (limit) some of the control messages.
As long as the number of control messages (that cross pubsub topic boundaries) sent and received is < O(number of pubsub topics), it would be fine.
> Can gossipsub scale to this amount of topics?

This would be good to check with the libp2p team in terms of how this would work in practice. I can't find it now, but a long time ago I tried to have multiple pubsub topics (basically using content topics as pubsub topics) and there were problems with creating meshes for the pubsub topics. If a client is listening to these topics ahead of time it is probably fine, but it is something worth checking with network testing too. Maybe Nimbus knows of some potential gotchas here?
cc @Menduist re libp2p and @jm-clius re network testing (not sure who to ping re this)
Thank you for the feedback.
With automatic sharding, apps do not have to manage sharding.
Yes.
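To illustrate what "apps do not have to manage sharding" could mean in practice, here is a minimal sketch of automatic sharding that hashes a content topic into one of the shards reserved for it. The reserved cluster index, the use of sha256, and the helper names below are assumptions for illustration; the RFC fixes the actual scheme.

```python
import hashlib

SHARDS_PER_CLUSTER = 64
AUTOSHARDING_CLUSTER = 49152  # hypothetical cluster index reserved for automatic sharding

def auto_shard(content_topic: str) -> int:
    """Deterministically map a content topic to a shard number (illustrative only)."""
    digest = hashlib.sha256(content_topic.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % SHARDS_PER_CLUSTER

# The app only names its content topic; the shard (and hence pubsub topic) follows.
shard = auto_shard("/my-app/1/chat/proto")
print(AUTOSHARDING_CLUSTER, shard)
```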
Assigning content topics to specific shards is up to app protocols,
but the discovery of these shards is managed by Waku.

These shards are managed in an array of $2^{16}$ shard clusters.
Pull up these constants and define them above perhaps?
Thanks for this, quite thorough as a raw spec! I know it has been merged, but just a few comments.
This PR is part of the secure scaling roadmap and addresses vacp2p/research#160.
Here is how it integrates into the bigger picture:
Notes
The index allocation in the form of an informational RFC is a suggestion.
We could also opt to manage allocation in another document, or do allocation in another way.
We could also add/mention various levels of network segregation in this RFC (or in another document).
With this RFC, apps can have segregated shard clusters.
Imo, this level of segregation should be enough.
We could, however, additionally segregate the discv5 discovery network (at the cost of connectivity),
as well as completely segregate the gossipsub network (with the current version of the RFC, specific control messages are shared beyond shard boundaries).
cc @Menduist @rymnc @LNSD @fryorcraken @cammellos @corpetty