
Speed up request buffered hashes #6318

Merged: 87 commits merged into main from emhane/speed-up-request-buffered-hashes, Feb 13, 2024

Conversation

@emhane (Member) commented Feb 1, 2024

Closes #6148. Closes #6308.

Speeds up requesting buffered hashes by:

- [x] dividing the hash store (unknown_hashes, buffered_hashes, and their metadata) into eth68 and eth66 parts. This means only hashes that can actually be included in the request being assembled in fill_request_from_buffer_for_peer are traversed (see the sketch after this list).

  • This improves performance and maintainability at the cost of allocating metadata for all hashes, even those seen only in eth66 announcements, which carry no metadata.
  • Use the transactions list on the Peer type to search buffered_hashes after pop_any_idle_peer returns. That cache has capacity 10 240 and the buffered hashes have capacity 25 600. With this we only need to search the buffered hashes themselves, not the nested fallback-peer lists inside them (even though those default to just 3 elements), for the peer returned by pop_any_idle_peer. How effective this is depends on how well we can update the transactions list on the Peer type on-op: we only update it when we need to touch it anyway, and we don't move elements around in a peer's seen-transactions list. It serves only as a hint about which hashes cannot be pending for the peer, not as an exact list of which hashes are pending. That fully satisfies the requirements; anything stricter doesn't scale.
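
To make the shape of this concrete, here is a minimal sketch under assumed, simplified types. The names below (TxHash, EthVersion, BufferedHashes, Peer, and the exact signature of fill_request_from_buffer_for_peer) are hypothetical stand-ins, not the actual reth definitions; the sketch only illustrates the two ideas: partition the pending hashes by announcement version, and use the peer's seen-transactions hint for membership instead of scanning each hash's nested fallback-peer list.

```rust
// Hypothetical, simplified types; not the actual reth implementation.
use std::collections::HashSet;

type TxHash = [u8; 32];

#[derive(Clone, Copy)]
enum EthVersion {
    Eth66,
    Eth68,
}

/// Hashes pending (re-)fetch, split by the version they were announced with.
#[derive(Default)]
struct BufferedHashes {
    eth66: HashSet<TxHash>,
    eth68: HashSet<TxHash>,
}

/// Per-peer state; `transactions` is the hint cache of hashes this peer has announced.
struct Peer {
    version: EthVersion,
    transactions: HashSet<TxHash>,
}

/// Assemble a request for an idle peer: only the partition matching the peer's
/// version is traversed, and the peer's hint cache answers "has this peer
/// announced the hash?" in O(1) instead of a scan of nested fallback-peer lists.
fn fill_request_from_buffer_for_peer(
    buffered: &BufferedHashes,
    peer: &Peer,
    soft_limit: usize,
) -> Vec<TxHash> {
    let partition = match peer.version {
        EthVersion::Eth66 => &buffered.eth66,
        EthVersion::Eth68 => &buffered.eth68,
    };
    partition
        .iter()
        .filter(|hash| peer.transactions.contains(*hash))
        .take(soft_limit)
        .copied()
        .collect()
}
```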

@emhane emhane marked this pull request as draft February 1, 2024 04:11
@mattsse (Collaborator) commented Feb 1, 2024

This is still very expensive because we blindly iterate all hashes in the hope that the peer is the fallback. Can we not keep track of the peer's hashes instead when registering them as fallback peers?

@emhane (Member Author) commented Feb 1, 2024

> This is still very expensive because we blindly iterate all hashes in the hope that the peer is the fallback. Can we not keep track of the peer's hashes instead when registering them as fallback peers?

Yes, this is step 2, as mentioned above. The peer's hashes are already tracked on the Peer type. Possibly we add a new list that is a subset of Peer { transactions, .. }. Two new lists, to be precise, since storing eth66 and eth68 hashes in the same list is counterproductive (a sketch of that shape follows below).
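
A rough illustration of that shape only, with hypothetical field names and plain HashSets standing in for whatever bounded cache the fetcher actually uses:

```rust
// Sketch only: hypothetical field names, std HashSet in place of a bounded
// LRU-style cache. Two version-specific subsets live alongside the existing
// seen-transactions hint, so per-version lookups never touch the other
// version's hashes.
use std::collections::HashSet;

type TxHash = [u8; 32];

struct Peer {
    /// all hashes this peer has announced (existing hint cache)
    transactions: HashSet<TxHash>,
    /// subset of `transactions` seen in eth66 announcements from this peer
    transactions_eth66: HashSet<TxHash>,
    /// subset of `transactions` seen in eth68 announcements from this peer
    transactions_eth68: HashSet<TxHash>,
}
```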

@emhane emhane marked this pull request as ready for review February 2, 2024 03:37
@emhane (Member Author) commented Feb 3, 2024

> This is still very expensive because we blindly iterate all hashes in the hope that the peer is the fallback. Can we not keep track of the peer's hashes instead when registering them as fallback peers?

In that case we don't even need to register any peer as a fallback peer: that would just duplicate data that can be derived by checking the peer's seen txns against the buffered hashes (buffered meaning buffered for re-fetch, or for a first fetch if the hash didn't fit in the request triggered by processing the announcement in which the hash was first seen), as sketched below.
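
Roughly, and with hypothetical names rather than the actual reth API, the derivation is just a membership check against state that already exists:

```rust
// Sketch with hypothetical names (not the actual reth API): whether an idle
// peer can serve a buffered hash is derived from state we already keep, so no
// separate fallback-peer registry is needed.
use std::collections::HashSet;

type TxHash = [u8; 32];

struct Peer {
    /// hint cache of hashes this peer has announced
    transactions: HashSet<TxHash>,
}

/// A hash is worth requesting from `peer` iff it is still buffered for (re-)fetch
/// and the peer has announced it.
fn can_serve(peer: &Peer, buffered_hashes: &HashSet<TxHash>, hash: &TxHash) -> bool {
    buffered_hashes.contains(hash) && peer.transactions.contains(hash)
}
```

Compared to a dedicated fallback-peer registry, this trades a cheap lookup at request-assembly time for not having to keep a second copy of the same information in sync.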

@emhane emhane added A-networking Related to networking in general A-devp2p Related to the Ethereum P2P protocol labels Feb 10, 2024
@emhane (Member Author) commented Feb 10, 2024

This is how the branch is looking on mainnet right now.

[Two screenshots of mainnet metrics, taken 2024-02-11 00:31]

This is a samply profile at commit 739d3c1:
https://share.firefox.dev/3STlLDb

Will update with a new samply profile from the latest commit later.

@Rjected (Member) left a comment

Will do a more comprehensive review soon, but I just saw these constant names and wanted to see if they could be made shorter

Comment on lines +139 to +154
```rust
/// Default soft limit for the number of hashes in a
/// [`GetPooledTransactions`](reth_eth_wire::GetPooledTransactions) request, when it is filled
/// from hashes pending fetch. Default is half of the
/// [`SOFT_LIMIT_COUNT_HASHES_IN_GET_POOLED_TRANSACTIONS_REQUEST`] which by spec is 256
/// hashes, so 128 hashes.
pub const DEFAULT_SOFT_LIMIT_COUNT_HASHES_IN_GET_POOLED_TRANSACTIONS_REQUEST_ON_FETCH_PENDING_HASHES:
    usize = SOFT_LIMIT_COUNT_HASHES_IN_GET_POOLED_TRANSACTIONS_REQUEST / 2;

/// Default soft limit for a [`PooledTransactions`](reth_eth_wire::PooledTransactions) response
/// when it's used as expected response in calibrating the filling of a
/// [`GetPooledTransactions`](reth_eth_wire::GetPooledTransactions) request, when the request
/// is filled from hashes pending fetch. Default is half of
/// [`DEFAULT_SOFT_LIMIT_BYTE_SIZE_POOLED_TRANSACTIONS_RESPONSE_ON_PACK_GET_POOLED_TRANSACTIONS_REQUEST`],
/// which defaults to 128 KiB, so 64 KiB.
pub const DEFAULT_SOFT_LIMIT_BYTE_SIZE_POOLED_TRANSACTIONS_RESPONSE_ON_FETCH_PENDING_HASHES:
    usize = DEFAULT_SOFT_LIMIT_BYTE_SIZE_POOLED_TRANSACTIONS_RESPONSE_ON_PACK_GET_POOLED_TRANSACTIONS_REQUEST / 2;
```
@Rjected (Member) commented:

I like the documentation on these, but these names are gigantic. Is there any way to make them more concise?

@emhane (Member Author) commented:

Yeah, I know they are, but not having enough info can cost a lot of time because the constants then get misinterpreted; we already had this with some other constants recently. Some of these will eventually become const functions anyway, and then we can assess the lengths again.

@emhane emhane requested review from onbjerg and mattsse February 13, 2024 18:17
@onbjerg (Collaborator) left a comment

lgtm

@mattsse (Collaborator) left a comment

Sharing @Rjected's view that some names are gigantic, but we can bikeshed that separately.

lgtm, nice work

@emhane emhane added this pull request to the merge queue Feb 13, 2024
Merged via the queue into main with commit c0f3d38 Feb 13, 2024
29 checks passed
@emhane emhane deleted the emhane/speed-up-request-buffered-hashes branch February 13, 2024 19:08
Labels
- A-devp2p: Related to the Ethereum P2P protocol
- A-networking: Related to networking in general
- A-tx-pool: Related to the transaction mempool
- C-perf: A change motivated by improving speed, memory usage or disk footprint
- M-changelog: This change should be included in the changelog

Development

Successfully merging this pull request may close these issues:

- Improve exit condition of pack_eth68
- versioned TxHash wrapper, type safety in TransactionFetcher

5 participants