feat: use broadcast channel for event listeners #8193

fgimenez · 2024-05-10T07:41:07Z

EventListeners implements a multi producer multi consumer queue where each sent value is seen by all consumers.
To achieve this EventListeners allocates a std::Vec to be filled with tokio::sync::UnboundedSender every time EventListeners::new_listener is called.

As every value sent via EventListeners is cloned to each UnboundedReceiver and the channels are unbounded this is prone to unlimited memory growth and eventual OOM attacks.

To prevent this, in this PR tokio's tokio::sync::broadcast multi producer multi consumer queue is used instead.

For now the size of all the broadcast channels is set to 1000, would be good to measure how much is needed for each. Pending adding metrics as suggested in this comment #8193 (comment) will be done in a follow up.

emhane · 2024-05-10T17:42:09Z

check out these types we have

reth/crates/metrics/src/common/mpsc.rs

Lines 36 to 43 in ef01d50

    
           /// A wrapper type around [UnboundedSender](mpsc::UnboundedSender) that updates metrics on send. 
        
           #[derive(Debug)] 
        
           pub struct UnboundedMeteredSender<T> { 
        
               /// The [UnboundedSender](mpsc::UnboundedSender) that this wraps around 
        
               sender: mpsc::UnboundedSender<T>, 
        
               /// Holds metrics for this type 
        
               metrics: MeteredSenderMetrics, 
        
           }

reth/crates/metrics/src/common/mpsc.rs

Lines 75 to 82 in ef01d50

    
           /// A wrapper type around [Receiver](mpsc::UnboundedReceiver) that updates metrics on receive. 
        
           #[derive(Debug)] 
        
           pub struct UnboundedMeteredReceiver<T> { 
        
               /// The [Sender](mpsc::Sender) that this wraps around 
        
               receiver: mpsc::UnboundedReceiver<T>, 
        
               /// Holds metrics for this type 
        
               metrics: MeteredReceiverMetrics, 
        
           }

we use them so far only for the channel NetworkManager->TransactionsManager for incoming transaction gossip, but I'd like to see more of them in the codebase.

we have a panel for observing this channel
https://reth.paradigm.xyz/d/d47d679c-c3b8-40b6-852d-cbfaa2dcdb37/reth---transaction-pool?orgId=1&refresh=30s&viewPanel=95

fgimenez · 2024-05-10T18:38:01Z

@emhane awesome thx! will check how to include something similar for the broadcast channels, the metrics can be very useful to assign the proper size to each

emhane

I see, you don't need a new BeaconEngineMessage to subscribe to the broadcast stream because you clone the sender and pass it to the handle, neat

emhane · 2024-05-20T08:05:56Z

crates/node/events/src/node.rs

+/// Transforms a stream of `Result<T, BroadcastStreamRecvError>` into a stream of `NodeEvent`,
+/// applying a uniform error handling and conversion strategy.
+pub fn handle_broadcast_stream<T>(
+    stream: impl Stream<Item = Result<T, BroadcastStreamRecvError>> + Unpin,
+) -> impl Stream<Item = NodeEvent> + Unpin
+where
+    T: Into<NodeEvent>,
+{
+    stream.map(|result_event| {
+        result_event
+            .map(Into::into)
+            .unwrap_or_else(|err| NodeEvent::Other(format!("Stream error: {:?}", err)))
+    })
+}
+


how about implementing FromIterator here, that will work I think

how about implementing FromIterator here, that will work I think

the map is provided by streamext, and streams are not (sync) iterators, so I'm not sure Fromiterator is the right fit here

https://docs.rs/tokio-stream/latest/tokio_stream/trait.FromStream.html

rip, would be nice but it's sealed, hopefully soon ™️ in stable

crates/rpc/rpc-builder/tests/it/utils.rs

crates/tokio-util/src/event_listeners.rs

crates/net/network/src/transactions/mod.rs

crates/net/network/src/network.rs

crates/consensus/beacon/src/engine/handle.rs

crates/tokio-util/src/event_listeners.rs

mattsse

overall I think this great.
I don't think we'll have any issues with this for engine/pipeline events because those basically just for reporting and it's fine to drop some.

My main concern is the network transaction task which is more likely to drop messages but it relies on networkevents for peer tracking for example.
although 1k messages should be fine, I'd feel more comfortable if we could bump the default capacity to 2k and add a metric for when we lag in the tx task. maybe we should emit peer added/removed separately, but we should still proceed with this.

I'd also like a new function/stream variant that does not return results but rather skips the lag error, this would make the API easier in some places, ref

reth/crates/storage/provider/src/traits/chain.rs

Lines 37 to 43 in cb658ca

    
           /// A Stream of [CanonStateNotification]. 
        
           #[derive(Debug)] 
        
           #[pin_project::pin_project] 
        
           pub struct CanonStateNotificationStream { 
        
               #[pin] 
        
               st: BroadcastStream<CanonStateNotification>, 
        
           }

we could move this stream type to our tokio util crate

we also need this for the txpool channels which is mostlikely the most critical part because exposed over RPC.

mattsse · 2024-05-21T10:27:25Z

crates/net/network/src/transactions/mod.rs

@@ -197,7 +199,7 @@ pub struct TransactionsManager<Pool> {
    /// Subscriptions to all network related events.
    ///
    /// From which we get all new incoming transaction related messages.
-    network_events: UnboundedReceiverStream<NetworkEvent>,
+    network_events: BroadcastStream<NetworkEvent>,


I'm slightly concerned about this, because now we're no longer guaranteed delivery of all network events which can result in wrong peer tracking, for example session closed, although 1000 messages should be sufficient

I think dropping NetworkEvent::SessionEstablished and NetworkEvent::PeerAdded is recoverable, but not sure if dropping NetworkEvent::SessionClosed and NetworkEvent::PeerRemoved can lead to memory leak. depends on if all data structures that are updated accordingly are bounded.

crates/tokio-util/src/event_listeners.rs

Co-authored-by: Emilia Hane <[email protected]>

crates/tokio-util/src/event_listeners.rs

crates/net/network/src/network.rs

fgimenez · 2024-05-22T10:57:41Z

I'd also like a new function/stream variant that does not return results but rather skips the lag error,

makes total sense, done ptal

…ndle event listener on constructor

mattsse · 2024-05-22T17:19:39Z

this is great!
broadcast is def better for this

Co-authored-by: Emilia Hane <[email protected]>

fgimenez force-pushed the fgimenez/event-listeners-broadcast-channel branch from 80f3deb to 9d5d81d Compare May 10, 2024 15:44

emhane added the C-security Issue or pull request related to security. label May 10, 2024

emhane added the A-networking Related to networking in general label May 10, 2024

fgimenez force-pushed the fgimenez/event-listeners-broadcast-channel branch from a6f2112 to 16b61b9 Compare May 13, 2024 09:13

fgimenez changed the title ~~WIP feat: use broadcast channel for event listeners~~ feat: use broadcast channel for event listeners May 13, 2024

fgimenez marked this pull request as ready for review May 13, 2024 16:11

fgimenez requested review from joshieDo, shekhirin, onbjerg, rkrasiuk, mattsse, Rjected, emhane and gakonst as code owners May 13, 2024 16:11

fgimenez force-pushed the fgimenez/event-listeners-broadcast-channel branch from a55828b to 06ff479 Compare May 13, 2024 16:58

fgimenez mentioned this pull request May 14, 2024

WIP feat: bounded consensus events channel #8251

Closed

emhane requested changes May 20, 2024

View reviewed changes

fgimenez requested a review from rakita as a code owner May 20, 2024 17:17

fgimenez force-pushed the fgimenez/event-listeners-broadcast-channel branch from 5da034a to 77df31b Compare May 20, 2024 17:36

fgimenez requested a review from emhane May 21, 2024 08:26

mattsse requested changes May 21, 2024

View reviewed changes

emhane approved these changes May 21, 2024

View reviewed changes

crates/tokio-util/src/event_listeners.rs Outdated Show resolved Hide resolved

fgimenez added 6 commits May 22, 2024 09:16

feat: use broadcast channel for event listeners

e8be997

move event_listeners from manager to network

b62e3a3

updated transactions

6623c1b

update pruner

99c35a4

EventNotifier moved to tokio-util

e389289

configurable broadcast channel size and default

3ec087d

fgimenez and others added 12 commits May 22, 2024 09:16

Apply suggestions from code review

a53f942

Co-authored-by: Emilia Hane <[email protected]>

clippy

e86ec97

use EventListeners in network manager

cfdc856

remove unused method

891d73c

derive Clone

ebafb68

log network event receiving error

1a4d803

eprintln -> error

d52e0eb

panic -> error

5dd690a

do not handle broadcast send Ok result with 0 listeners

23c2d6e

Update crates/tokio-util/src/event_listeners.rs

64b00dc

Co-authored-by: Emilia Hane <[email protected]>

fmt

064c90f

bump default broadcast channel size to 2000

c400887

fgimenez force-pushed the fgimenez/event-listeners-broadcast-channel branch from 4f7d39d to c400887 Compare May 22, 2024 07:16

fgimenez added 2 commits May 22, 2024 11:16

add EventStream

e30e0cf

use EventStream in EventListeners

40180b8

mattsse requested changes May 22, 2024

View reviewed changes

crates/tokio-util/src/event_listeners.rs Outdated Show resolved Hide resolved

crates/net/network/src/network.rs Outdated Show resolved Hide resolved

crates/net/network/src/network.rs Outdated Show resolved Hide resolved

BroadcastStream -> EventStream

8efc1a7

fgimenez added 2 commits May 22, 2024 13:23

remove NetworkHandleMessage::EventListener variant and set network ha…

7bb9960

…ndle event listener on constructor

EventListeners -> EventSender

37f24d5

mattsse approved these changes May 22, 2024

View reviewed changes

fgimenez added this pull request to the merge queue May 22, 2024

Merged via the queue into main with commit d0386b8 May 22, 2024
30 checks passed

fgimenez deleted the fgimenez/event-listeners-broadcast-channel branch May 22, 2024 17:50

fgimenez restored the fgimenez/event-listeners-broadcast-channel branch May 22, 2024 18:42

fgimenez deleted the fgimenez/event-listeners-broadcast-channel branch May 22, 2024 18:44

Rjected pushed a commit that referenced this pull request May 23, 2024

feat: use broadcast channel for event listeners (#8193)

8dbaa46

Co-authored-by: Emilia Hane <[email protected]>

mw2000 pushed a commit to mw2000/reth that referenced this pull request Jun 5, 2024

feat: use broadcast channel for event listeners (paradigmxyz#8193)

fdd183e

Co-authored-by: Emilia Hane <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: use broadcast channel for event listeners #8193

feat: use broadcast channel for event listeners #8193

fgimenez commented May 10, 2024 •

edited

Loading

emhane commented May 10, 2024

fgimenez commented May 10, 2024 •

edited

Loading

emhane left a comment

emhane May 20, 2024

Rjected May 20, 2024 •

edited

Loading

emhane May 21, 2024

Rjected May 21, 2024

mattsse left a comment

mattsse May 21, 2024

emhane May 21, 2024

fgimenez commented May 22, 2024

mattsse commented May 22, 2024

	/// A Stream of [CanonStateNotification].
	#[derive(Debug)]
	#[pin_project::pin_project]
	pub struct CanonStateNotificationStream {
	#[pin]
	st: BroadcastStream<CanonStateNotification>,
	}

feat: use broadcast channel for event listeners #8193

feat: use broadcast channel for event listeners #8193

Conversation

fgimenez commented May 10, 2024 • edited Loading

emhane commented May 10, 2024

fgimenez commented May 10, 2024 • edited Loading

emhane left a comment

Choose a reason for hiding this comment

emhane May 20, 2024

Choose a reason for hiding this comment

Rjected May 20, 2024 • edited Loading

Choose a reason for hiding this comment

emhane May 21, 2024

Choose a reason for hiding this comment

Rjected May 21, 2024

Choose a reason for hiding this comment

mattsse left a comment

Choose a reason for hiding this comment

mattsse May 21, 2024

Choose a reason for hiding this comment

emhane May 21, 2024

Choose a reason for hiding this comment

fgimenez commented May 22, 2024

mattsse commented May 22, 2024

fgimenez commented May 10, 2024 •

edited

Loading

fgimenez commented May 10, 2024 •

edited

Loading

Rjected May 20, 2024 •

edited

Loading