Allow users to opt out of the `NetworkBehaviourEventProcess` mechanism #1630

thomaseizinger · 2020-06-26T05:21:16Z

Putting this up as a PoC to spark discussion. Happy to change the name of the attribute if someone can come up with something better 😅

The idea is the following:

Previously, we used the libp2p::Swarm by putting all dependencies that need to react to events from the network into the top-level NetworkBehaviour and put the relevant code into the inject_event callbacks.

I've always found this a bit awkward. Having to put things into the NetworkBehaviour even though they are not really part of the "network stack" of an application doesn't feel "right".
So I've been thinking about an alternative approach and came up with this:

Basically, the Swarm is part of a bigger loop and events are bubbled all the way and distributed to other components in the application.

async fn main() {
	let mut swarm = unimplemented!("build swarm from network behaviour");
	let mut database = unimplemented!("construct database");	

	loop {
		match swarm.next().await {
			BehaviourOutEvent::Ping(message) => {
				database.save_message(message);
			}
		}
	}
}

If the main components of the application are structured around such a polling mechanism, this works really nicely. All components can just be mutable and distribute events to each other without any further locking mechanism.

This PR extends the custom derive to make this usage easier by not requiring the implementation of NetworkBehaviourEventProcess but instead, converts every event into the specificed OutEvent and bubbles it up all the way.
This is already possible by simply saving all events inside the inject_event function and having a custom poll function that emits them again. This PR just makes this pattern easier to use.

Thoughts?
Is there a reason I am missing that makes this usage of rust-libp2p a really bad idea?

The only thing I could think of is starvation of the loop if there is little network activity but I think that can be solved in an acceptable way by using a timeout on the swarm.next() future.

koivunej · 2020-06-26T14:14:58Z

This matches what I've been thinking about for rust-ipfs quite well. I was planning to try just implementing NetworkBehaviour directly but of course it would nicer if this way was supported, perhaps as an off-by-default option in the derive?

The only thing I could think of is starvation of the loop if there is little network activity but I think that can be solved in an acceptable way by using a timeout on the swarm.next() future.

Related to this we have a ... bit ... ugly way of looping around polling everything, including the swarm in non-async context. If one were to use async fn instead, I think this could be worked around with tokio::select!, StreamExt::select_next_some and so on? This was attempted quite a long time ago, can't remember what all issues there were.

thomaseizinger · 2020-06-27T00:12:26Z

The problem we ran into with select! is that you get mutability borrow checker issues if you want to use the Swarm in any other select! branch because the future returned by next still holds a mutable borrow of the Swarm.

I am not sure what a really good solution to this looks like.

One I was thinking of is:

Select all the futures in the loop with the select_all function and drop all the non-completed ones after one resolved. This would solve the mutability borrow checker errors but only works if the futures are safe to drop without loosing progress.
For the swarm, that is - I think - true because it just polls the swarm internally. Actually, the fact that the mutable reference is still active shows that the underlying struct is actively modified and the future doesn't hold any important state internally.
All the other components you are waiting on in the loop need to satisfy this constraint as well, then you should be fine.

tomaka · 2020-06-29T08:08:44Z

The problem we ran into with select! is that you get mutability borrow checker issues if you want to use the Swarm in any other select! branch because the future returned by next still holds a mutable borrow of the Swarm.

I don't know about tokio::select!, but in futures::select! that has now been fixed!
I've been using select! quite successfully for this exact situation in other projects.

tomaka

The PR looks good.
I'm not a fan of the attribute name "bubble_up_events", but I can't come up with anything better.

thomaseizinger · 2020-06-30T00:45:06Z

The problem we ran into with select! is that you get mutability borrow checker issues if you want to use the Swarm in any other select! branch because the future returned by next still holds a mutable borrow of the Swarm.

I don't know about tokio::select!, but in futures::select! that has now been fixed!
I've been using select! quite successfully for this exact situation in other projects.

Maybe I misunderstood the problem then, I thought that could be design not be fixable 🤔

We had something along these lines:

let mut swarm = todo!();
let mut some_other_component = todo!();

loop {
    futures::select! {
        // swarm.next requires &mut self
        swarm_event = swarm.next() => {
            dbg!(swarm_event);
        }
        other_event = some_other_component.next() => {
            // `react_to_event` also requires &mut self
            swarm.react_to_event(other_event) // mutability lifetime error here because of 2nd mutable binding to `swarm`
        }
    }
}

My understanding is, that you can't fix that unless you drop the future returned from swarm.next().

I've been using select! quite successfully for this exact situation in other projects.

I am very eager to learn more about this! Can you point to some code please?

I'm not a fan of the attribute name "bubble_up_events", but I can't come up with anything better.

I will try and think of something better!
I had put this together quickly as a PoC so I didn't want to waste time on naming things :D

tomaka · 2020-06-30T08:55:19Z

My understanding is, that you can't fix that unless you drop the future returned from swarm.next().

That is true, but the macro does make sure that this future gets dropped if you enter the other_event = some_other_component.next() block.

thomaseizinger · 2020-06-30T23:34:18Z

My understanding is, that you can't fix that unless you drop the future returned from swarm.next().

That is true, but the macro does make sure that this future gets dropped if you enter the other_event = some_other_component.next() block.

That is interesting, I will have to play around with this a bit more then to see if I can get it working. Thanks!

By the way, I've thought of some different names for the flag introduced in this PR:

forward_events = true: Because we are forwarding events to the owner of the swarm.
event_process = false: Because we are disabling the local event processing.

We could also introduce an enum parameter "events" (or "handle_events"):

#[behaviour(events = "process_locally")
#[behaviour(events = "forward_to_swarm")

thomaseizinger · 2020-07-02T02:04:22Z

@tomaka I rolled with #[behaviour(event_process = false)] now and also had a crack at updating the documentation accordingly.

I tried to correctly set inter-doc links and the only way I got it working was through the trait.XYZ.html notation for both, the libp2p crate and the libp2p_swarm crate.
I hope that is fine :)

swarm/CHANGELOG.md

romanb

Ad intra-doc-links: Is the problem simply that core-derive has no dependency on libp2p-swarm? If so, am I the only one thinking that even if it is technically not necessary to have a libp2p-swarm dependency in order for it to compile, core-derive should declare one since it provides a macro for libp2p-swarm and depends on a specific API version of that crate? I would then hope that intra-rustdoc-links work and also the included test(s) run against the version specified as a dependency and so check for compatibility.

Ad bubbling up events: Another attribute name suggestion would be event_delegate, defaulting to false. Essentially !delegate = process. I have no strong preferences there though.

thomaseizinger · 2020-07-05T08:06:25Z

Ad intra-doc-links: Is the problem simply that core-derive has no dependency on libp2p-swarm? If so, am I the only one thinking that even if it is technically not necessary to have a libp2p-swarm dependency in order for it to compile, core-derive should declare one since it provides a macro for libp2p-swarm and depends on a specific API version of that crate? I would then hope that intra-rustdoc-links work and also the included test(s) run against the version specified as a dependency and so check for compatibility.

Not quite. The problem I was referring to was that I couldn't make intra-doc links in the form of "crate::NetworkBehaviour" work for both, the libp2p-swarm crate and the libp2p crate.
The problem was that because libp2p-swarm is re-exported in libp2p, there wouldn't be one valid identifier for the types (at least I couldn't make it work).
Using links to the html files works in both cases, regardless of which doc you are browsing :)

Ad bubbling up events: Another attribute name suggestion would be event_delegate, defaulting to false. Essentially !delegate = process. I have no strong preferences there though.

I am not attached to event_process. It does align nicely with the name of the NetworkBehaviourEventProcess trait though :)

romanb · 2020-07-06T08:41:35Z

Not quite. The problem I was referring to was that I couldn't make intra-doc links in the form of "crate::NetworkBehaviour" work for both, the libp2p-swarm crate and the libp2p crate. [..]

That sounds a bit like rust-lang/rust#65983 (context rust-lang/rust#43466). I personally would prefer we stick to intra-rustdoc-links, even if there are still issues, to avoid mixing different styles of links in this project and because with the other type of links we never get any warning about dead links.

thomaseizinger · 2020-07-07T00:58:26Z

Not quite. The problem I was referring to was that I couldn't make intra-doc links in the form of "crate::NetworkBehaviour" work for both, the libp2p-swarm crate and the libp2p crate. [..]

That sounds a bit like rust-lang/rust#65983 (context rust-lang/rust#43466). I personally would prefer we stick to intra-rustdoc-links, even if there are still issues, to avoid mixing different styles of links in this project and because with the other type of links we never get any warning about dead links.

I am happy to change them. In which docs should they work, libp2p or libp2p-swarm?

romanb · 2020-07-07T08:07:08Z

Not quite. The problem I was referring to was that I couldn't make intra-doc links in the form of "crate::NetworkBehaviour" work for both, the libp2p-swarm crate and the libp2p crate. [..]

That sounds a bit like rust-lang/rust#65983 (context rust-lang/rust#43466). I personally would prefer we stick to intra-rustdoc-links, even if there are still issues, to avoid mixing different styles of links in this project and because with the other type of links we never get any warning about dead links.

I am happy to change them. In which docs should they work, libp2p or libp2p-swarm?

If rust-lang/rust#65983 is the cause, then libp2p-swarm, since the other will then eventually be fixed.

thomaseizinger · 2020-07-08T04:22:08Z

@romanb Updated! All the docs I touched use intra-doc links now :)

romanb · 2020-07-08T08:47:37Z

Since this seems to be a backward-compatible addition, I'm inclined to cut a release of libp2p-core-derive upon merge. Let me know if there are objections.

tomaka approved these changes Jun 29, 2020

View reviewed changes

thomaseizinger mentioned this pull request Jul 2, 2020

RequestResponse updates #1639

Closed

thomaseizinger force-pushed the bubble-up-events branch from 88c16fd to f8ea37e Compare July 2, 2020 02:01

thomaseizinger changed the title ~~Allow derived NetworkBehaviour to bubble up events for consumption through swarm.next()~~ Allow users to opt out of the NetworkBehaviourEventProcess mechanism Jul 2, 2020

thomaseizinger force-pushed the bubble-up-events branch from f8ea37e to 80b2abf Compare July 2, 2020 02:02

thomaseizinger marked this pull request as ready for review July 2, 2020 02:02

thomaseizinger requested a review from tomaka July 2, 2020 02:02

tomaka reviewed Jul 3, 2020

View reviewed changes

swarm/CHANGELOG.md Outdated Show resolved Hide resolved

tomaka approved these changes Jul 3, 2020

View reviewed changes

tomaka requested a review from romanb July 3, 2020 10:01

thomaseizinger force-pushed the bubble-up-events branch from 03d0911 to a2d82e4 Compare July 4, 2020 07:55

romanb reviewed Jul 4, 2020

View reviewed changes

thomaseizinger force-pushed the bubble-up-events branch from a2d82e4 to 9fe2b65 Compare July 8, 2020 04:20

thomaseizinger force-pushed the bubble-up-events branch from 9fe2b65 to bfb3815 Compare July 8, 2020 04:24

thomaseizinger and others added 3 commits July 8, 2020 14:24

Allow users to opt-out of the NetworkBehaviourEventProcess mechanism

d7adff2

Add CHANGELOG entry

bfb3815

Merge branch 'master' into bubble-up-events

38748c3

Prepare release.

dc21a94

romanb merged commit c4a5497 into libp2p:master Jul 8, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow users to opt out of the `NetworkBehaviourEventProcess` mechanism #1630

Allow users to opt out of the `NetworkBehaviourEventProcess` mechanism #1630

thomaseizinger commented Jun 26, 2020

koivunej commented Jun 26, 2020

thomaseizinger commented Jun 27, 2020

tomaka commented Jun 29, 2020

tomaka left a comment

thomaseizinger commented Jun 30, 2020

tomaka commented Jun 30, 2020 •

edited

Loading

thomaseizinger commented Jun 30, 2020

thomaseizinger commented Jul 2, 2020

romanb left a comment

thomaseizinger commented Jul 5, 2020

romanb commented Jul 6, 2020

thomaseizinger commented Jul 7, 2020

romanb commented Jul 7, 2020

thomaseizinger commented Jul 8, 2020

romanb commented Jul 8, 2020

Allow users to opt out of the NetworkBehaviourEventProcess mechanism #1630

Allow users to opt out of the NetworkBehaviourEventProcess mechanism #1630

Conversation

thomaseizinger commented Jun 26, 2020

koivunej commented Jun 26, 2020

thomaseizinger commented Jun 27, 2020

tomaka commented Jun 29, 2020

tomaka left a comment

Choose a reason for hiding this comment

thomaseizinger commented Jun 30, 2020

tomaka commented Jun 30, 2020 • edited Loading

thomaseizinger commented Jun 30, 2020

thomaseizinger commented Jul 2, 2020

romanb left a comment

Choose a reason for hiding this comment

thomaseizinger commented Jul 5, 2020

romanb commented Jul 6, 2020

thomaseizinger commented Jul 7, 2020

romanb commented Jul 7, 2020

thomaseizinger commented Jul 8, 2020

romanb commented Jul 8, 2020

Allow users to opt out of the `NetworkBehaviourEventProcess` mechanism #1630

Allow users to opt out of the `NetworkBehaviourEventProcess` mechanism #1630

tomaka commented Jun 30, 2020 •

edited

Loading