swarm: Make API for creating a new `Swarm` executor aware #3068

thomaseizinger · 2022-10-28T04:48:30Z

Description

This issue is extracted out of #2173 and proposes a solution to issues like #2230.

A Swarm needs to execute background tasks. Currently, each physical connection (e.g. a TCP connection) is executed on its own background task. This improves latency because the main event loop of the Swarm, which also calls NetworkBehaviour::poll, does not get blocked by IO. Also see #2885.

Today, the way a Swarm achieves this is very subtle.

By default, Swarm::new will try to create a futures::executor::ThreadPool to execute these connection background tasks. This does not work for async IO types from the tokio runtime: they need a tokio reactor on the thread they run on, and by default one is only active on the worker threads of a tokio Runtime. To make it work, a user has to use the SwarmBuilder instead and call executor(Box::new(|f| { tokio::spawn(f); })). See for example here:

rust-libp2p/examples/chat-tokio.rs

Lines 109 to 115 in b42f286

    SwarmBuilder::new(transport, behaviour, peer_id)
        // We want the connection background tasks to be spawned
        // onto the tokio runtime.
        .executor(Box::new(|fut| {
            tokio::spawn(fut);
        }))
        .build()

If creating the futures::executor::ThreadPool fails, the connection tasks are polled on the current thread instead:

rust-libp2p/swarm/src/connection/pool.rs

Lines 398 to 404 in b42f286

    fn spawn(&mut self, task: BoxFuture<'static, ()>) {
        if let Some(executor) = &mut self.executor {
            executor.exec(task);
        } else {
            self.local_spawns.push(task);
        }
    }

rust-libp2p/swarm/src/connection/pool.rs

Lines 101 to 103 in b42f286

    /// If no `executor` is configured, tasks are kept in this set and
    /// polled on the current thread when the [`Pool`] is polled for new events.
    local_spawns: FuturesUnordered<Pin<Box<dyn Future<Output = ()> + Send>>>,

I propose changing the API of Swarm and SwarmBuilder to force the user to decide which executor their tasks are executed on. There are several options, and it is not yet clear which one is best:

Introduce a type parameter on Swarm that has a trait bound for spawning new tasks. This would allow us to re-export type aliases like libp2p::swarm::tokio::Swarm which would point to something like libp2p::swarm::Swarm<TokioRuntime>. A type-parameter-only solution is likely the easiest to migrate to because it doesn't require any signature changes. We can deprecate the current type in favor of libp2p::swarm::futures_threadpool::Swarm.

Introduce a new parameter to libp2p::swarm::Swarm::new that requires passing an executor-specific type:

impl Swarm {
    fn new<T>(..., executor: T) -> Self { }
}

Swarm::new(..., TokioExecutor);

There are likely other solutions too. Whatever we choose, it must play nicely with cargo features, i.e. be completely additive in terms of APIs and not change any behaviour.
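To make the type-parameter option a bit more concrete, here is a rough, std-only sketch. Everything in it is hypothetical and only for illustration: the `Executor` trait, the toy `ThreadPerTask` executor (one OS thread per task, standing in for a real runtime), the stripped-down `Swarm<E>` and the `ThreadSwarm` alias are assumptions, not the actual rust-libp2p API.

```rust
use std::future::Future;
use std::pin::Pin;
use std::sync::mpsc;
use std::sync::Arc;
use std::task::{Context, Wake, Waker};
use std::thread;

/// A boxed connection task, similar in shape to `BoxFuture<'static, ()>`.
type Task = Pin<Box<dyn Future<Output = ()> + Send + 'static>>;

/// Hypothetical trait bound for "something that can spawn tasks".
trait Executor {
    fn exec(&self, task: Task);
}

/// Waker that unparks the thread driving the task.
struct ThreadWaker(thread::Thread);

impl Wake for ThreadWaker {
    fn wake(self: Arc<Self>) {
        self.0.unpark();
    }
}

/// Toy stand-in for a runtime: drives each task to completion on its own OS thread.
struct ThreadPerTask;

impl Executor for ThreadPerTask {
    fn exec(&self, mut task: Task) {
        thread::spawn(move || {
            let waker = Waker::from(Arc::new(ThreadWaker(thread::current())));
            let mut cx = Context::from_waker(&waker);
            // Poll until the task is done, sleeping between wake-ups.
            while task.as_mut().poll(&mut cx).is_pending() {
                thread::park();
            }
        });
    }
}

/// Hypothetical, stripped-down `Swarm` that is generic over its executor.
struct Swarm<E: Executor> {
    executor: E,
}

impl<E: Executor> Swarm<E> {
    fn new(executor: E) -> Self {
        Self { executor }
    }

    /// Connection tasks always go to the executor the user picked.
    fn spawn_connection_task(&self, task: Task) {
        self.executor.exec(task);
    }
}

/// Alias in the spirit of `libp2p::swarm::tokio::Swarm`.
type ThreadSwarm = Swarm<ThreadPerTask>;

fn run_demo() -> u32 {
    let swarm: ThreadSwarm = Swarm::new(ThreadPerTask);
    let (tx, rx) = mpsc::channel();
    swarm.spawn_connection_task(Box::pin(async move {
        tx.send(42u32).unwrap();
    }));
    // Blocks until the background task has run.
    rx.recv().unwrap()
}

fn main() {
    println!("task reported {}", run_demo());
}
```

With this shape there is no silent default: a `Swarm` cannot be named without also naming its executor, and a per-runtime alias keeps call sites short.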

Motivation

Make it easier for users to use rust-libp2p correctly.
Less boilerplate to configure a custom executor.
Treat all executors equally.

Open questions

Are you planning to do it yourself in a pull request?

Maybe.


umgefahren · 2022-11-08T10:32:29Z

I'm currently working on a PR to solve this issue. However, I have a small question: Solving this could remove the need for local spawns since FuturesUnordered could also be treated as an executor. I don't see any usages of local spawns besides being a fallback.

thomaseizinger · 2022-11-08T11:12:08Z

> I'm currently working on a PR to solve this issue. However, I have a small question: Solving this could remove the need for local spawns since FuturesUnordered could also be treated as an executor. I don't see any usages of local spawns besides being a fallback.

It doesn't quite work, unfortunately.
FuturesUnordered requires &mut self to spawn new futures, and it needs to be polled to make progress. Neither is typically needed for the executors we are interested in.
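A small std-only sketch of those two properties. `LocalTasks` is a hypothetical, much simplified stand-in for `FuturesUnordered` (a plain `Vec` of boxed futures), so it only illustrates the shape of the problem, not the real type.

```rust
use std::future::Future;
use std::pin::Pin;
use std::sync::atomic::{AtomicBool, Ordering};
use std::sync::Arc;
use std::task::{Context, Wake, Waker};

type Task = Pin<Box<dyn Future<Output = ()> + Send + 'static>>;

/// Waker that does nothing; enough for tasks that finish on their first poll.
struct NoopWake;

impl Wake for NoopWake {
    fn wake(self: Arc<Self>) {}
}

/// Hypothetical, simplified stand-in for `FuturesUnordered`.
struct LocalTasks {
    tasks: Vec<Task>,
}

impl LocalTasks {
    /// Adding a task mutates the collection, hence `&mut self`.
    fn push(&mut self, task: Task) {
        self.tasks.push(task);
    }

    /// Nothing runs until the owner polls; finished tasks are dropped.
    fn poll_all(&mut self, cx: &mut Context<'_>) {
        let mut pending = Vec::new();
        for mut task in self.tasks.drain(..) {
            if task.as_mut().poll(cx).is_pending() {
                pending.push(task);
            }
        }
        self.tasks = pending;
    }
}

/// Returns whether the task had run (before polling, after polling).
fn run_demo() -> (bool, bool) {
    let done = Arc::new(AtomicBool::new(false));
    let flag = Arc::clone(&done);

    let mut local = LocalTasks { tasks: Vec::new() };
    local.push(Box::pin(async move {
        flag.store(true, Ordering::SeqCst);
    }));
    let before_poll = done.load(Ordering::SeqCst);

    let waker = Waker::from(Arc::new(NoopWake));
    let mut cx = Context::from_waker(&waker);
    local.poll_all(&mut cx);
    let after_poll = done.load(Ordering::SeqCst);

    (before_poll, after_poll)
}

fn main() {
    let (before, after) = run_demo();
    println!("ran before poll: {before}, ran after poll: {after}");
}
```

An executor such as `tokio::spawn`, by contrast, can be called through a shared reference and drives the task without the caller polling anything.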

umgefahren · 2022-11-08T13:17:52Z

I see, however, if there is an executor, there is no need for local spawns, right? So I could just construct an enum encapsulating the different cases.

thomaseizinger · 2022-11-08T13:38:31Z

> I see, however, if there is an executor, there is no need for local spawns, right? So I could just construct an enum encapsulating the different cases.

That is correct. It is either one or the other :)
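For illustration, such an enum could look roughly like the sketch below. The names `TaskRunner`, `Executor` and `PollOnce` are made up for this example and are not taken from the PR, and a plain `Vec` stands in for `FuturesUnordered` to keep it std-only.

```rust
use std::future::Future;
use std::pin::Pin;
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::Arc;
use std::task::{Context, Wake, Waker};

type Task = Pin<Box<dyn Future<Output = ()> + Send + 'static>>;

/// Hypothetical executor trait: `exec` hands the task off.
trait Executor {
    fn exec(&self, task: Task);
}

/// Hypothetical enum: either a user-supplied executor or the local fallback.
enum TaskRunner {
    Executor(Box<dyn Executor + Send>),
    Local(Vec<Task>),
}

impl TaskRunner {
    fn spawn(&mut self, task: Task) {
        match self {
            TaskRunner::Executor(executor) => executor.exec(task),
            TaskRunner::Local(tasks) => tasks.push(task),
        }
    }

    /// Only the local variant has work to drive when the pool is polled.
    fn advance_local(&mut self, cx: &mut Context<'_>) {
        if let TaskRunner::Local(tasks) = self {
            let mut pending = Vec::new();
            for mut task in tasks.drain(..) {
                if task.as_mut().poll(cx).is_pending() {
                    pending.push(task);
                }
            }
            *tasks = pending;
        }
    }
}

struct NoopWake;

impl Wake for NoopWake {
    fn wake(self: Arc<Self>) {}
}

fn noop_waker() -> Waker {
    Waker::from(Arc::new(NoopWake))
}

/// Toy executor that polls the task once, right away.
/// A real executor would schedule it on its own threads instead.
struct PollOnce;

impl Executor for PollOnce {
    fn exec(&self, mut task: Task) {
        let waker = noop_waker();
        let mut cx = Context::from_waker(&waker);
        let _ = task.as_mut().poll(&mut cx);
    }
}

/// Builds a task that increments the shared counter when it runs.
fn bump(counter: &Arc<AtomicUsize>) -> Task {
    let counter = Arc::clone(counter);
    Box::pin(async move {
        counter.fetch_add(1, Ordering::SeqCst);
    })
}

/// Counter value after: executor spawn, local spawn, local poll.
fn run_demo() -> (usize, usize, usize) {
    let counter = Arc::new(AtomicUsize::new(0));

    let mut with_executor = TaskRunner::Executor(Box::new(PollOnce));
    with_executor.spawn(bump(&counter));
    let after_executor_spawn = counter.load(Ordering::SeqCst);

    let mut local = TaskRunner::Local(Vec::new());
    local.spawn(bump(&counter));
    let after_local_spawn = counter.load(Ordering::SeqCst);

    let waker = noop_waker();
    let mut cx = Context::from_waker(&waker);
    local.advance_local(&mut cx);
    let after_local_poll = counter.load(Ordering::SeqCst);

    (after_executor_spawn, after_local_spawn, after_local_poll)
}

fn main() {
    println!("{:?}", run_demo());
}
```

The point of the enum is that `spawn` no longer needs an `Option` plus a separate fallback field: each value is in exactly one of the two modes.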

umgefahren · 2022-11-14T13:18:54Z

This issue is closed by #3097. Consider linking it.

thomaseizinger · 2022-11-15T00:16:41Z

> This issue is closed by #3097. Consider linking it.

Done!

Previously, the executor for connection tasks silently defaulted to a `futures::executor::ThreadPool`. This causes issues such as #2230. With this patch, we force the user to choose which executor they want to run the connection tasks on, which results in an overall simpler API with fewer footguns. Closes #3068.


This was referenced Oct 28, 2022

Re-design feature sets #2173

Closed

tokio-based tcp transports panic when dialing due to threadpools with unset runtimes. #2230

Closed

thomaseizinger added priority:nicetohave difficulty:moderate help wanted labels Oct 28, 2022

thomaseizinger mentioned this issue Nov 2, 2022

*: Remove use of development_transport in examples and tests #3056

Closed


umgefahren mentioned this issue Nov 8, 2022

feat(swarm): Make executor for connection tasks explicit #3097

Merged


mergify bot closed this as completed in #3097 Nov 15, 2022

This was referenced Dec 2, 2022

swarm: Deprecate Swarm::with_XYZ and enforce creation via SwarmBuilder #3186

Closed

feat: Add Identify + Kademlia chat example #3150

Closed

thomaseizinger mentioned this issue Feb 24, 2023

examples: Add quic to the example chat-tokio #3501

Closed
