New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cluster startup hard block #5515
Open
Zetanova
wants to merge
6
commits into
akkadotnet:dev
Choose a base branch
from
Zetanova:cluster-startup-hard-lock
base: dev
Could not load branches
Branch not found: {{ refName }}
Could not load tags
Nothing to show
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Changes from all commits
Commits
Show all changes
6 commits
Select commit
Hold shift + click to select a range
b94ed29
fix missing config
Zetanova a1e284e
spec cluster startup channel executor
Zetanova 360a88d
hard block cluster startup
Zetanova 4c3021f
Merge branch 'dev' into cluster-startup-hard-lock
Zetanova 72ee78e
Merge branch 'dev' into cluster-startup-hard-lock
Arkatufus 2619643
Merge branch 'dev' into cluster-startup-hard-lock
Zetanova File filter
Filter by extension
Conversations
Failed to load comments.
Jump to
Jump to file
Failed to load files.
Diff view
Diff view
There are no files selected for viewing
74 changes: 74 additions & 0 deletions
74
src/core/Akka.Cluster.Tests/StartupWithChannelExecutorSpec.cs
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,74 @@ | ||
//----------------------------------------------------------------------- | ||
// <copyright file="StartupWithOneThreadSpec.cs" company="Akka.NET Project"> | ||
// Copyright (C) 2009-2021 Lightbend Inc. <http://www.lightbend.com> | ||
// Copyright (C) 2013-2021 .NET Foundation <https://github.com/akkadotnet/akka.net> | ||
// </copyright> | ||
//----------------------------------------------------------------------- | ||
|
||
using System; | ||
using System.Threading; | ||
using Akka.Actor; | ||
using Akka.Actor.Dsl; | ||
using Akka.Configuration; | ||
using Akka.Event; | ||
using Akka.TestKit; | ||
using Akka.Util; | ||
using Xunit; | ||
|
||
namespace Akka.Cluster.Tests | ||
{ | ||
public sealed class StartupWithChannelExecutorSpec : AkkaSpec | ||
{ | ||
private static readonly Config Configuration = ConfigurationFactory.ParseString(@" | ||
akka.actor.creation-timeout = 10s | ||
akka.actor.provider = cluster | ||
akka.actor.default-dispatcher.executor = channel-executor | ||
akka.actor.internal-dispatcher.executor = channel-executor | ||
akka.remote.default-remote-dispatcher.executor = channel-executor | ||
akka.remote.backoff-remote-dispatcher.executor = channel-executor | ||
").WithFallback(ConfigurationFactory.Default()); | ||
|
||
private long _startTime; | ||
|
||
public StartupWithChannelExecutorSpec() : base(Configuration) | ||
{ | ||
_startTime = MonotonicClock.GetTicks(); | ||
} | ||
|
||
private Props TestProps | ||
{ | ||
get | ||
{ | ||
Action<IActorDsl> actor = (c => | ||
{ | ||
c.ReceiveAny((o, context) => context.Sender.Tell(o)); | ||
c.OnPreStart = context => | ||
{ | ||
var log = context.GetLogger(); | ||
var cluster = Cluster.Get(context.System); | ||
log.Debug("Started {0} {1}", cluster.SelfAddress, Thread.CurrentThread.Name); | ||
}; | ||
}); | ||
return Props.Create(() => new Act(actor)); | ||
} | ||
} | ||
|
||
[Fact] | ||
public void A_cluster_must_startup_with_channel_executor_dispatcher() | ||
{ | ||
var totalStartupTime = TimeSpan.FromTicks(MonotonicClock.GetTicks() - _startTime).TotalMilliseconds; | ||
Assert.True(totalStartupTime < (Sys.Settings.CreationTimeout - TimeSpan.FromSeconds(2)).TotalMilliseconds); | ||
Sys.ActorOf(TestProps).Tell("hello"); | ||
Sys.ActorOf(TestProps).Tell("hello"); | ||
Sys.ActorOf(TestProps).Tell("hello"); | ||
|
||
var cluster = Cluster.Get(Sys); | ||
totalStartupTime = TimeSpan.FromTicks(MonotonicClock.GetTicks() - _startTime).TotalMilliseconds; | ||
Assert.True(totalStartupTime < (Sys.Settings.CreationTimeout - TimeSpan.FromSeconds(2)).TotalMilliseconds); | ||
|
||
ExpectMsg("hello"); | ||
ExpectMsg("hello"); | ||
ExpectMsg("hello"); | ||
} | ||
} | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -62,14 +62,17 @@ public ChannelTaskScheduler(ExtendedActorSystem system) | |
//config channel-scheduler | ||
var config = system.Settings.Config.GetConfig("akka.channel-scheduler"); | ||
_maximumConcurrencyLevel = ThreadPoolConfig.ScaledPoolSize( | ||
config.GetInt("parallelism-min"), | ||
config.GetDouble("parallelism-factor", 1.0D), // the scalar-based factor to scale the threadpool size to | ||
config.GetInt("parallelism-max")); | ||
config?.GetInt("parallelism-min", 4) ?? 4, | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Same thing here, these should be placed in a different PR |
||
config?.GetDouble("parallelism-factor", 1.0D) ?? 1.0D, // the scalar-based factor to scale the threadpool size to | ||
config?.GetInt("parallelism-max", 64) ?? 64); | ||
_maximumConcurrencyLevel = Math.Max(_maximumConcurrencyLevel, 1); | ||
_maxWork = Math.Max(config.GetInt("work-max", _maxWork), 3); //min 3 normal work in work-loop | ||
|
||
_workInterval = config.GetInt("work-interval", _workInterval); | ||
_workStep = config.GetInt("work-step", _workStep); | ||
|
||
if (config != null) | ||
{ | ||
_maxWork = Math.Max(config.GetInt("work-max", _maxWork), 3); //min 3 normal work in work-loop | ||
_workInterval = config.GetInt("work-interval", _workInterval); | ||
_workStep = config.GetInt("work-step", _workStep); | ||
} | ||
|
||
//create task schedulers | ||
var channelOptions = new UnboundedChannelOptions() | ||
|
@@ -276,7 +279,7 @@ private int DoWork(int workerId) | |
//the work loop | ||
_threadPriority = TaskSchedulerPriority.Idle; | ||
try | ||
{ | ||
{ | ||
do | ||
{ | ||
rounds++; | ||
|
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Are there any flags we should consider adding here? i.e. would it be better to do
Task.Factory.StartNew
withLongRunning
andDenyChildAttach
?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
_clusterCore = GetClusterCoreRef().Result
was always locking and I think its still is.I think what happend is that after the Ask got
improved
the current thread gets used for the actorcell dispatcher.I don't know how/where exactly, but the child-actors of cluster-daemon are using it.
´Task.Run(GetClusterCoreRef)´ should force the ask on a different thread,
this is the only thing that matters, that it the current thread does not leak.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Interesting. Would there be other implications if this is true?
cc: @Aaronontheweb
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The actor system extensions implementation and the locking cluster constructor are both just anti-patterns
and we need to rework it in the future.
#5447
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe its just the "ConfigureAwait(false)" of
akka.net/src/core/Akka.Cluster/Cluster.cs
Line 155 in 44c51f6
The Cluster Extension is called by the ClusterActorRefProvider and it is created in the ActorSystem Startup.
The thread is the same as of the ActorSystem creator and with the normal ForkJoinExecutor they will never mix.
But with the ChannelExecutor that uses the normal ThreadPool of dotnet, the awaiting thread can be used for an ActorCell.
I hope that simply resolve the problem.
task.Result
blocked at .net 4.5 for sure,I don't know why/how the thread gets now reused.