-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Orleans.TestingHost support test clusters without legacy configuration #3878
Conversation
9ac99b4
to
2db2741
Compare
test/Tester/GrainActivatorTests.cs
Outdated
} | ||
|
||
private class TestSiloBuilderFactory : ISiloBuilderFactory | ||
private class TestSiloBuilderFactory : ISiloBuilderConfigurator |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: you could rename to TestSiloBuilderConfigurator
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ah, yeah, I renamed some, but I'm sure I missed many
Is this fine or should it use Refers to: test/TestExtensions/HostedTestClusterBase.cs:62 in 2db2741. [](commit_id = 2db27411cdf8c8770e2b93c389598b3386a1b7a8, deletion_comment = False) |
The latter, will fix. BTW, your comment isn't anchored to the code. |
{ | ||
public interface ITestClusterManagementGrain : IGrainWithGuidKey | ||
{ | ||
Task<T> GetService<T>(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I wouldn't add this grain here (in the package that we ship) since almost no service would be serializable. The only one I can think of is the legacy config objects, so wouldn't pollute the surface with this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Alright, should I make a more specific legacy-themed grain to get config objects or try to retrieve them some other way?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If it was just to get the gateway endpoint, you probably don't even need that, you can derive it from the options, or even ask the client
{ | ||
var gwEndpoint = this.HostedCluster.Primary.NodeConfiguration.ProxyGatewayEndpoint; | ||
var clusterConfig = await this.Client.GetGrain<ITestClusterManagementGrain>(Guid.Empty).GetService<ClusterConfiguration>(); | ||
var gwEndpoint = clusterConfig.Overrides[Silo.PrimarySiloName].ProxyGatewayEndpoint; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There are other ways to get the proxy endpoint without the need to resort to serializing the config back from the silo (and in fact that would break if you stop using the legacy config).
If you hold on to the TestClusterOptions, you can peek in there. Otherwise there's also the opportunity to get the gateway list provider options from the client itself without asking the silo for the config
/// <summary>Get the proxy address of the silo</summary> | ||
public SiloAddress ProxyAddress => SiloAddress.New(this.NodeConfiguration.ProxyGatewayEndpoint, 0); | ||
///// <summary>Get the proxy address of the silo</summary> | ||
public SiloAddress GatewayAddress { get; set; } |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
the default implementation I think could just look for this information in the TestClusterOptions if it uses test cluster membership.
This way other subclasses do not have to rely on extended RPC for it to work (such as MarshalByRefObject as we have in the AppDomainSiloHandle implementation)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I wanted this so that I could query it for some tests. It seems right that if SiloAddress is there, then GatewayAddress should also be there.
On the other hand, I'm not confident about the overall class structure for this change (i.e, could things be separated more cleanly?)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure, it's fine, I don't really mind if we keep this way for now. The reality is that I initially was retrieving it from the container in that way before I did all the work to move to strongly typed config. Today you can just as well get it directly from the configuration.
But as I said, it's fine... Both AppDomain and in-domain strategies can RPC from the TestCluster... If we have other approaches (like out of process or containers) we can refactor it so that the silo handle infers it from the config instead of asking the remote host.
var clientConfiguration = GetOrCreateClientConfiguration(services, configuration); | ||
|
||
// Test is running inside debugger - Make timeout ~= infinite | ||
clientConfiguration.ResponseTimeout = TimeSpan.FromMilliseconds(1000000); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Note: Once we move this to strongly typed options, we can get rid of the dependency to the legacy config objects in the TestClusterHostFactory
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Moved this to LegacyTestClusterConfiguration
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The reality is that this is useful in the non-legacy case too going forward. I would keep this by default, it's just that when you eventually move this setting out to a strongly typed option, then you can stop relying on the legacy configuration entirely
I'm thinking of modifying Something like ~ void ConfigureTestCluster(TestClusterBuilder builder)
{
// probably this would be done in the base class.
builder.AddTestEnvironmentConfiguration();
// Set some legacy configuration options.
builder.ConfigureLegacyConfiguration(cfg =>
{
cfg.ClusterConfiguration.SomeProperty = 3;
cfg.ClientConfiguration.SomeProperty = 3
});
} What do you think? EDIT: the latest commit implements that change to ConfigureTestCluster, but not "AddTestEnvironmentConfiguration" |
I do like the change to use extension methods on the 1 and and only test cluster builder instead of inheritance 👍 👍 in fact now that I remember, I thought of avoiding inheritance too, but never got to it. |
@@ -56,13 +50,6 @@ public void Shutdown() | |||
{ | |||
this.host.StopAsync().GetAwaiter().GetResult(); | |||
} | |||
|
|||
private void InitializeTestHooksSystemTarget() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
where is this part of logic moved to ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Moved to TestClusterHostFactory
{ | ||
var siloName = options.GetSiloSpecificOptions(i).SiloName; | ||
|
||
this.ClusterConfiguration.GetOrCreateNodeConfigurationForSilo(siloName); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why go back to defining these? can't you avoid it and only do so specifically if a particular test was incorrectly relying on it?
I'd like to prevent re-adding this at the infrastructure level if it was just for an edge case test
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I was hoping to avoid it, too, but it's used in non-test code in a non-trivial way which was making tests fail. I had a fix for the one test usage I sent to you, but I reverted it and went with this approach.
orleans/src/Orleans.Runtime/Streams/QueueBalancer/StaticClusterDeploymentConfiguration.cs
Line 26 in 9bd486b
return _clusterConfiguration.Overrides.Keys.ToList(); |
It will [have to] be addressed when we move away from ClusterConfiguration internally
{ | ||
public Task<Guid> GetServiceId() | ||
{ | ||
return Task.FromResult(this.ServiceProvider.GetRequiredService<GlobalConfiguration>().ServiceId); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why not get this from the strongly typed option (instead of the legacy config) which is the authoritative source of truth?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Will fix
@dotnet-bot test bvt |
can we have some write up about new interfaces and major changes, so it is easier to review? |
@xiazen the gist of it is this:
|
if (useTestClusterMemebership) | ||
if (useTestClusterMemebership | ||
&& services.All(svc => svc.ServiceType != typeof(IMembershipTable)) | ||
&& services.All(svc => svc.ServiceType != typeof(IMembershipOracle))) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: could avoid double enumeration
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'll remove this and make sure UseTestClusterMembership
is correctly set instead
https://ci.dot.net/job/dotnet_orleans/job/master/job/functional_prtest/1013/
|
@dotnet-bot test this please |
Please review but don't merge (I'm sure there will be changes requested anyway) - I need to ensure we have a green run in VSO first. |
yup, sure, pay special attention to Azure tests, since they don't run here, as using different membership providers are directly impacted with how this new test infrastructure works. Should be trivial to resolve if there are issues though |
@@ -284,42 +237,39 @@ public static TimeSpan GetLivenessStabilizationTime(GlobalConfiguration global, | |||
/// <summary> | |||
/// Start an additional silo, so that it joins the existing cluster. | |||
/// </summary> | |||
/// <returns>SiloHandle for the newly started silo.</returns> | |||
/// <returns>SiloHandle2 for the newly started silo.</returns> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Search for old usages of SiloHandle2 and replace with SiloHandle
///// <summary> | ||
///// Restart a previously stopped. | ||
///// </summary> | ||
///// <param name="siloName">Silo to be restarted.</param> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
uncomment XML comments
LGTM. Ship it! :) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM except two minor comments
/// <summary> | ||
/// Configures the client builder. | ||
/// </summary> | ||
public abstract void Configure(IConfiguration configuration, IClientBuilder clientBuilder); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why do we have an extra ClientBuilderConfiguratorBase.Configure
method here on top of IClientBuilderConfigurator.Configure
? if they are supposed to do different things, maybe name them different
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The explicit IClientBuilderConfigurator
implementation does some work before calling the abstract method. I could collapse the class hierarchy instead
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I might be dumb here, but why cannot we merge two Configure method?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we can, it just means moving this code into any consumer. The idea was that this class would be reusable, but ... YAGNI
} | ||
catch (FileNotFoundException) | ||
{ | ||
configuration = new ClientConfiguration(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure swallowing FileNotFoundException
is a good idea, didn't this change original behavior? And for users who wanted to do StandardLoad
from a config file, it hided exception from them which make it harder for them to notice that they made a mistake somewhere.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not really, this is for the path we're users didn't explicitly pass a config object. If they do want, then they'll get the exception in their code when they try to load the config.
Btw, this code path will be removed as soon as the runtime itself stops requiring the legacy config in the first place
@@ -41,7 +41,7 @@ protected override void ConfigureTestCluster(TestClusterBuilder builder) | |||
{ | |||
Guid serviceId = Guid.NewGuid(); | |||
builder.Options.InitialSilosCount = 4; | |||
|
|||
builder.Options.UseTestClusterMembership = false; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Aren't other external membership tests (non azure) affected by the same issue. They wouldn't show up as we don't run them in CI
<ItemGroup> | ||
<Compile Remove="Grains\**" /> | ||
<EmbeddedResource Remove="Grains\**" /> | ||
<None Remove="Grains\**" /> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What's all this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh, I thought I fixed that - my bad.
There are conflicts now. If you resolve them and tests are passing, I'll merge it asap |
2473d77
to
6a3c469
Compare
@dotnet-bot test functional |
6a3c469
to
5419560
Compare
@dotnet-bot test bvt |
I'm happy for us to merge this now |
I was about to on Friday, but some multi cluster tests in vso are failing because of this. Should I merge anyway or wait until they are fixed? |
There were other issues with the multi-cluster tests. I think there's no need to block this waiting for them to be investigated. |
Oh, OK then. I'll merge as soon as the conflicts are resokved |
5419560
to
61eb1c4
Compare
resolved |
oh, too slow :( |
:) yeah. you are late for the party ;) . I verified that it passed in vso with Reuben. So I merged it |
The tests passed in .NET CI, so it's all good. I'm just glad this is merged. |
Yeah, sorry for the delay, I was thinking that you were planning to fix the multi-cluster tests... I was running tests in VSO for the entire week last week, waiting for the moment they passed so I could merge :) |
This should allow us to more easily migrate configuration over to non-legacy classes bit-by-bit as well as hopefully allow us to run our tests on .NET Core (i.e, by removing the reliance on creating AppDomains)
cc @jdom, who performed the initial work