Minor improvements to Service Fabric providers #3250

ReubenBond · 2017-07-26T04:16:44Z

In testing, I found that the check I had eagerly added to avoid processing outdated Service Fabric partition change notifications was causing many lost updates during disaster recovery scenarios. I'm not blaming SF; it's likely because of how I was comparing partition equality, i.e., two ResolvedServicePartition instances which belong to a Singleton partition must belong to the same partition, because logically there can only be one singleton partition. This PR removes that check, since it's unnecessary (any stale information will quickly be superseded by fresh information).

Fixed a NullReferenceException which was thrown when a client attempted to resolve silos before the silo service had been successfully created.

Increased logging.

Reduced the eager refresh interval from 30s to 5s. If logging is verbose, this will produce 6x as many log entries from that process. MaxStaleness was also reduced to match the refresh interval, which has the effect of blacklisted gateways being cleared much more quickly. I see no downside to this: the shortened polling interval is still not aggressive.

xiazen · 2017-07-31T18:08:00Z

src/OrleansServiceFabricUtils/FabricGatewayProvider.cs

@@ -67,14 +67,16 @@ public FabricGatewayProvider(IFabricServiceSiloResolver siloResolver)
        /// <inheritdoc />
        public bool SubscribeToGatewayNotificationEvents(IGatewayListListener subscriber)
        {
+            this.log.Verbose($"Unsubscribing {subscriber} to gateway notification events.");


Subscribe *
typo

ahhh thanks

I added these logs because of the weird behavior which eventually led to #3249

xiazen · 2017-07-31T19:43:32Z

src/OrleansServiceFabricUtils/FabricServiceSiloResolver.cs

-                        this.log.Info($"Update for partition {updated} is superseded by existing version.");
-
-                        // Do not update the partition if the exiting one has a newer version than the update.
-                        break;


I'm not an expert on Service Fabric, but would removing this pose the risk of an older partition overwriting a newer one? Yes, the older partition would mostly be corrected by a newer one eventually, but this adds unnecessary handling in partitionChange, which can be avoided, right? Or is this premature optimization?

Not really, I was just finding that these checks really don't add value. In the worst case, where there's some race and updates arrive out of order, we have stale information for one polling cycle (5 seconds).
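To illustrate the worst case described above, here is a minimal sketch of applying every update unconditionally. It does not use the real Service Fabric or Orleans types: `PartitionTable`, its `OnPartitionChange` method, and the silo names are hypothetical stand-ins, and the "next poll" is simulated by a plain method call.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical simplified model (not the real FabricServiceSiloResolver):
// maps a partition id to the silo list most recently reported for it.
public sealed class PartitionTable
{
    private readonly Dictionary<Guid, string[]> silos = new Dictionary<Guid, string[]>();

    // Every update is applied as-is; there is no version comparison.
    public void OnPartitionChange(Guid partitionId, string[] reported)
    {
        this.silos[partitionId] = reported;
    }

    public string[] Get(Guid partitionId) => this.silos[partitionId];
}

public static class StalenessDemo
{
    public static void Main()
    {
        var table = new PartitionTable();
        var id = Guid.NewGuid();

        table.OnPartitionChange(id, new[] { "silo-b" }); // fresh notification
        table.OnPartitionChange(id, new[] { "silo-a" }); // stale notification arriving late
        Console.WriteLine(string.Join(",", table.Get(id))); // prints "silo-a" (stale)

        // The next refresh re-reports the current state and corrects the entry.
        table.OnPartitionChange(id, new[] { "silo-b" });
        Console.WriteLine(string.Join(",", table.Get(id))); // prints "silo-b"
    }
}
```

Under these assumptions the stale entry survives only until the next refresh, which is the trade-off being accepted here.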

xiazen · 2017-07-31T20:07:37Z

src/OrleansServiceFabricUtils/Utilities/ServiceFabricExtensions.cs

-        /// </returns>
-        public static bool IsOlderThan(this ResolvedServicePartition left, ResolvedServicePartition right)
-        {
-            return left.Info.Id == right.Info.Id && left.CompareVersion(right) < 0;


Can this be just left.CompareVersion(right) < 0;? That way we remove the unnecessary check but still compare the version, so that FabricServiceSiloResolver.OnPartitionChange won't process an older partition.

we can, but the check doesn't provide value

Actually, that wouldn't work, because the version would be reset when the partition ids change (and in any case the two versions wouldn't be related to each other, since they're for different physical partitions).
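The point above can be sketched with simplified stand-ins. `PartitionSnapshot` is a hypothetical replacement for ResolvedServicePartition that carries only an id and a numeric version (an assumption made for illustration; the real type exposes CompareVersion rather than a plain number).

```csharp
using System;

// Hypothetical stand-in for ResolvedServicePartition: just an id and a version.
public sealed class PartitionSnapshot
{
    public PartitionSnapshot(Guid id, long version)
    {
        this.Id = id;
        this.Version = version;
    }

    public Guid Id { get; }
    public long Version { get; }
}

public static class PartitionComparison
{
    // Mirrors the shape of the removed IsOlderThan: versions are only
    // compared when both snapshots have the same partition id.
    public static bool IsOlderThan(PartitionSnapshot left, PartitionSnapshot right)
        => left.Id == right.Id && left.Version < right.Version;

    // The suggested variant: compares versions regardless of partition id.
    public static bool IsOlderThanVersionOnly(PartitionSnapshot left, PartitionSnapshot right)
        => left.Version < right.Version;
}

public static class ComparisonDemo
{
    public static void Main()
    {
        // The existing partition has accumulated a high version number.
        var existing = new PartitionSnapshot(Guid.NewGuid(), version: 42);
        // A new physical partition gets a new id, and its version starts over.
        var recreated = new PartitionSnapshot(Guid.NewGuid(), version: 1);

        Console.WriteLine(PartitionComparison.IsOlderThan(recreated, existing));            // False
        Console.WriteLine(PartitionComparison.IsOlderThanVersionOnly(recreated, existing)); // True
    }
}
```

In this sketch the version-only comparison classifies the newer partition as older, so its update would be discarded; the id-guarded comparison does not order the two snapshots at all.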

xiazen · 2017-07-31T22:51:42Z

@dotnet-bot test netstandard-win-functional

* Minor Service Fabric provider fixes and tweaks
* Additional logging in SF gateway provider
* FabricMembershipOracle reduce polling interval from 30s to 5s
* review feedback

ReubenBond added 2 commits July 26, 2017 14:03

Minor Service Fabric provider fixes and tweaks

6ddbe78

Additional logging in SF gateway provider

683fc12

dnfclas added the cla-already-signed label Jul 26, 2017

FabricMembershipOracle reduce polling interval from 30s to 5s

90cd2f7

ReubenBond mentioned this pull request Jul 27, 2017

Cherry-picked fixes for 1.5.1 #3245

Merged

xiazen reviewed Jul 31, 2017

review feedback

755e15e

xiazen merged commit a568af6 into dotnet:master Aug 1, 2017

github-actions bot locked and limited conversation to collaborators Dec 9, 2023
