Renew retention lease with the last known synced checkpoint #18

tbhanu-amzn · 2021-06-18T09:08:02Z

Renew retention lease with the last known synced checkpoint

Issues Resolved

When the follower node that holds the primary shard of an index goes down, a replica shard picks up the replication task. In this case, the previous code always added the retention lease with sequence number -1, which led to replication failure.

This change ensures that the last known synced checkpoint on the follower node is used when adding the retention lease, so replication resumes gracefully. The following logs, captured while a node was taken down, show replication resuming.

Logs on the node hosting the follower primary shard, up to the point where the node was terminated:

[2021-06-18T08:35:48,944][INFO ][c.a.e.r.t.s.ShardReplicationTask] [node2] [test_index][0] Got 396 changes starting from seqNo: 247105
[2021-06-18T08:35:48,944][INFO ][c.a.e.r.t.s.ShardReplicationTask] [node2] [test_index][0] Renewing retentionlease of follower global check point: 244539
Connection to ec2-34-241-98-254.eu-west-1.compute.amazonaws.com closed by remote host.
Connection to ec2-34-241-98-254.eu-west-1.compute.amazonaws.com closed.

Logs on the follower node selected as the new primary, where replication resumes gracefully:

[2021-06-18T08:35:49,824][INFO ][c.a.e.r.t.s.ShardReplicationExecutor] [node3] starting persistent replication task: {"remote_cluster":"leader-cluster-1node","remote_shard":"[test_index][0]","remote_index_uuid":"Gn6nr07ZQ3mg_lFvQ4SZ1w","follower_shard":"[test_index][0]","follower_index_uuid":"6TolcBpWSmuWSUNyQFMMMA"}, com.amazon.elasticsearch.replication.task.shard.FollowingState@4dbd6226, 5, {"state":"STARTED"}
[2021-06-18T08:35:49,863][ERROR][c.a.e.r.s.RemoteClusterRetentionLeaseHelper] [node3] retention lease with ID [replication:follower-cluster-node:[test_index][0]] already exists
[2021-06-18T08:35:49,864][INFO ][c.a.e.r.s.RemoteClusterRetentionLeaseHelper] [node3] Renew retention lease as it already exists replication:follower-cluster-node:[test_index][0] with 247500
[2021-06-18T08:35:49,866][INFO ][c.a.e.r.t.s.ShardReplicationTask] [node3] [test_index][0] Adding retentionlease of follower global check point: 247500
[2021-06-18T08:35:49,866][INFO ][c.a.e.r.t.s.ShardReplicationTask] [node3] [test_index][0] Follower Global check point is: 247500
[2021-06-18T08:35:49,866][INFO ][c.a.e.r.t.s.ShardReplicationTask] [node3] [test_index][0] Index local check point is : 247500
[2021-06-18T08:35:50,599][INFO ][c.a.e.r.t.s.ShardReplicationTask] [node3] [test_index][0] Got 513 changes starting from seqNo: 247501
[2021-06-18T08:35:50,600][INFO ][c.a.e.r.t.s.ShardReplicationTask] [node3] [test_index][0] Renewing retentionlease of follower global check point: 247500
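
The add-then-renew sequence visible in the node3 logs can be sketched with a toy in-memory lease store. This is only an illustration under stated assumptions: `ToyLeaseStore`, `LeaseAlreadyExists`, `InvalidRetainingSeqNo`, and `addOrRenew` are hypothetical names, not the plugin's `RemoteClusterRetentionLeaseHelper` or the Elasticsearch API. The model assumes a lease may only move forward, so a stale value such as -1 is rejected once a lease exists.

```kotlin
// Toy model of a leader-side retention lease store. All names are hypothetical.
class LeaseAlreadyExists(id: String) : Exception("retention lease with ID [$id] already exists")
class InvalidRetainingSeqNo(message: String) : Exception(message)

class ToyLeaseStore {
    private val leases = mutableMapOf<String, Long>()

    // Adding fails if a lease with the same ID is already present.
    fun add(id: String, retainingSeqNo: Long) {
        if (id in leases) throw LeaseAlreadyExists(id)
        leases[id] = retainingSeqNo
    }

    // Assumption of this model: a lease can only move forward.
    fun renew(id: String, retainingSeqNo: Long) {
        val current = leases[id] ?: throw IllegalStateException("no lease [$id]")
        if (retainingSeqNo < current) {
            throw InvalidRetainingSeqNo("cannot move lease [$id] back from $current to $retainingSeqNo")
        }
        leases[id] = retainingSeqNo
    }

    fun retained(id: String): Long? = leases[id]
}

// Try to add the lease; if it already exists, renew it instead.
fun addOrRenew(store: ToyLeaseStore, id: String, seqNo: Long) {
    try {
        store.add(id, seqNo)
    } catch (e: LeaseAlreadyExists) {
        store.renew(id, seqNo)
    }
}

fun main() {
    val store = ToyLeaseStore()
    val id = "replication:follower-cluster-node:[test_index][0]"

    // The old primary last renewed the lease at 244539 before it was terminated.
    store.add(id, 244539L)

    // The new primary resumes with its own synced checkpoint: the add fails, the renew succeeds.
    addOrRenew(store, id, 247500L)
    println("lease retained at ${store.retained(id)}")

    // Resuming with -1 instead is rejected in this model.
    val rejected = try {
        addOrRenew(store, id, -1L)
        false
    } catch (e: InvalidRetainingSeqNo) {
        true
    }
    println("renewal with -1 rejected: $rejected")
}
```

The sequence numbers are taken from the logs above; everything else is illustrative.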

gbbafna · 2021-06-18T09:29:21Z

src/main/kotlin/com/amazon/elasticsearch/replication/task/shard/ShardReplicationTask.kt

@@ -138,7 +140,8 @@ class ShardReplicationTask(id: Long, type: String, action: String, description:
                rateLimiter.release()
                continue
            }
-            retentionLeaseHelper.renewRetentionLease(remoteShardId, seqNo, followerShardId)
+            //renew retention lease with global checkpoint so that any shard that picks up shard replication task has data until then.
+            retentionLeaseHelper.renewRetentionLease(remoteShardId, indexShard.lastSyncedGlobalCheckpoint, followerShardId)


Why are we using localCheckpoint in one place and GlobalCheckpoint in the other?

GlobalCheckpoint can lag behind localCheckpoint. In that case, renewing the existing lease with a lower sequence number throws RetentionLeaseInvalidRetainingSeqNoException.

Ack, changed this to the global checkpoint (GCP).
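
To make the reasoning in this thread concrete, here is a minimal sketch of why the global checkpoint is a safe retaining point for any copy that may take over the task. It assumes the usual definition of the global checkpoint as the minimum local checkpoint across in-sync shard copies. `ShardCopy` and `globalCheckpoint` are hypothetical helpers, and the checkpoint values are made up for illustration.

```kotlin
// Hypothetical model: each in-sync copy of a follower shard reports its local checkpoint.
data class ShardCopy(val node: String, val localCheckpoint: Long)

// Assumed definition: the global checkpoint is the minimum local checkpoint
// across all in-sync copies.
fun globalCheckpoint(copies: List<ShardCopy>): Long {
    require(copies.isNotEmpty()) { "at least one in-sync copy is required" }
    return copies.minOf { it.localCheckpoint }
}

fun main() {
    // Illustrative values only: the primary is ahead of its replica.
    val copies = listOf(
        ShardCopy("node2", 1200L), // primary
        ShardCopy("node3", 1150L)  // replica
    )
    val gcp = globalCheckpoint(copies)

    // Every copy has processed all operations up to the global checkpoint,
    // so a lease retained there covers whichever copy is promoted next.
    val coversAllCopies = copies.all { it.localCheckpoint >= gcp }
    println("global checkpoint = $gcp, covers every copy: $coversAllCopies")
}
```

A lease held at the primary's local checkpoint would not give that guarantee in this model, because a promoted replica could still need operations below it.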

amkhar · 2021-06-19T04:40:38Z

src/main/kotlin/com/amazon/elasticsearch/replication/task/shard/ShardReplicationTask.kt

-        var seqNo = indexShard.localCheckpoint + 1
+        // Adding retention lease at local checkpoint of a node. This makes sure
+        // new tasks spawned after node changes/shard movements are handled properly
+        log.info("Adding retentionlease at follower Sequence number: ${indexShard.localCheckpoint}")


Minor: lowercase "s" in "Sequence"?

gbbafna

LGTM. Please add a backlog item to add IT for the scenario.

naveenpajjuri

LGTM

naveenpajjuri

LGTM

Renew retention lease with the last known synced checkpoint

426e22b

gbbafna reviewed Jun 18, 2021

amkhar reviewed Jun 19, 2021

tbhanu-amzn added 2 commits June 23, 2021 11:27

Renew retention lease with the last known synced checkpoint

a900709

fixing minot nitpick

06739c2

gbbafna previously approved these changes Jun 23, 2021

naveenpajjuri previously approved these changes Jun 23, 2021

Merge branch 'main' into tbhanu-followerGCP

a90bd17

tbhanu-amzn dismissed stale reviews from naveenpajjuri and gbbafna via a90bd17 June 23, 2021 10:52

naveenpajjuri approved these changes Jun 23, 2021

gbbafna approved these changes Jun 23, 2021

tbhanu-amzn merged commit 819107a into main Jun 23, 2021
