Replay history of operations in remote recovery #39153

dnhatn · 2019-02-19T21:10:55Z

The safe commit invariant does not hold for the following indices because we do not replay the history in remote recovery.

Relates #39000
Relates #35975

elasticmachine · 2019-02-19T21:10:57Z

Pinging @elastic/es-distributed

dnhatn · 2019-02-20T19:59:52Z

run elasticsearch-ci/default-distro

ywelsch

Thanks @dnhatn. I've had an initial look and this looks already great. I'm not super happy with restoreShard returning a Translog.Snapshot, as it's now up to the caller to complete building the shard but also see that it's tricky to otherwise keep information about session etc. local to the repository implementation.

server/src/main/java/org/elasticsearch/index/shard/StoreRecovery.java

ywelsch · 2019-02-25T08:55:55Z

server/src/main/java/org/elasticsearch/index/shard/StoreRecovery.java

+                     * to the max_seq_no. Then we won't have a safe commit for the restoring commit is not safe (missing translog).
+                     * To maintain the safe commit assumption, we have to forcefully flush a new commit here.
+                     */
+                    indexShard.flush(new FlushRequest().force(true).waitIfOngoing(true));


should we make sure that the global checkpoint is up-to-date (i.e. >= max-seq-no in the new commit that we create here) before calling this? Otherwise the shard will be marked as in-sync in the cluster state while the commit here will only become safe commit when the shard is locally started (and the gcp advanced). The main property we're after here is that every in-sync shard copy has a safe commit, which (AFAICS) is not guaranteed by the current recovery logic.

ywelsch · 2019-02-25T08:58:36Z

test/framework/src/main/java/org/elasticsearch/index/shard/IndexShardTestCase.java

+     */
+    public static void assertSafeCommitExists(IndexShard shard) throws IOException {
+        try {
+            if (shard.state != IndexShardState.STARTED) {


a shard in POST_RECOVERY should also satisfy this property already, see my comment above

ywelsch · 2019-02-25T09:09:11Z

server/src/main/java/org/elasticsearch/index/shard/StoreRecovery.java

+                            "checkpoint=" + newCommitInfo.localCheckpoint + " max_seq_no=" + newCommitInfo.maxSeqNo);
+                    }
+                }
+            }
            indexShard.finalizeRecovery();
            indexShard.postRecovery("restore done");


should we assert in post_recovery that we have a safe commit?

I added an assertion here, but this assertion may slow down our tests. I will post the difference.

Sadly, we don't have this invariant with closed indices because the global checkpoint is not persisted to the translog checkpoint during recovery.

server/src/main/java/org/elasticsearch/index/engine/Engine.java

dnhatn · 2019-03-11T21:13:43Z

Discussed with Yannick on another channel, I marked this as WIP since we need to make broader (maybe unrelated) changes to this PR to achieve the safe commit invariant.

# Conflicts: # server/src/test/java/org/elasticsearch/index/replication/IndexLevelReplicationTests.java

dnhatn · 2019-04-03T18:52:46Z

We now can achieve the safe commit invariant. I will open smaller pull requests for unrelated changes.

dnhatn · 2019-04-05T03:13:05Z

@ywelsch This is ready for another round. Can you please give it another go. Thank you!

dnhatn · 2019-05-16T20:07:06Z

I am closing this PR as the safe commit invariant does not hold for follower indices.

Replay history in remote recovery

6bae9cd

dnhatn added blocker v7.0.0 :Distributed Indexing/CCR Issues around the Cross Cluster State Replication features v6.7.0 v8.0.0 v7.2.0 labels Feb 19, 2019

dnhatn requested review from martijnvg, Tim-Brooks, bleskes, ywelsch, jasontedor and DaveCTurner February 19, 2019 21:10

dnhatn mentioned this pull request Feb 19, 2019

Do not wait for advancement of checkpoint in recovery #39006

Merged

dnhatn added 5 commits February 20, 2019 08:18

Merge branch 'master' into remote-recovery-history

620368c

add safe commit assertion

702ab2b

only check internal engine

a8b516d

Merge branch 'master' into remote-recovery-history

583af89

Merge branch 'master' into remote-recovery-history

61fd092

dnhatn added 3 commits February 20, 2019 19:22

Merge branch 'master' into remote-recovery-history

11e5907

make translog snapshot from ops

e3e9a18

Merge branch 'master' into remote-recovery-history

e2682ac

danielmitterdorfer added the >bug label Feb 21, 2019

dnhatn added 4 commits February 22, 2019 13:42

Flush if history was replayed after restoring snapshot

f3ef90d

Merge branch 'master' into remote-recovery-history

afddbe3

Merge branch 'master' into remote-recovery-history

dfabbd1

check only started shards

7d40d33

ywelsch suggested changes Feb 25, 2019

View reviewed changes

dnhatn removed request for Tim-Brooks, bleskes, jasontedor and DaveCTurner March 11, 2019 17:35

dnhatn added the WIP label Mar 11, 2019

bootstrap global checkpoint in phase 1 of peer recovery

ec082f8

dnhatn removed v6.7.0 v7.0.0 labels Mar 11, 2019

dnhatn added 10 commits March 11, 2019 22:37

stylecheck

18f4b76

Merge branch 'master' into remote-recovery-history

f867c17

Merge branch 'master' into remote-recovery-history

a834af4

adapt change after merge

f1a077c

Merge branch 'master' into remote-recovery-history

fc94128

assertion :D

dccbed5

send shard changes requests to any copy

f48126c

Merge branch 'master' into remote-recovery-history

4f5b1a5

# Conflicts: # server/src/test/java/org/elasticsearch/index/replication/IndexLevelReplicationTests.java

Merge branch 'master' into remote-recovery-history

e8e8b60

Merge branch 'master' into remote-recovery-history

04a7976

dnhatn added 2 commits April 4, 2019 22:55

Merge branch 'master' into remote-recovery-history

a59edc0

Merge branch 'master' into remote-recovery-history

c3dab87

dnhatn removed the WIP label Apr 5, 2019

dnhatn requested a review from ywelsch April 5, 2019 03:10

dnhatn closed this May 16, 2019

dnhatn deleted the remote-recovery-history branch May 16, 2019 20:07

dnhatn removed v7.2.0 v8.0.0 labels Jun 19, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replay history of operations in remote recovery #39153

Replay history of operations in remote recovery #39153

dnhatn commented Feb 19, 2019 •

edited

Loading

elasticmachine commented Feb 19, 2019

dnhatn commented Feb 20, 2019

ywelsch left a comment

ywelsch Feb 25, 2019

ywelsch Feb 25, 2019

ywelsch Feb 25, 2019

dnhatn Feb 26, 2019

dnhatn Mar 11, 2019

dnhatn commented Mar 11, 2019

dnhatn commented Apr 3, 2019

dnhatn commented Apr 5, 2019

dnhatn commented May 16, 2019

Replay history of operations in remote recovery #39153

Replay history of operations in remote recovery #39153

Conversation

dnhatn commented Feb 19, 2019 • edited Loading

elasticmachine commented Feb 19, 2019

dnhatn commented Feb 20, 2019

ywelsch left a comment

Choose a reason for hiding this comment

ywelsch Feb 25, 2019

Choose a reason for hiding this comment

ywelsch Feb 25, 2019

Choose a reason for hiding this comment

ywelsch Feb 25, 2019

Choose a reason for hiding this comment

dnhatn Feb 26, 2019

Choose a reason for hiding this comment

dnhatn Mar 11, 2019

Choose a reason for hiding this comment

dnhatn commented Mar 11, 2019

dnhatn commented Apr 3, 2019

dnhatn commented Apr 5, 2019

dnhatn commented May 16, 2019

dnhatn commented Feb 19, 2019 •

edited

Loading