
Prevent CCR recovery from missing documents #38237

Merged (19 commits) on Feb 5, 2019

Conversation

Tim-Brooks (Contributor)

Currently the snapshot/restore process manually sets the global
checkpoint to the max sequence number from the restored segments. This
does not work for CCR, as it prevents documents that would otherwise be
recovered by the normal following operation from being recovered.

This commit fixes this issue by setting the initial global checkpoint to
the existing local checkpoint.
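
As a concrete illustration (the numbers below are made up for this example, not taken from the PR): suppose the restored commit's max sequence number is 100 but its local checkpoint is 90, i.e. some operations in 91..99 are missing from the commit. Bootstrapping the global checkpoint to 100 tells the follower it already has everything up to 100, so the missing operations are never fetched from the leader; bootstrapping it to 90 lets the normal following operation replay them. A minimal standalone sketch of that choice (hypothetical helper, not the actual Store/StoreRecovery code):

    final class GlobalCheckpointBootstrap {

        /**
         * Pick the global checkpoint to bootstrap a restored follower shard with.
         * Using the max sequence number would claim operations above the local
         * checkpoint are already present even though the commit has gaps there;
         * using the local checkpoint lets the follow task fetch them from the leader.
         */
        static long initialGlobalCheckpoint(long localCheckpoint, long maxSeqNo) {
            assert localCheckpoint <= maxSeqNo;
            return localCheckpoint; // previously: return maxSeqNo;
        }

        public static void main(String[] args) {
            // Example: commit's max seq no is 100, local checkpoint is 90.
            System.out.println(initialGlobalCheckpoint(90, 100)); // prints 90
        }
    }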

@Tim-Brooks Tim-Brooks added >bug v7.0.0 :Distributed Indexing/CCR Issues around the Cross Cluster State Replication features v6.7.0 labels Feb 1, 2019
@elasticmachine (Collaborator)

Pinging @elastic/es-distributed

@Tim-Brooks (Contributor, Author)

This is not ready for production. I pushed it up here so we can discuss.

This commit:

  1. Adds a test that will occasionally fail because the local checkpoint != the max sequence number. Currently the test is set to run 5 times to guarantee consistent failures. However, that will be removed eventually.

  2. Comments out store.bootstrapNewHistory(). This method manually sets the local checkpoint == the max sequence number. We need to discuss how to deal with this, as I think bootstrapNewHistory() probably offers some behavior we want to retain (even if the sequence number part does not work for CCR).

  3. Attempts to advance the global checkpoint to the local checkpoint after putting no-ops (for the normal non-CCR restore process). However, that cannot happen here, as the replication tracker is not yet in primary mode (replicationTracker.isPrimaryMode() is false) and that triggers an assertion. I'm not sure we need to advance the global checkpoint in StoreRecovery#restore at all, since when the replication tracker goes to primary mode it will advance the global checkpoint there (see the excerpt and the standalone sketch below). But this is something we should discuss.

    /**
     * Initializes the global checkpoint tracker in primary mode (see {@link #primaryMode}. Called on primary activation or promotion.
     */
    public synchronized void activatePrimaryMode(final long localCheckpoint) {
        assert invariant();
        assert primaryMode == false;
        assert checkpoints.get(shardAllocationId) != null && checkpoints.get(shardAllocationId).inSync &&
            checkpoints.get(shardAllocationId).localCheckpoint == SequenceNumbers.UNASSIGNED_SEQ_NO :
            "expected " + shardAllocationId + " to have initialized entry in " + checkpoints + " when activating primary";
        assert localCheckpoint >= SequenceNumbers.NO_OPS_PERFORMED;
        primaryMode = true;
        updateLocalCheckpoint(shardAllocationId, checkpoints.get(shardAllocationId), localCheckpoint);
        updateGlobalCheckpointOnPrimary();
        assert invariant();
    }
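
To make point 3 concrete, here is a small self-contained illustration (not Elasticsearch's LocalCheckpointTracker) of why indexing no-ops for the missing sequence numbers lets the local checkpoint catch up to the max sequence number, after which advancing the global checkpoint becomes safe:

    import java.util.BitSet;

    final class CheckpointGapDemo {

        /** The local checkpoint is the highest seq no up to which everything is completed. */
        static int localCheckpoint(BitSet completed) {
            int checkpoint = -1;
            while (completed.get(checkpoint + 1)) {
                checkpoint++;
            }
            return checkpoint;
        }

        public static void main(String[] args) {
            BitSet completed = new BitSet();
            // The restored commit contains seq nos 0..5 and 8..10; 6 and 7 are gaps.
            for (int seqNo : new int[] {0, 1, 2, 3, 4, 5, 8, 9, 10}) {
                completed.set(seqNo);
            }
            System.out.println(localCheckpoint(completed)); // 5: stuck below the gaps

            // Fill the gaps with no-ops, as the non-CCR restore path does.
            completed.set(6);
            completed.set(7);
            System.out.println(localCheckpoint(completed)); // 10 == max seq no
        }
    }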

@ywelsch (Contributor) commented Feb 4, 2019

@tbrooks8 I've fixed things up along the lines of how I think we should handle this. I've also added a unit test. The integration test testFollowIndexWithConcurrentMappingChanges sometimes fails, unrelated to the changes I've made here. It has some problems with concurrent mapping updates (it tries to update the same field both as text and as long).

While this fixes the problem here, I think it exposes another issue, namely that the primary will start off (i.e. be marked as started) with a history that contains gaps, i.e. local checkpoint != max sequence number. This can turn out to be problematic for replicas, because peer recovery only completes if all gaps are filled on the primary (see the call to cancellableThreads.execute(() -> shard.waitForOpsToComplete(endingSeqNo)); in RecoverySourceHandler). This means that if someone does a PUT FOLLOW with wait_for_active_shards > 1 and we restore an IndexCommit with gaps, then the wait condition will time out, as we only start the following (which will fill the gaps) once the wait condition has passed. I think we should have initiateFollowing first resume the following, and then execute the wait condition (see the sketch below). This problem should be verifiable by extending the existing test with a wait_for_active_shards > 1 condition.
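
A standalone sketch of the ordering change suggested above; the names resumeFollowing and waitForActiveShards are illustrative placeholders, not the actual TransportPutFollowAction code:

    import java.util.function.Consumer;

    final class InitiateFollowingOrder {

        /**
         * Resume the follow task before evaluating the wait_for_active_shards
         * condition: the follow task fills the sequence-number gaps on the primary,
         * which is what replica peer recovery (and thus the wait condition) needs.
         */
        static void initiateFollowing(Runnable resumeFollowing,
                                      Consumer<Runnable> waitForActiveShards,
                                      Runnable respond) {
            resumeFollowing.run();               // 1. start filling gaps from the leader
            waitForActiveShards.accept(respond); // 2. only then wait for active shards
        }

        public static void main(String[] args) {
            initiateFollowing(
                () -> System.out.println("follow task resumed"),
                onActive -> { System.out.println("wait_for_active_shards satisfied"); onActive.run(); },
                () -> System.out.println("PUT follow acknowledged"));
        }
    }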

@ywelsch ywelsch changed the title WIP: Prevent Ccr recovery from missing documents Prevent CCR recovery from missing documents Feb 4, 2019

@ywelsch ywelsch left a comment


LGTM

@Tim-Brooks (Contributor, Author)

@ywelsch your changes look good to me. There is a test failing (reproducibly). It looks like we assert that the sequence numbers match between Lucene and the translog. Did you mean for this to be the assertion for testRestoreShard:

closeShard(target, false);
closeShards(source);

instead of:

closeShard(source, false);
closeShards(target);

I assume we should not assert that the ops are the same for the target, since there will be some no-ops? I can update the PR; I just wanted to check whether that is what you intended.

@ywelsch (Contributor) commented Feb 5, 2019

No, I only wanted the source index to be leniently closed (because we forcefully inject a gap). The problem was that the test was not ensuring that the index is restored with soft deletes enabled when it was snapshotted with soft deletes enabled. I've fixed this now.
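
For reference, a hedged sketch of the kind of setting involved (not the exact test change, and it assumes the Elasticsearch Settings builder is on the classpath): when the source shard is snapshotted with soft deletes enabled, the restore target in the test needs the same index.soft_deletes.enabled value, otherwise the Lucene and translog operations cannot be expected to line up.

    import org.elasticsearch.common.settings.Settings;

    final class RestoreTargetSettingsSketch {
        // Carry the source's soft-deletes setting over to the restore target.
        static Settings restoreTargetSettings(boolean sourceUsesSoftDeletes) {
            return Settings.builder()
                .put("index.soft_deletes.enabled", sourceUsesSoftDeletes)
                .build();
        }
    }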

@ywelsch (Contributor) commented Feb 5, 2019

@elasticmachine run elasticsearch-ci/default-distro

@ywelsch (Contributor) commented Feb 5, 2019

@elasticmachine run elasticsearch-ci/2

@Tim-Brooks Tim-Brooks merged commit c2a8fe1 into elastic:master Feb 5, 2019
jasontedor added a commit to jasontedor/elasticsearch that referenced this pull request Feb 5, 2019
* master: (23 commits)
  Lift retention lease expiration to index shard (elastic#38380)
  Make Ccr recovery file chunk size configurable (elastic#38370)
  Prevent CCR recovery from missing documents (elastic#38237)
  re-enables awaitsfixed datemath tests (elastic#38376)
  Types removal fix FullClusterRestartIT warnings (elastic#38445)
  Make sure to reject mappings with type _doc when include_type_name is false. (elastic#38270)
  Updates the grok patterns to be consistent with logstash (elastic#27181)
  Ignore type-removal warnings in XPackRestTestHelper (elastic#38431)
  testHlrcFromXContent() should respect assertToXContentEquivalence() (elastic#38232)
  add basic REST test for geohash_grid (elastic#37996)
  Remove DiscoveryPlugin#getDiscoveryTypes (elastic#38414)
  Fix the clock resolution to millis in GetWatchResponseTests (elastic#38405)
  Throw AssertionError when no master (elastic#38432)
  `if_seq_no` and `if_primary_term` parameters aren't wired correctly in REST Client's CRUD API (elastic#38411)
  Enable CronEvalToolTest.testEnsureDateIsShownInRootLocale (elastic#38394)
  Fix failures in BulkProcessorIT#testGlobalParametersAndBulkProcessor. (elastic#38129)
  SQL: Implement CURRENT_DATE (elastic#38175)
  Mute testReadRequestsReturnLatestMappingVersion (elastic#38438)
  [ML] Report index unavailable instead of waiting for lazy node (elastic#38423)
  Update Rollup Caps to allow unknown fields (elastic#38339)
  ...
Tim-Brooks added a commit to Tim-Brooks/elasticsearch that referenced this pull request Feb 5, 2019
Currently the snapshot/restore process manually sets the global
checkpoint to the max sequence number from the restored segments. This
does not work for CCR, as it prevents documents that would otherwise be
recovered by the normal following operation from being recovered.

This commit fixes this issue by setting the initial global checkpoint to
the existing local checkpoint.
@Tim-Brooks Tim-Brooks deleted the ccr_initial_global_checkpoint branch December 18, 2019 14:48
Labels
>bug :Distributed Indexing/CCR Issues around the Cross Cluster State Replication features v6.7.0 v7.0.0-beta1
4 participants