Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Segment Replication]. Support graceful replication failure. #2332

Closed
Tracked by #2194
mch2 opened this issue Mar 3, 2022 · 1 comment
Closed
Tracked by #2194

[Segment Replication]. Support graceful replication failure. #2332

mch2 opened this issue Mar 3, 2022 · 1 comment

Comments

@mch2
Copy link
Member

mch2 commented Mar 3, 2022

Right now when there is any error on replicas during the replication process the shard breaks and is not pulled from rotation or recreated.

The shard should fail gracefully by being removed from allocation and recreated. When rebooted the shard should be able to recover and restart as new by picking up the next replication checkpoint.

@mch2
Copy link
Member Author

mch2 commented Jun 24, 2022

Closing this as outdated. Replicas will fail & be removed from rotation when a failure occurs. https://github.com/opensearch-project/OpenSearch/blob/main/server/src/main/java/org/opensearch/indices/replication/SegmentReplicationTargetService.java#L112

@mch2 mch2 closed this as completed Jun 24, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant