[Segment Replication] Send Remote shard failure for lagging replicas #6606

Closed
mch2 opened this issue Mar 9, 2023 · 0 comments · Fixed by #6850
Assignees
Labels
distributed framework enhancement Enhancement or improvement to existing feature or request

Comments

mch2 commented Mar 9, 2023

As the last part of #4478, we need a mechanism to recognize shards that have been stale for too long (using the metrics introduced in #6520) and send a remote shard failure for them. This is a remote failure because these metrics are collected by the primary shard.

I am thinking we add a background task that checks on a short schedule (30s) and invokes
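
To make the idea of a periodic check concrete, here is a minimal sketch of a background task that runs on a fixed schedule and flags replicas whose lag exceeds a limit. Only the 30s interval comes from the comment above. Everything else is an assumption for illustration: the class and method names, the lag map, the threshold, and the `onLagging` callback (a stand-in for whatever the task would actually invoke) are hypothetical and are not OpenSearch APIs.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.function.Consumer;
import java.util.function.Supplier;

// Hypothetical sketch: none of these names are taken from OpenSearch.
class LaggingReplicaChecker {
    // Assumed source of metrics on the primary: replica id -> milliseconds behind.
    private final Supplier<Map<String, Long>> lagSupplier;
    // Assumed threshold; the issue text does not state a value.
    private final long maxLagMillis;
    // Placeholder for the action taken on a stale replica (e.g. reporting a failure).
    private final Consumer<String> onLagging;

    LaggingReplicaChecker(Supplier<Map<String, Long>> lagSupplier,
                          long maxLagMillis,
                          Consumer<String> onLagging) {
        this.lagSupplier = lagSupplier;
        this.maxLagMillis = maxLagMillis;
        this.onLagging = onLagging;
    }

    // Pure helper: returns the ids whose lag is strictly above the threshold,
    // sorted by id so the result is deterministic.
    static List<String> findLagging(Map<String, Long> lagByReplica, long maxLagMillis) {
        List<String> lagging = new ArrayList<>();
        for (Map.Entry<String, Long> entry : new TreeMap<>(lagByReplica).entrySet()) {
            if (entry.getValue() > maxLagMillis) {
                lagging.add(entry.getKey());
            }
        }
        return lagging;
    }

    // One pass of the check: read current lag and invoke the callback per stale replica.
    void runOnce() {
        for (String replicaId : findLagging(lagSupplier.get(), maxLagMillis)) {
            onLagging.accept(replicaId);
        }
    }

    // Schedules runOnce repeatedly; the caller owns the executor and must shut it down.
    ScheduledExecutorService start(long intervalSeconds) {
        ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();
        scheduler.scheduleWithFixedDelay(this::runOnce, intervalSeconds, intervalSeconds, TimeUnit.SECONDS);
        return scheduler;
    }
}
```

With this shape, `start(30)` would match the proposed 30s schedule, and keeping `findLagging` free of side effects makes the staleness rule testable without a running scheduler.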

Projects
Status: Done