worker(dm): add retry for watch when network is weak #4553
@@ -269,7 +269,11 @@ func WatchSourceBound(ctx context.Context, cli *clientv3.Client, worker string,
case <-ctx.Done():
return
case resp, ok := <-ch:
What's the reason for this WatchChan being closed?
The link you gave does not point to the version and path of the etcd Watch we use.
Line 83 in b301406
go.etcd.io/etcd v0.5.0-alpha.5.0.20210512015243-d19fbe541bf9
w.streams will only be nil after w.Close(), so I don't think that's the root cause.
If you have time, can you help locate the problem? I don't have any ideas at the moment. I'll send you the log in a private chat.
/run-dm-integration-tests
Codecov Report
Flags with carried forward coverage won't be shown.
@@ Coverage Diff @@
## master #4553 +/- ##
================================================
- Coverage 55.6402% 55.5836% -0.0566%
================================================
Files 494 502 +8
Lines 61283 62432 +1149
================================================
+ Hits 34098 34702 +604
- Misses 23750 24270 +520
- Partials 3435 3460 +25
/run-dm-integration-tests
from etcd
It seems that the closed gRPC streams are not the root cause of #4548. We updated the embedded etcd in #4755, so let's see if that solves the problem 😵
/hold
I tested the behaviour of watch: after I turned off the WiFi on my laptop, watch retried indefinitely rather than terminating.
Closing for now since there are no updates.
What problem does this PR solve?
Issue Number: close #4548
What is changed and how it works?
Check List
Tests
Code changes
Side effects
Related changes
Release note