-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
roachtest/bank: don't start chaos until nodes are serving SQL #30989
roachtest/bank: don't start chaos until nodes are serving SQL #30989
Conversation
Please hold off on this. I thought it would fix the second class of failures we saw in #30064, but I just had a repro so it doesn't seem like it did. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm surprised why you didn't change the break
on l156 into a return
.
I still have this cleanup on my plate, including the l156 fix, but I'd like to actually get this test working without flakes first, which is turning out to be surprisingly tricky. |
9280b9d
to
5bd0dec
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewed 1 of 1 files at r1, 1 of 1 files at r2.
Reviewable status:complete! 0 of 0 LGTMs obtained (and 1 stale)
pkg/cmd/roachtest/bank.go, line 210 at r1 (raw file):
consistentIdx int, ) { s.waitGroup.Add(1)
Is this kosher? You're calling Add
from a goroutine and then calling Wait
from the main routine. I think this would hit an internal assertion in WaitGroup
, except that the Wait
doesn't happen right away.
pkg/cmd/roachtest/bank.go, line 215 at r3 (raw file):
// Don't begin the chaos monkey until all nodes are serving SQL connections. // This ensures that we don't test cluster initialization under chaos. for i := 0; i < c.nodes; i++ {
Nit: for i := 0; i <= c.nodes; i++
. Then in the body you can use i
instead of i+1
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewable status:
complete! 1 of 0 LGTMs obtained
pkg/cmd/roachtest/bank.go, line 210 at r1 (raw file):
Previously, petermattis (Peter Mattis) wrote…
Is this kosher? You're calling
Add
from a goroutine and then callingWait
from the main routine. I think this would hit an internal assertion inWaitGroup
, except that theWait
doesn't happen right away.
Yeah, you're absolutely right.
pkg/cmd/roachtest/bank.go, line 215 at r3 (raw file):
Previously, petermattis (Peter Mattis) wrote…
Nit:
for i := 0; i <= c.nodes; i++
. Then in the body you can usei
instead ofi+1
.
Make it for i := 1; i <= c.nodes; i++
and you've got a deal.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
mod Peter's comments which it looks you're fixing up right now.
Reviewed 1 of 1 files at r1, 1 of 1 files at r2, 1 of 1 files at r3.
Reviewable status:complete! 1 of 0 LGTMs obtained (and 1 stale)
5bd0dec
to
0cff6f2
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewable status:
complete! 0 of 0 LGTMs obtained (and 2 stale)
pkg/cmd/roachtest/bank.go, line 210 at r1 (raw file):
Previously, nvanbenschoten (Nathan VanBenschoten) wrote…
Yeah, you're absolutely right.
Done.
pkg/cmd/roachtest/bank.go, line 215 at r3 (raw file):
Previously, nvanbenschoten (Nathan VanBenschoten) wrote…
Make it
for i := 1; i <= c.nodes; i++
and you've got a deal.
Done.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewable status:
complete! 0 of 0 LGTMs obtained (and 2 stale)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Having just read through the recent comments on #30064 I agree this is the right change to make. We should also reduce the migration lease duration, though. There's no need for it to be so long given that we renew it in the background while running migrations.
Release note: None
Release note: None
Informs cockroachdb#30064. This prevents us from running into issues with SQL migrations. Release note: None
0cff6f2
to
b1ab7dd
Compare
TFTRs.
I was hoping for a quick fix here, but it looks like this is tied to the bors r+ |
30989: roachtest/bank: don't start chaos until nodes are serving SQL r=nvanbenschoten a=nvanbenschoten Informs #30064. Release note: None Co-authored-by: Nathan VanBenschoten <[email protected]>
Build succeeded |
Informs #30064.
Release note: None