-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
roachtest: failover/chaos/read-only failed #123736
Labels
A-testing
Testing tools and infrastructure
C-bug
Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior.
C-test-failure
Broken test (automatically or manually discovered).
O-roachtest
O-robot
Originated from a bot.
T-kv
KV Team
Milestone
Comments
cockroach-teamcity
added
branch-release-24.1
Used to mark GA and release blockers, technical advisories, and bugs for 24.1
C-test-failure
Broken test (automatically or manually discovered).
O-roachtest
O-robot
Originated from a bot.
release-blocker
Indicates a release-blocker. Use with branch-release-2x.x label to denote which branch is blocked.
T-kv
KV Team
labels
May 7, 2024
nvanbenschoten
added
GA-blocker
and removed
release-blocker
Indicates a release-blocker. Use with branch-release-2x.x label to denote which branch is blocked.
labels
May 8, 2024
We see that the n7 was unable to make any outgoing connections around the time of the failure. From the logs:
This is induced by the test:
With DistSender circuit breakers disabled, it makes sense why the request from n9 got stuck and eventually timed out. I'll remove the blocker label. However, before we close this out, I think we should turn on DistSender circuit breakers for these failover chaos tests. |
arulajmani
removed
GA-blocker
branch-release-24.1
Used to mark GA and release blockers, technical advisories, and bugs for 24.1
labels
May 8, 2024
arulajmani
added a commit
to arulajmani/cockroach
that referenced
this issue
May 8, 2024
Failover chaos tests create assymetric partitions, where DistSender circuit breakers are useful. It prevents failure modes such as cockroachdb#123736 (comment). Fixes cockroachdb#123736 Release note: None
craig bot
pushed a commit
that referenced
this issue
May 10, 2024
123820: roachtest: turn on DistSender circuit breakers for failover chaos tests r=nicktrav,andrewbaptist a=arulajmani Failover chaos tests create asymetric partitions where DistSender circuit breakers are useful. It prevents failure modes such as #123736 (comment). Fixes #123736 Release note: None Co-authored-by: Arul Ajmani <[email protected]>
blathers-crl bot
pushed a commit
that referenced
this issue
May 10, 2024
Failover chaos tests create asymetric partitions where DistSender circuit breakers are useful. It prevents failure modes such as #123736 (comment). Fixes #123736 Release note: None
nvanbenschoten
added
A-testing
Testing tools and infrastructure
C-enhancement
Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception)
C-bug
Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior.
and removed
C-enhancement
Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception)
labels
May 10, 2024
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
A-testing
Testing tools and infrastructure
C-bug
Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior.
C-test-failure
Broken test (automatically or manually discovered).
O-roachtest
O-robot
Originated from a bot.
T-kv
KV Team
roachtest.failover/chaos/read-only failed with artifacts on release-24.1 @ b083c4a9b946e2a7d6a79b46eef77410f0b742ce:
Parameters:
ROACHTEST_arch=amd64
ROACHTEST_cloud=gce
ROACHTEST_coverageBuild=false
ROACHTEST_cpu=2
ROACHTEST_encrypted=false
ROACHTEST_fs=ext4
ROACHTEST_localSSD=false
ROACHTEST_metamorphicBuild=false
ROACHTEST_ssd=0
Help
See: roachtest README
See: How To Investigate (internal)
See: Grafana
This test on roachdash | Improve this report!
Jira issue: CRDB-38514
The text was updated successfully, but these errors were encountered: