-
Notifications
You must be signed in to change notification settings - Fork 3.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
roachtest: admission-control/disk-bandwidth-limiter failed #131484
Comments
😞 |
![]() ![]() Seemingly random spike in bandwidth. I think it is likely due to a compaction. And since the bandwidth limiter doesn't react instantly, we hit the threshold. Maybe we should smooth out the assertion here to avoid a spike like this to cause an assertion failure. My hypothesis is that it would auto recover by reducing the amount of tokens available in the next run. It is likely that our read estimation is problematic since we only adjust for it every 15s and it is based on the past window. Either way, it should have recovered. And we expect to do better (adjust reads at a higher frequency) once we have reads hooked up to the limiter as well. @sumeerbhola what do you think? |
We could also lower the utilization threshold. It is currently at 0.8, maybe a value of 0.7 makes more sense. This will give regular work more headroom. |
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 67dc7a1c9bf117046b10513c3277bf7ccf0db975:
Parameters:
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 5400cb9a70e63bfe1aa2849a566c195ad63130d1:
Parameters:
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ b6c13686495bbe9ad476b28033461ef7628e18a8:
Parameters:
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ b6c13686495bbe9ad476b28033461ef7628e18a8:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters |
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ b6c13686495bbe9ad476b28033461ef7628e18a8:
Parameters:
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ b6c13686495bbe9ad476b28033461ef7628e18a8:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters |
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 74333311616b937fea6a995462215a1cb5962686:
Parameters:
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 74333311616b937fea6a995462215a1cb5962686:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters |
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ ec2573dc6aaeefc226440bb2c5a7c94a63989868:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters |
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ ec2573dc6aaeefc226440bb2c5a7c94a63989868:
Parameters:
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 0c0af9540ed3f9d63eba523bc870eeb6c7eebe90:
Parameters:
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 4de315c9ca4ccf7c3bdbf53a5226e8c14c84a68e:
Parameters:
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 4de315c9ca4ccf7c3bdbf53a5226e8c14c84a68e:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters |
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ f842c3b4b5adc040d411bd17d7d10005273fc1b6:
Parameters:
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ becbd0fcdfa2e37a6ff23b33af70f2f91eca0790:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ f9918d8f81a1829df63ac734fd6d21c60141e338:
Parameters:
Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 17535c13cfed95db70cd8dfb1ba6a700686f57b1:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 17535c13cfed95db70cd8dfb1ba6a700686f57b1:
Parameters:
Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 6bb6dc96ebf0ee2f23c5c568fa0d421019dc0946:
Parameters:
Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 6bb6dc96ebf0ee2f23c5c568fa0d421019dc0946:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 27c521de897105cdeeed88c3a853380c14345a22:
Parameters:
Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 27c521de897105cdeeed88c3a853380c14345a22:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 015b2f48cf80a6d8b60d7038c8c3457d934c716a:
Parameters:
Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ d0e07efe30dfe64d36412363000a1b977b4d5d2e:
Parameters:
Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ a60d739746648922134ec3c0a22bb069bf1d283c:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ a44a9b1ffce25f51026b494a1dcb393cfc5361f3:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ a44a9b1ffce25f51026b494a1dcb393cfc5361f3:
Parameters:
Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 688e82e8d015350fe3aa263484416d28b232a25d:
Parameters:
Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 688e82e8d015350fe3aa263484416d28b232a25d:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 8f5366d09e6cf2144ca43f9cdda7e1128a13fbf8:
Parameters:
Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 8f5366d09e6cf2144ca43f9cdda7e1128a13fbf8:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 8f5366d09e6cf2144ca43f9cdda7e1128a13fbf8:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ ea4644b040dd4503f2eb7292cfebc31a58fd16fb:
Parameters:
Same failure on other branches
|
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ ea4644b040dd4503f2eb7292cfebc31a58fd16fb:
Parameters:
Help
See: roachtest README See: How To Investigate (internal) Grafana is not yet available for aws clusters Same failure on other branches
|
134430: roachtest: disk bandwidth limiter test should only asssert on writes r=sumeerbhola a=aadityasondhi Since we do not pace reads yet, the test will remain flaky in this assertion, as the system can see unbounded read bandwidth usage and fail the assertion even if writes are paced. Fixes #131484 Release note: None 134527: roachtest: add debugging to gossip/chaos r=tbg a=tbg This test has had a string of weird failures where either a `t.L().Printf` call or `time.Sleep(1s)` take dozens of seconds. This PR adds a goroutine that gets spawned right before and, unless signaled within 2s by both the Printf and the Sleep having completed, dumps stacks to stderr. See the main issue #130737. Closes the duplicates across various branches: Closes #132651. Closes #134495. Epic: none Release note: None 134751: lease: dump stacks if TestDescriptorRefreshOnRetry fails r=rafiss a=rafiss We added additional logging to help debug a source of flakiness in which the acquisition counts exceed the number of release counts. For that logging to be useful, we need to know the goroutine IDs and stacks. Marking this as fixing the linked issue so that the next time it fails, we are reminded to look at the logs. fixes: #134695 Release note: None 134953: kvserver/rangefeed: rename Disconnect to SendError for stream interface r=tbg,stevendanna a=wenyihu6 This patch renames `Disconnect` to `SendError` in the `rangefeed.Stream` interface to clarify its role for sending errors, distinguishing it from other similarly named functions like `registration.disconnect`. Part of: #110432 Release note: none Co-authored-by: Steven Danna [email protected] Co-authored-by: Aaditya Sondhi <[email protected]> Co-authored-by: Tobias Grieger <[email protected]> Co-authored-by: Rafi Shamim <[email protected]> Co-authored-by: Wenyi Hu <[email protected]>
Based on the specified backports for linked PR #134430, I applied the following new label(s) to this issue: branch-release-24.3. Please adjust the labels as needed to match the branches actually affected by this issue, including adding any known older branches. 🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf. |
Since we do not pace reads yet, the test will remain flaky in this assertion, as the system can see unbounded read bandwidth usage and fail the assertion even if writes are paced. Fixes #131484 Release note: None
roachtest.admission-control/disk-bandwidth-limiter failed with artifacts on master @ 67dc7a1c9bf117046b10513c3277bf7ccf0db975:
Parameters:
ROACHTEST_arch=amd64
ROACHTEST_cloud=gce
ROACHTEST_coverageBuild=false
ROACHTEST_cpu=8
ROACHTEST_encrypted=false
ROACHTEST_fs=ext4
ROACHTEST_localSSD=true
ROACHTEST_runtimeAssertionsBuild=false
ROACHTEST_ssd=0
Help
See: roachtest README
See: How To Investigate (internal)
See: Grafana
This test on roachdash | Improve this report!
Jira issue: CRDB-42564
The text was updated successfully, but these errors were encountered: