-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
kvserver: v21.2.7: nil pointer in gRPC transport when sending snapshots #81227
Comments
Huh, nil pointer in the gRPC transport. Probably unactionable. |
cc @cockroachdb/replication |
FWIW, thought this might be related to gRPC issues that Andrei has been looking at over in #80878, but doesn't seem to be the case (those were caused by calls to |
It looks like the UPD: More likely |
This seems to be fixed in grpc/grpc-go#5323. |
There are no recent re-occurrences of this panic in Sentry. |
Although it doesn't look like we picked up this change. Reopening to investigate further. |
Need to bump |
Fixes cockroachdb#81227 Release note: upgrade grpc from v1.46.0 to v.1.47.0 which fixes a rare bug that hit us with panic on a nil pointer.
88454: ui: insights transaction details support multiple blocking transactions r=j82w a=j82w This adds support for multiple blocking transactions for a single waiting transaction. The cards were merged into the table, and the data was piped through to show multiple rows. The total contention time was also fixed to aggregate the contention time instead of just picking the latest. before: https://loom.com/share/0384ed937a344e2fb0105fefbc313acb after: https://www.loom.com/share/78e906f50a694cd59ac893ddb9c2239a closes #88264 Release justification: Category 2: Bug fixes and low-risk updates to new functionality Release note: (ui change): Add support for multiple blocking transaction on insights transaction details page. Merged the cards into the table, and fixed the total contention time. 88470: *: upgrade grpc to v1.47.0 r=erikgrinaker a=pavelkalinnikov Fixes #81227 Release note: upgrade grpc from v1.46.0 to v.1.47.0 which fixes a subtle bug causing panic on a nil pointer. 88477: keys: mark 49 as reserved r=ajwerner a=ajwerner Release note: None 88496: persistedsqlstats: speed up a test r=yuzefovich a=yuzefovich Previously, a single unit test could take on the order of 4 minutes (or even exceed 5 minute timeout, rarely) because the job monitor checks whether a cluster setting has been updated only every minute, and we update the cluster setting twice in a unit test. This commit makes it so that in a testing setup the check happens every second. Release note: None 88499: bazel: upgrade `rules_go` r=rail a=healthy-pod Pull in cockroachdb/rules_go#8. Closes #88048 Release note: None Co-authored-by: j82w <[email protected]> Co-authored-by: Pavel Kalinnikov <[email protected]> Co-authored-by: Andrew Werner <[email protected]> Co-authored-by: Yahor Yuzefovich <[email protected]> Co-authored-by: healthy-pod <[email protected]>
Reopening, since we had to roll back to 1.46.0 due to other test failures (#88745). |
@nvanbenschoten I'm handing this over to KV, since it's a general bug in gRPC which requires an upgrade. We should try to get this in early in the 23.1 cycle, and fix the issues that motivated #88745. |
I also run into the panic in the gRPC that was fixed upstream in grpc/grpc-go#5323. Looking forward to getting the gRPC dependency bumped :) |
I'll pick this up again. |
cc @cockroachdb/replication |
cc @cockroachdb/replication |
This issue was autofiled by Sentry. It represents a crash or reported error on a live cluster with telemetry enabled.
Sentry link: https://sentry.io/organizations/cockroach-labs/issues/3269587070/?referrer=webhooks_plugin
Panic message:
Stacktrace (expand for inline code snippets):
/usr/local/go/src/runtime/panic.go#L964-L966 in runtime.gopanic
/usr/local/go/src/runtime/panic.go#L211-L213 in runtime.panicmem
/usr/local/go/src/runtime/signal_unix.go#L733-L735 in runtime.sigpanic
https://github.com/cockroachdb/cockroach/blob/37dee546a7c52870a8dc58826f0cffe2afa8d47a/vendor/google.golang.org/grpc/internal/transport/transport.go#L308-L310 in google.golang.org/grpc/internal/transport.(*Stream).compareAndSwapState
https://github.com/cockroachdb/cockroach/blob/37dee546a7c52870a8dc58826f0cffe2afa8d47a/vendor/google.golang.org/grpc/internal/transport/http2_client.go#L927-L929 in google.golang.org/grpc/internal/transport.(*http2Client).Write
https://github.com/cockroachdb/cockroach/blob/37dee546a7c52870a8dc58826f0cffe2afa8d47a/vendor/google.golang.org/grpc/stream.go#L845-L847 in google.golang.org/grpc.(*clientStream).CloseSend.func1
https://github.com/cockroachdb/cockroach/blob/37dee546a7c52870a8dc58826f0cffe2afa8d47a/vendor/google.golang.org/grpc/stream.go#L661-L663 in google.golang.org/grpc.(*clientStream).withRetry
https://github.com/cockroachdb/cockroach/blob/37dee546a7c52870a8dc58826f0cffe2afa8d47a/vendor/google.golang.org/grpc/stream.go#L852-L854 in google.golang.org/grpc.(*clientStream).CloseSend
cockroach/pkg/kv/kvserver/raft_transport.go
Lines 690 to 692 in 37dee54
cockroach/pkg/kv/kvserver/raft_transport.go
Lines 694 to 696 in 37dee54
cockroach/pkg/kv/kvserver/replica_command.go
Lines 2524 to 2526 in 37dee54
cockroach/pkg/util/contextutil/context.go
Lines 88 to 90 in 37dee54
cockroach/pkg/kv/kvserver/replica_command.go
Lines 2522 to 2524 in 37dee54
cockroach/pkg/kv/kvserver/replica_command.go
Lines 1679 to 1681 in 37dee54
cockroach/pkg/kv/kvserver/replica_command.go
Lines 1048 to 1050 in 37dee54
cockroach/pkg/kv/kvserver/replicate_queue.go
Lines 1362 to 1364 in 37dee54
cockroach/pkg/kv/kvserver/replicate_queue.go
Lines 1123 to 1125 in 37dee54
cockroach/pkg/kv/kvserver/replicate_queue.go
Lines 481 to 483 in 37dee54
cockroach/pkg/kv/kvserver/replicate_queue.go
Lines 303 to 305 in 37dee54
cockroach/pkg/kv/kvserver/queue.go
Lines 958 to 960 in 37dee54
cockroach/pkg/util/contextutil/context.go
Lines 88 to 90 in 37dee54
cockroach/pkg/kv/kvserver/queue.go
Lines 917 to 919 in 37dee54
cockroach/pkg/kv/kvserver/queue.go
Lines 837 to 839 in 37dee54
cockroach/pkg/util/stop/stopper.go
Lines 441 to 443 in 37dee54
/usr/local/go/src/runtime/asm_amd64.s#L1370-L1372 in runtime.goexit
v21.2.7
Jira issue: CRDB-15285
The text was updated successfully, but these errors were encountered: