Batch processor batch size will grow endlessly on error #1833
Labels
area:trace
Part of OpenTelemetry tracing
bug
Something isn't working
help wanted
Extra attention is needed
pkg:SDK
Related to an SDK package
Milestone
Description
Hey, came across this bug as I was sending invalid utf-8 strings through the gRPC driver. The invalid spans would cause the batch to roll-over and be retried. If the batch size is already at/over max then the batch size check fails permanently and the batch will only be sent again after the timeout. As the batch grows it risks hitting gRPC message size limits imposed by the server.
How this looks with some debug logging wrapped around the driver:
Environment
Steps To Reproduce
I don't have a repro setup outside my work project. It should be possible to reproduce using unit tests on the batch span processor though.
Expected behavior
Either for the failed batch to be dropped, or some more intelligent retry with eventual discard
The text was updated successfully, but these errors were encountered: