-
-
Notifications
You must be signed in to change notification settings - Fork 198
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Question] REBALANCE_IN_PROGRESS error while committing offsets #409
Comments
This has bugged me forever, too. Notes from spelunking the Kafka source / how it can interact with the client:
I think the fix internally is to block the join request within the client, I can work on this later today I hope |
Huh, thanks. It definitely makes me feel better that I'm not the only one with this problem :) |
If you could try this PR: #410 |
That was quick! |
Ok, I tried this out and I think that it's working. I deployed to a subset of consumer instances and I didn't see any of these errors from the members that I had upgraded. |
I'm aiming to investigate one to three more things before releasing this, fwiw -- I don't think this is that high priority to release ASAP, but it will be a patch release. |
No problem. Do you think #410 is in good enough shape for me to use that in the meantime? |
I loop tested it for a long time, it should be |
This is now pushed in v1.13.2 |
Thanks a lot! |
Hey!
I'm almost certain this is about me doing something wrong, and not a bug with the library, but I'm hoping you might have an idea of what it is that I'm doing wrong :)
So, I have a cooperative-sticky consumer that uses
BlockRebalanceOnPoll
andDisableAutoCommit
, and then during the poll loop I do something like:When the group rebalances (e.g., adding a new topic-partition), I see from the rebalance event logs that only one consumer instance has had a partition added (as I'd expect), but then I see a bunch of these logs:
franz-go/pkg/kgo/consumer_group.go
Line 1920 in 1a59c2d
CommitUncommittedOffsets
call returningkerr.RebalanceInProgress
.I'm not sure why the consumers are failing to commit their offsets given that rebalances are blocked during the poll loop? (I don't see any evidence of consumer instances being evicted from the group due to timeouts.)
Any insight you might have would be much appreciated.
The text was updated successfully, but these errors were encountered: