Add Chunk Size to RR Balancer (Increased Batching Ability) #1232

erushing · 2023-11-15T22:14:04Z

The motivation here is to try to get Kafka-Go's RR Balancer to batch more aggressively. The RR Balancer naively iterates through the available partitions, putting a single message on each of them until the batch timeout. If it could put x messages on each partition before moving onto the next one, it would still be evenly distributed, but batch better.

The coding approach was to bring in the same chunking mechanism our internal Bulrush library uses, but simplify it and adapt it for the code-style of Kafka-Go.

balancer.go

petedannemann · 2023-11-16T14:26:01Z

balancer.go

+// across all available partitions, but puts greater emphasis on batching by a chunk size
+// within a shorter time period than is possible via the regular RoundRobin Balancer.
+type ChunkedRoundRobin struct {
+	chunkSize int


I think we want chunkSize to be public so that users can configure it

Yeah, this is where I thought the test showed you could pass in a value to the struct, but I would find this out for real when I tried to actually test this with a kafka-go based service. I see that Bulrush uses a setter for this, so I'm sure I'm off base with the way I did this.

I had this worker in mind where I want to be able to pass in a chunk size as they pass in RR as a balancer.
https://github.com/segmentio/identity/blob/2d04b8f5a16d235453c922e1e9d1ff7ca1b2a92b/identity-resolver/worker.go#L290C21-L290C21

The test is in the same package as the balancer so it is able to reference private fields. That won't be the case for users of the balancer. We don't use setters elsewhere in this kafka-go package and just use public fields for fields that we want to expose to users so I'd be inclined to stick with that convention

erushing · 2023-11-27T17:41:24Z

This was tested on a live application and showed increased batching, despite fairly low throughput through each producer and a low batch timeout value (10ms). Chunk Size of 10

balancer.go

Eric Rushing added 2 commits November 15, 2023 16:13

dp-1862 - Initial Spike on Bulrush-Style chunked RR Balancer/Partitioner

ce466d1

lint

4f0b6b5

petedannemann reviewed Nov 16, 2023

View reviewed changes

Eric Rushing added 2 commits November 16, 2023 10:10

fix locking, make chunk size public

9cd93de

re-trigger CI tests

0dc200d

erushing requested a review from petedannemann November 27, 2023 17:52

erushing changed the title ~~dp-1862 - Initial Spike on Bulrush-Style chunked RR Balancer/Partitioner~~ Chunked RR Balancer/Partitioner (Increased Batching) Nov 27, 2023

Eric Rushing added 2 commits November 27, 2023 12:51

refactor to add this functionality to existing RR balancer

22cc61a

lint

514b759

erushing changed the title ~~Chunked RR Balancer/Partitioner (Increased Batching)~~ Add Chunk Size to RR Balancer (Increased Batching Ability) Nov 27, 2023

petedannemann reviewed Nov 27, 2023

View reviewed changes

balancer.go Outdated Show resolved Hide resolved

Eric Rushing added 2 commits November 27, 2023 13:35

refactor counter synchronization

f25bb1d

whitespace

3c41bc3

erushing requested a review from petedannemann November 27, 2023 20:21

petedannemann approved these changes Nov 27, 2023

View reviewed changes

erushing merged commit f568774 into main Nov 27, 2023

erushing deleted the er/test-rr-balancer branch November 27, 2023 21:54

petedannemann mentioned this pull request Dec 12, 2023

fix: data race in roundrobin balancer #1251

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Chunk Size to RR Balancer (Increased Batching Ability) #1232

Add Chunk Size to RR Balancer (Increased Batching Ability) #1232

erushing commented Nov 15, 2023 •

edited

Loading

petedannemann Nov 16, 2023

erushing Nov 16, 2023

petedannemann Nov 16, 2023

erushing commented Nov 27, 2023 •

edited

Loading

Add Chunk Size to RR Balancer (Increased Batching Ability) #1232

Add Chunk Size to RR Balancer (Increased Batching Ability) #1232

Conversation

erushing commented Nov 15, 2023 • edited Loading

petedannemann Nov 16, 2023

Choose a reason for hiding this comment

erushing Nov 16, 2023

Choose a reason for hiding this comment

petedannemann Nov 16, 2023

Choose a reason for hiding this comment

erushing commented Nov 27, 2023 • edited Loading

erushing commented Nov 15, 2023 •

edited

Loading

erushing commented Nov 27, 2023 •

edited

Loading