Between the messageAggregator and the flusher we currently re-shuffle the full set of messages to be produced three times: once into a single []*ProducerMessage for batching purposes, once into a map[string]map[int32][]*ProducerMessage to aid internal state transitions, and finally into the actual ProduceRequest structure. This whole process is rather silly and over-complicated.
Once #300 lands, we should be able to simplify this substantially. With the appropriate re-organization of state, the messageAggregator should be able to put messages directly into a ProduceRequest as they arrive, getting rid of both existing re-shuffling passes.
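For illustration, here is a minimal sketch (hypothetical types and names, not the current sarama API) of what "put messages directly into a ProduceRequest as they arrive" could look like: the aggregator appends each incoming message straight into the topic/partition shape the wire request needs, so neither of the intermediate re-shuffling passes is required.

```go
package main

import "fmt"

// ProducerMessage is a trimmed-down stand-in for the producer message type.
type ProducerMessage struct {
	Topic     string
	Partition int32
	Value     []byte
}

// buildingRequest is a hypothetical stand-in for a ProduceRequest that is
// assembled incrementally instead of being constructed in a separate pass.
type buildingRequest struct {
	msgSets map[string]map[int32][]*ProducerMessage
}

func newBuildingRequest() *buildingRequest {
	return &buildingRequest{msgSets: make(map[string]map[int32][]*ProducerMessage)}
}

// addMessage places the message directly where the request needs it.
func (r *buildingRequest) addMessage(msg *ProducerMessage) {
	partitions := r.msgSets[msg.Topic]
	if partitions == nil {
		partitions = make(map[int32][]*ProducerMessage)
		r.msgSets[msg.Topic] = partitions
	}
	partitions[msg.Partition] = append(partitions[msg.Partition], msg)
}

func main() {
	req := newBuildingRequest()
	req.addMessage(&ProducerMessage{Topic: "events", Partition: 0, Value: []byte("a")})
	req.addMessage(&ProducerMessage{Topic: "events", Partition: 1, Value: []byte("b")})
	fmt.Println(len(req.msgSets["events"])) // 2 partitions buffered, no re-shuffle needed
}
```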
This change will also enable one other subtle optimization. Since compressed messages are wrapped together and sent as the payload of a single "message" in the protocol, the total size of compressed messages sent is limited not just by MaxRequestSize, but also by MaxMessageBytes, which is typically much smaller.
However, while MaxRequestSize is per-request, MaxMessageBytes (as it applies to compressed message sets masquerading as single messages) is per-partition. Unfortunately, the current messageAggregator enforces this limit per request because it doesn't have the state to calculate it per partition. This has the effect of artificially limiting throughput when compression is enabled and multiple topics are being produced to the same broker (you end up with each request limited to 1MB instead of each partition of each request being limited to 1MB).
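To make the distinction concrete, here is a rough sketch (all names hypothetical) of what per-partition accounting could look like: each partition's buffered bytes are checked against a MaxMessageBytes-style cap, since that partition's messages become a single compressed "message" on the wire, while the request total is still checked against a MaxRequestSize-style cap.

```go
package main

import "fmt"

// topicPartition identifies one partition's message set within a request.
type topicPartition struct {
	topic     string
	partition int32
}

// sizeTracker is a hypothetical sketch of per-partition size accounting:
// each partition's set has its own cap (it becomes one compressed "message"),
// while the request as a whole has its own, larger cap.
type sizeTracker struct {
	maxMessageBytes int // cap on one partition's (compressed) message set
	maxRequestSize  int // cap on the whole request
	requestBytes    int
	partitionBytes  map[topicPartition]int
}

func newSizeTracker(maxMessageBytes, maxRequestSize int) *sizeTracker {
	return &sizeTracker{
		maxMessageBytes: maxMessageBytes,
		maxRequestSize:  maxRequestSize,
		partitionBytes:  make(map[topicPartition]int),
	}
}

// wouldOverflow reports whether adding size bytes for tp would exceed either
// limit. Checking partitionBytes (not just requestBytes) is what lets each
// partition use the full MaxMessageBytes budget independently.
func (t *sizeTracker) wouldOverflow(tp topicPartition, size int) bool {
	return t.partitionBytes[tp]+size > t.maxMessageBytes ||
		t.requestBytes+size > t.maxRequestSize
}

func (t *sizeTracker) add(tp topicPartition, size int) {
	t.partitionBytes[tp] += size
	t.requestBytes += size
}

func main() {
	tr := newSizeTracker(1000000, 100000000) // e.g. ~1MB per partition, ~100MB per request
	a, b := topicPartition{"topic-a", 0}, topicPartition{"topic-b", 0}
	tr.add(a, 900000)
	// With per-request accounting only, this second partition would already be
	// squeezed; per-partition accounting still gives it its own full budget.
	fmt.Println(tr.wouldOverflow(b, 900000)) // false
}
```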
Put them in a map right up front in the aggregator; it only requires tracking one extra piece of metadata (the total number of messages in the map), and it means we don't have to shuffle them into this form before constructing the request anyway (see the sketch below).
One piece of #433.
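A minimal sketch of the suggestion above (hypothetical field and method names), building on the same nested-map shape as the earlier sketch: the aggregator keeps messages in their topic/partition buckets from the moment they arrive, and carries a single counter alongside, so flush decisions that depend on how many messages are buffered stay O(1) without walking the map.

```go
package main

import "fmt"

// aggregator is a hypothetical sketch of the map-up-front approach: the only
// extra bookkeeping beyond the nested map is a running count of buffered messages.
type aggregator struct {
	buffer   map[string]map[int32][][]byte // topic -> partition -> payloads (simplified)
	buffered int                           // the one extra piece of metadata
	flushAt  int                           // hypothetical flush threshold, in messages
}

func newAggregator(flushAt int) *aggregator {
	return &aggregator{buffer: make(map[string]map[int32][][]byte), flushAt: flushAt}
}

func (a *aggregator) add(topic string, partition int32, payload []byte) {
	if a.buffer[topic] == nil {
		a.buffer[topic] = make(map[int32][][]byte)
	}
	a.buffer[topic][partition] = append(a.buffer[topic][partition], payload)
	a.buffered++
}

// readyToFlush consults only the counter, not the map.
func (a *aggregator) readyToFlush() bool {
	return a.buffered >= a.flushAt
}

// flush hands the buffer over already in the shape the request needs.
func (a *aggregator) flush() map[string]map[int32][][]byte {
	out := a.buffer
	a.buffer = make(map[string]map[int32][][]byte)
	a.buffered = 0
	return out
}

func main() {
	agg := newAggregator(2)
	agg.add("events", 0, []byte("a"))
	fmt.Println(agg.readyToFlush()) // false
	agg.add("events", 1, []byte("b"))
	fmt.Println(agg.readyToFlush()) // true
	_ = agg.flush()
}
```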
#449 found a problem in this area which #538 fixed in a rather short-term, hacky way. However this ends up being re-organized, it must solve those problems as well.