kafka-consumer-algorithms

A comparison of algorithms for consuming messages from Kafka: sequential, partially concurrent, fully concurrent.

Overview

When designing and operating a system that uses Kafka, you will encounter different techniques for consuming messages. The most familiar pattern is synchronously polling a batch of records, processing each one, and then committing the new offsets back to Kafka.
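
To make that concrete, here is a minimal sketch of the sequential pattern using the standard Java client. The broker address, topic name, serde configuration, and processing step are illustrative assumptions, not the project's actual code.

```java
// A minimal sketch of the classic sequential poll-process-commit loop.
// Assumes a local broker and string serdes; "input-topic" is a placeholder.
import java.time.Duration;
import java.util.List;
import java.util.Properties;
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;

public class SequentialSketch {

    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092");
        props.put("group.id", "sequential-sketch");
        props.put("key.deserializer", "org.apache.kafka.common.serialization.StringDeserializer");
        props.put("value.deserializer", "org.apache.kafka.common.serialization.StringDeserializer");
        props.put("enable.auto.commit", "false");

        try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(props)) {
            consumer.subscribe(List.of("input-topic"));
            while (true) {
                // Poll a batch, process each record one at a time, then commit.
                ConsumerRecords<String, String> records = consumer.poll(Duration.ofMillis(250));
                for (ConsumerRecord<String, String> record : records) {
                    process(record); // the sequential bottleneck: one record at a time
                }
                consumer.commitSync(); // commit new offsets for everything just processed
            }
        }
    }

    static void process(ConsumerRecord<String, String> record) { /* transform and produce */ }
}
```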

While simple, this pattern bottlenecks throughput because it's a sequential algorithm. In systems that need more, you will eventually turn to concurrent algorithms to increase throughput and reduce end-to-end latency. This project showcases different consumer implementations and their performance characteristics. The goal of the project is that you learn something: the poll loop, concurrent programming in Java/Kotlin, or the semantics of in-order message processing and offset committing. Study and experiment with the code.

At a high level, this project explores Kafka consumption algorithms on two dimensions:

  1. Concurrency level
  2. Workload type (CPU-bound vs IO-bound)

Here is an overview of the explored algorithms, and how they perform based on the nature of the workload (CPU-bound vs IO-bound):

| Algorithm | Concurrency Level | In-Process Compute (CPU-bound) | Remote Compute (IO-bound) |
| --- | --- | --- | --- |
| Sequential | 💻 (None) | Slowest. It's fine when you have a CPU-bound workload and only one core. | Slowest. It's fine when the external compute can only handle one unit of work at a time. |
| Concurrent across partitions within same poll | 💻💻 | Much faster if there are multiple CPU cores, but uneven work is a bottleneck. | Much faster if the remote compute can handle many requests quickly, but uneven work is a bottleneck. |
| Concurrent across partitions | 💻💻 | ✅ Fastest general-purpose consumer. Fully saturates the workload based on CPU capacity. | ✅ Fastest general-purpose consumer. Fully saturates the workload based on the capacity of the remote compute. |
| Concurrent across partition-key groups | 💻💻💻 | A special case of even more concurrency if your domain permits it. | A special case of even more concurrency if your domain permits it. |
| (What sophisticated algorithm does your domain permit?) | 💻💻💻❓ | | |
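
To ground the second row of the table, here is a hedged sketch of the "concurrent across partitions within same poll" idea, under the same illustrative assumptions as before: fan out one task per partition of the current poll, wait for all of them, then commit. The final join is exactly where uneven work bites.

```java
// A hedged sketch of concurrency across partitions within a single poll.
// Illustrative only; the project's actual module may work differently.
import java.time.Duration;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;

class ConcurrentWithinPollSketch {

    private final ExecutorService pool =
            Executors.newFixedThreadPool(Runtime.getRuntime().availableProcessors());

    void loop(KafkaConsumer<String, String> consumer) {
        while (true) {
            ConsumerRecords<String, String> records = consumer.poll(Duration.ofSeconds(1));

            // One task per partition of this poll preserves per-partition ordering.
            List<CompletableFuture<Void>> tasks = records.partitions().stream()
                    .map(partition -> CompletableFuture.runAsync(
                            () -> records.records(partition).forEach(this::process), pool))
                    .toList();

            // The whole poll waits on the slowest partition; this is the
            // "uneven work is a bottleneck" problem from the table above.
            CompletableFuture.allOf(tasks.toArray(CompletableFuture[]::new)).join();
            consumer.commitSync();
        }
    }

    void process(ConsumerRecord<String, String> record) { /* transform and produce */ }
}
```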

The Code

The test bed for these implementations is the combination of an example Kafka consumer application and a test harness. The example app is a simple data in, data out Java program that consumes from a Kafka topic, transforms the data, and then produces the transformed messages to another Kafka topic.

The example application computes prime numbers, which is a CPU-intensive task. The application can also run in an alternative mode where the computation is delegated to a fictional remote "prime computing service"; this is useful for simulating an IO-bound workload.
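
For a sense of what the CPU-bound mode exercises, here is a hedged sketch of prime computation by trial division; the project's actual function may differ.

```java
// A hedged sketch of CPU-bound work along the lines the app describes.
class PrimeWork {

    static boolean isPrime(long n) {
        if (n < 2) return false;
        for (long d = 2; d * d <= n; d++) {
            if (n % d == 0) return false; // found a divisor, so not prime
        }
        return true;
    }

    // Counting primes up to a bound keeps a core busy with pure arithmetic.
    // Unlike Thread.sleep, this genuinely stresses the CPU, which is the
    // point of the in-process-compute mode.
    static long countPrimesUpTo(long bound) {
        long count = 0;
        for (long n = 2; n <= bound; n++) {
            if (isPrime(n)) count++;
        }
        return count;
    }
}
```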

Overall, this is a multi-module Gradle project with the following subprojects:

  • kafka-consumer-sequential/
    • This is the most basic Kafka consumer pattern. It processes each record in sequence (one at a time).
    • See the README in kafka-consumer-sequential/.
  • kafka-consumer-concurrent-across-partitions-within-same-poll/
  • kafka-consumer-concurrent-across-partitions/
  • kafka-consumer-concurrent-across-keys/
  • kafka-consumer-concurrent-across-keys-with-coroutines/
  • runner/
    • This is the module with a main method. It encapsulates the example Kafka consumer application and the test harness.
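
The key-based modules push concurrency below the partition level: records with different keys can be processed concurrently while records sharing a key stay in order. Here is a hedged sketch of one way to get those semantics (not the project's implementation; KeyedDispatcher and its names are illustrative): chain each record's work onto the previous future for the same key.

```java
// A hedged sketch of per-key ordering with cross-key concurrency.
// Illustrative only; the project's actual consumers may work differently.
import java.util.Map;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

class KeyedDispatcher<K> {

    // The tail of the work chain for each key. A real implementation would
    // prune completed tails and tie completion back to offset commits.
    private final Map<K, CompletableFuture<Void>> tails = new ConcurrentHashMap<>();
    private final ExecutorService pool = Executors.newVirtualThreadPerTaskExecutor();

    /** Run work for the same key in order; different keys run concurrently. */
    CompletableFuture<Void> dispatch(K key, Runnable work) {
        return tails.compute(key, (k, tail) -> {
            CompletableFuture<Void> prev =
                    (tail == null) ? CompletableFuture.completedFuture(null) : tail;
            return prev.thenRunAsync(work, pool);
        });
    }
}
```

Most of the real complexity lives in what the sketch leaves out: knowing when every record up to an offset has finished so the offset can be committed.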

Instructions

Follow these instructions to get up and running with Kafka, run the program, and simulate Kafka messages.

  1. Prerequisites: Java, Kafka, and kcat
    • I used Java 21.
    • I used Kafka 3.8.0 installed via Homebrew.
    • I used kcat 1.7.0 installed via Homebrew.
    • Tip: check your Homebrew-installed package versions with a command like the following.
    • brew list --versions kafka
  2. Start Kafka:
    • ./scripts/start-kafka.sh
  3. Create the Kafka topics:
    • ./scripts/create-topics.sh
  4. Build and run the example consumer app in standalone mode
    • ./gradlew runner:installDist --quiet && ./runner/build/install/runner/bin/runner standalone in-process-compute:sequential
    • Alternatively, you can run the app in one of the other modes. Use the following command.
    • ./gradlew runner:installDist --quiet && ./runner/build/install/runner/bin/runner standalone remote-compute:concurrent-across-keys-with-coroutines
    • There are other options as well. Explore the code.
  5. In a new terminal, build and run a test case that exercises the app:
    • ./gradlew runner:installDist --quiet && ./runner/build/install/runner/bin/runner test-one-message
    • Try the other test scenarios.
    • ./gradlew runner:installDist --quiet && ./runner/build/install/runner/bin/runner test-multi-message
    • ./gradlew runner:installDist --quiet && ./runner/build/install/runner/bin/runner load-batch
  6. Stop Kafka with:
    • ./scripts/stop-kafka.sh
  7. Stop the app
    • Send Ctrl+C to the terminal where it's running
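
While the app and harness are running, you can also inspect the topics directly with kcat. For example (the topic name below is an assumption; use whatever ./scripts/create-topics.sh creates):

  • kcat -b localhost:9092 -C -t output-topic -o beginning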

Performance

You should be skeptical of performance results, because they are so often misleading, biased, or just plain wrong. It's difficult to operate a clean-room environment: the software under test has a huge surface area of configuration, and you have little control over the OS (or virtualized OS) running the program, the firmware of the storage, and so on. That said, here are the performance results of running the "load-all" scenario:

Load (see the code for the details)

| Consumer Algorithm | Type | Final Throughput (msg/s) | Total Time (s) |
| --- | --- | --- | --- |
| sequential | CPU | 0.76 | 13.14 |
| concurrent-across-partitions-within-same-poll | CPU | 1.50 | 6.67 |
| concurrent-across-partitions | CPU | 1.40 | 7.16 |
| concurrent-across-keys | CPU | 1.70 | 5.88 |
| concurrent-across-keys-with-coroutines | CPU | 1.70 | 5.87 |
| sequential | I/O | 0.99 | 10.06 |
| concurrent-across-partitions-within-same-poll | I/O | 1.98 | 5.04 |
| concurrent-across-partitions | I/O | 1.81 | 5.53 |
| concurrent-across-keys | I/O | 2.21 | 4.52 |
| concurrent-across-keys-with-coroutines | I/O | 2.21 | 4.53 |

Uneven Load (see the code for the details)

| Consumer Algorithm | Type | Final Throughput (msg/s) | Total Time (s) |
| --- | --- | --- | --- |
| sequential | CPU | 0.77 | 25.93 |
| concurrent-across-partitions-within-same-poll | CPU | 0.78 | 25.67 |
| concurrent-across-partitions | CPU | 1.21 | 16.59 |
| concurrent-across-keys | CPU | 1.84 | 10.87 |
| concurrent-across-keys-with-coroutines | CPU | 1.84 | 10.90 |
| sequential | I/O | 0.99 | 20.12 |
| concurrent-across-partitions-within-same-poll | I/O | 0.99 | 20.13 |
| concurrent-across-partitions | I/O | 1.53 | 13.07 |
| concurrent-across-keys | I/O | 2.21 | 9.06 |
| concurrent-across-keys-with-coroutines | I/O | 2.21 | 9.06 |

Mostly what I expected, but I'm confused about how concurrent-across-partitions-within-same-poll is faster than concurrent-across-partitions. Even with a small sample, the difference is large enough to be meaningful.

Wish List

General clean-ups, TODOs and things I wish to implement for this project:

  • Consider a "RecordProcessorWithContext" interface and high-level consumer. This could give the processor context about previously processed messages and upcoming ones. You should be able to express features like "debounce". Messages for the same key would be fused/bundled together.
  • Why is the consumer group so slow to start up and become registered? It's like 5 seconds (at least for the coroutines consumer).
  • DONE Table of perf results. 'compute mode + test flavor' on the Y axis, 'consumer type' on the X axis. The values are throughput and latency. Actually maybe a throughput table separate from the latency table. Consider other options too.
    • DONE (This is by design; I just keep forgetting all the dimensions. In "uneven", the ten records in the first poll are all for the same partition, so we get no concurrency. Same with the second poll.) Follow up on the numbers.
  • DONE Automate the tests.
  • Defect. Test harness doesn't quit on exception (e.g. timeout waiting for records)
  • DONE I don't need "topic" field in any of the consumers?
  • DONE Consider removing the app module because it's all just a test anyway. I need this so I can automate running a whole test suite which is too much to do manually at this point. This module has morphed from the original "kafka-in-kafka-out" vision to comparing algorithms. I think that's good.
  • Review the start/stop logic. This is always so hard to get right.
  • DONE Steady or staccato load. I want to see the 10-20 messages produced in individual moments. Also, consider renaming the basic load and uneven loads to "batchy" and "batch-uneven".
  • Use "pause" for fine-grained control of the in-flight work instead of stopping the whole intake. We basically need an uninterrupted rhythm of polls (see the sketch after this list).
  • Look into the metrics middleware again. I want to inject that from the runner and then continually print out the state of the consumer. I know we have out-of-process metrics on the test runner side, but I want to see what the in-flight work is and if a partition is paused.
  • Reconsider the limitations of concurrent-across-partitions-within-same-poll. Does a given poll call only get messages for a subset of partitions under certain conditions, like when the broker doesn't have all the partitions, or something about primary/secondary? Because if a poll only gets messages for one partition, there is no concurrency benefit.
  • Reduce the max poll and reduce the in-flight limit. As with everything, reducing down to the smallest reproducible configuration helps us focus on concepts. Shedding complexity begets shedding more complexity.
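
On the "pause" item above: here is a hedged sketch of what that could look like, assuming a hypothetical InFlightTracker that knows which partitions have too much queued work. This is not the project's code.

```java
// A hedged sketch of the "pause" idea: keep an uninterrupted rhythm of polls,
// but stop taking new records from partitions with too much in-flight work.
import java.time.Duration;
import java.util.HashSet;
import java.util.Set;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.common.TopicPartition;

// Hypothetical helper; not part of the project or the Kafka client.
interface InFlightTracker {
    Set<TopicPartition> partitionsOverLimit();               // partitions that should stop intake
    void dispatch(ConsumerRecords<String, String> records);  // hand new records to the workers
}

class PausingPollLoop {
    void run(KafkaConsumer<String, String> consumer, InFlightTracker inFlight) {
        while (true) {
            Set<TopicPartition> busy = inFlight.partitionsOverLimit();
            Set<TopicPartition> idle = new HashSet<>(consumer.assignment());
            idle.removeAll(busy);

            consumer.pause(busy);  // paused partitions return no records from poll()
            consumer.resume(idle); // resume partitions whose backlog has drained

            // Keep polling on a steady rhythm: even when everything is paused,
            // calling poll() satisfies max.poll.interval.ms and keeps the
            // consumer participating in rebalances.
            ConsumerRecords<String, String> records = consumer.poll(Duration.ofMillis(100));
            inFlight.dispatch(records);
        }
    }
}
```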

Finished Wish List Items

These items were either completed or skipped.

  • DONE Simulate processing slowness in app/. This will have the effect of the consumer group timing out with the Kafka broker and being removed from the group. This is a classic problem.
  • DONE (Fixed!) The test is flaky. The first time it runs, it fails (at least in my own test runs) but subsequent runs it succeeds. I want to dive deeper into what the consumer is doing. When is it ready?
  • DONE (I don't know, sometimes the tests are still flaky, and I'm not sure why) Upgrade to Java 17. For some reason, the test harness fails when executing with Java 17.
  • DONE (Yeah, the app just takes some time, so increasing the timeout on the test side works. I wonder if there is a config to let it start up faster though (less wait?).) The tests appear flaky, but it only happens when I start the app and then quickly run the tests. I think there's some sleep in the Kafka consumer at startup time that's the problem. I would love to be able to key off of some "ready" event or something.
  • DONE Consider making the test harness just a public static void main. That way, can I use the main thread as the consumer thread (and remove all the test dependencies)?
  • DONE Consider making just one module aside from the 'app' module. Maybe just a 'controller', 'admin', or something? In it, it can do the observability stuff, the test, the load simulation, etc.
  • DONE Consider making the logic a slow function, like a sort, as a useful way to contrast a multicore configuration vs single core. I don't want to just use sleeps because they don't stress the CPU.
  • DONE Delete the compression stuff. That might fit better in a "kafka administration" module. I still think it's interesting, but I want this module focused on the design of the app.
  • DONE (as per usual, sophistication often reduces performance) Async and parallel processing.
  • DONE Less error handling. Error handling is critical, but I'm already trying to showcase plenty of scheduling and coordination concerns with regard to processing messages and committing offsets. Leave out error handling but be clear about it.
  • DONE Consider using executor and tasks to de-couple polling from committing in the virtual thread implementation. To be symmetric with the coroutine implementation.
  • DONE (partial; there's no support for virtual threads) VisualVM
  • DONE Kotlin coroutine based "key/async" high-level consumer. I want to compare and contrast the programming model. My guess and hope is that I can use "thread confinement" when using coroutines to get the semantics I need but without using so many constructs in my own code (dictionaries, queues, futures, etc.)
    • DONE Get the poll loop working
    • DONE offset committing
  • DONE More validation. Do tests beyond just one message. We need multiple messages for a key, and multiple partitions.
  • DONE Limit intake in the coroutine consumer. Do this in the same way as the virtual thread consumer with the dual "queue/processed" counters.
  • DONE (Seems to work, but hard to know with concurrent programming) Defect. The virtual thread consumer is blocked on the poll loop. I didn't schedule the work correctly. I think I want two different virtual thread executors, so that each one has its own platform thread? Is that possible? UPDATE: No, all virtual thread management is done out of user control.
  • DONE Consistent and fleshed out reporting logging. I want apples-to-apples between the sync/coroutine/virtual-thread consumers. While it may be more engineered to export metrics and do the reporting and visualization in an outside tool, the buck has to stop somewhere. Let's keep it legible.
  • DONE (Prime finding) Use a pure CPU-intensive function. Sorting is bound so strongly by memory speed that it's actually 10 times slower to parallelize it (I still barely understand that... maybe if I did huge lists that would amortize away). Regardless, the effect is pronounced and makes for a bad demo. Can we do prime factorization or Fibonacci or something?
  • DONE (duh.. needed to flush) Defect. When producing small amounts of messages (somewhere less than 100), the messages just don't appear... Defect in my producer.
  • DONE Approximate a slow external collaborator? For realism, we want to approximate both slow CPU intensive work and slow IO.
  • DONE Change load cpu-intensive language to just small medium large or something because now I've decided that the app encapsulates the compute option.
  • DONE Do message processing count only in the test harness. This will shave some code nicely across the consumers.
  • DONE Track full message wait time: from when it's produced to when its output message is created. It's not enough to calculate processing time because, for example, "all messages are blocked until the end" is not as good as "only some messages experience the whole time and many are completed earlier".
  • DONE Turn uneven into "batchy".
  • DONE (annoyingly complicated and verbose) Log when the consumer is assigned. It's annoying to have to guess and over-wait until I kick off a load test. Maybe it's enough to just seek to the end in a blocking way?
  • DONE Reconsider "uneven" load test. Do I need yet another consumer which is async but only on partition? I think so. I need a way to make a case for the key-based processing.
    • DONE Create a "record-at-a-time" consumer. (now renamed "sequential")
    • DONE Create a "parallel-within-same-poll"
    • DONE Create a basic async consumer. Thread pool? I'll allow the existing virtual thread consumer to showcase virtual threads. I think it's good to jump to async on partition before escalating to async on partition-key.
    • DONE Is the existing "batchy" scenario good enough? We're trying to show that "parallel in same poll" and "async" both help throughput in general, but async is better for uneven loads because the parallel-poll one will suffer from bottle-necking on the slowest partition. load-batchy for parallel yields an elapsed time of 26s and for async an elapsed time of 16s.
    • DONE Rename batchy to uneven (second time I've changed the name)
  • DONE Reflow the docs to highlight concurrent-across-partitions as the most interesting one. The key-based stuff is cool, but it is not the core insight: async processing is. Key-based processing is just another evolution of that, not a phase change.
  • DONE reflow main docs
    • DONE Two dimension view: concurrency and workload (CPU vs IO).
    • DONE concurrent language instead of async (I'm waffling on this, but I like highlighting the algorithm language (sequential/parallel) instead of the programming idiom language (blocking/async)).
    • DONE Turn "parallel within poll" into just "concurrent within poll". "Stream.parallel" is a mirage anyway; it's just multithreaded, and it's up to the OS/hardware to actually give us parallelism.
  • DONE (or wait... just use virtual threads? Concurrent programming and APIs are so hard) For the "in-process-compute" mode, configure a thread pool sized to the core count. I really want to contrast the constraint difference between CPU-bound and IO-bound workloads. A CPU-bound workload can't be parallelized beyond the core count. Mechanical sympathy.

Reference