
Implement wrapper around KafkaConsumer to support batch consumption with head of line blocking #907

Closed
nscuro opened this issue Nov 13, 2023 · 3 comments
Assignees
Labels
component/api-server, enhancement (New feature or request), p3 (Nice-to-have features), size/L (High effort), spike/research (Requires more research before implementation)

Comments

@nscuro
Member

nscuro commented Nov 13, 2023

We have multiple use cases where Kafka records are assembled into batches before they're processed.

While batch semantics can be achieved with Kafka Streams, it introduces additional overhead:

  • Record offsets become eligible for committing once a record successfully passes through a sub-topology
  • It is thus not possible to delay offset commits until the batch is processed, as record offsets may be committed before that, risking message loss when processing the batch fails
  • Assembling record batches requires Kafka Streams state stores (in-memory or RocksDB), which in turn necessitate a changelog topic for fault tolerance
  • Batches can only be assembled per-partition, because each state store is bound to a single Kafka Streams task, which itself is bound to one partition

An example of trying to address such batching use cases in Kafka Streams can be seen here: https://github.com/DependencyTrack/hyades-apiserver/pull/305/files

For simple batching use cases, ideally it should work like this:

  • Subscribe to N partitions of topic foo
  • Poll records from all N partitions
  • Put records into an in-memory batch until a given max size is reached
  • (Optionally send records that failed to deserialize to a dead-letter topic)
  • When the max batch size or a given timeout is reached, "flush" / process the records (i.e. write to database, make an HTTP call, ...)
  • When flushing was successful, commit offsets
  • When flushing was unsuccessful, do not commit offsets and either:
    • Fail the consumer entirely in case of non-retryable errors
    • Restart consumer from last committed offset in case of retryable errors

Essentially, implement a batch consumer with head-of-line blocking.
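
A minimal sketch of such a consumer loop, built directly on the plain KafkaConsumer, could look like the following. The BatchProcessor interface and RetryableException class are hypothetical helpers introduced here for illustration, not existing hyades-apiserver code; rebalance handling and dead-lettering of records that fail to deserialize are omitted for brevity:

```java
import java.time.Duration;
import java.time.Instant;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Properties;
import java.util.Set;

import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.clients.consumer.OffsetAndMetadata;
import org.apache.kafka.common.TopicPartition;

public class BatchingConsumer<K, V> {

    /** Flushes a complete batch, e.g. to the database. Hypothetical helper interface. */
    @FunctionalInterface
    public interface BatchProcessor<K, V> {
        void flush(List<ConsumerRecord<K, V>> batch) throws Exception;
    }

    /** Hypothetical marker for errors that warrant a retry from the last committed offset. */
    public static class RetryableException extends RuntimeException {
        public RetryableException(Throwable cause) {
            super(cause);
        }
    }

    private final KafkaConsumer<K, V> consumer;
    private final BatchProcessor<K, V> processor;
    private final int maxBatchSize;
    private final Duration maxBatchAge;

    public BatchingConsumer(final Properties props, final String topic, final BatchProcessor<K, V> processor,
                            final int maxBatchSize, final Duration maxBatchAge) {
        // Offsets must only be committed after a successful flush.
        props.put("enable.auto.commit", "false");
        this.consumer = new KafkaConsumer<>(props);
        this.consumer.subscribe(List.of(topic));
        this.processor = processor;
        this.maxBatchSize = maxBatchSize;
        this.maxBatchAge = maxBatchAge;
    }

    public void run() {
        final List<ConsumerRecord<K, V>> batch = new ArrayList<>(maxBatchSize);
        Instant batchStart = Instant.now();

        while (true) {
            for (final ConsumerRecord<K, V> record : consumer.poll(Duration.ofMillis(250))) {
                if (batch.isEmpty()) {
                    batchStart = Instant.now(); // The flush timeout counts from the batch's first record.
                }
                batch.add(record);
            }

            final boolean full = batch.size() >= maxBatchSize;
            final boolean timedOut = !batch.isEmpty()
                    && Duration.between(batchStart, Instant.now()).compareTo(maxBatchAge) >= 0;
            if (!full && !timedOut) {
                continue;
            }

            try {
                processor.flush(batch);
                consumer.commitSync(); // Commit only after the batch was flushed successfully.
                batch.clear();
            } catch (RetryableException e) {
                // Head-of-line blocking: rewind to the last committed offsets and retry the same records.
                // A real implementation would add a backoff here.
                rewindToCommitted();
                batch.clear();
            } catch (Exception e) {
                consumer.close(); // Non-retryable error: fail the consumer entirely.
                throw new IllegalStateException("Batch flush failed with non-retryable error", e);
            }
        }
    }

    private void rewindToCommitted() {
        final Set<TopicPartition> assignment = consumer.assignment();
        final Map<TopicPartition, OffsetAndMetadata> committed = consumer.committed(assignment);
        for (final TopicPartition partition : assignment) {
            final OffsetAndMetadata offset = committed.get(partition);
            if (offset != null) {
                consumer.seek(partition, offset.offset());
            } else {
                consumer.seekToBeginning(List.of(partition));
            }
        }
    }
}
```

Since the consumer only returns to poll() once a batch is flushed, max.poll.interval.ms would need to be sized to accommodate the batch timeout plus the worst-case flush duration, otherwise the consumer gets kicked out of the group.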

HOL blocking semantics are often undesirable, but in certain cases they are useful:

  • Assuming all records are flushed to the same "sink" (e.g. database), failure to flush one record (e.g. database is down) always implies others can't be flushed either. There is no point in proceeding to later records in the topic.
  • Retries are simple: they merely involve restarting from the last committed offset. No additional state keeping is necessary, and no retry or changelog topic is required.

Areas where I think this might be useful:

  • Ingestion of ScanResults and vulnerabilities from the dtrack.vuln-analysis.result topic
  • Ingestion of AnalysisResults from the dtrack.repo-meta-analysis.result topic
  • Buffering of ScanResults when tracking vulnerability scan completion (see https://github.com/DependencyTrack/hyades-apiserver/pull/305/files)
  • Ingestion of mirrored vulnerabilities from the dtrack.vulnerability topic
  • Performing vulnerability analysis with Snyk or OSS Index, both of which support PURL batching
@nscuro nscuro added the enhancement, p3, size/L, component/api-server, and spike/research labels Nov 13, 2023
@nscuro
Member Author

nscuro commented Nov 14, 2023

I just realized that Confluent's Parallel Consumer is doing exactly that: #346

Update: Confluent Parallel Consumer has no batch timeout behavior (it doesn't wait for batches to become full). So ultimately we need to build our own batching consumer. But perhaps PC can be used as an interim solution in the meantime.
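
For reference, a rough sketch of how batch processing looks with parallel-consumer. Method names reflect its 0.5.x API and should be double-checked against the version in use; processBatch stands in for the actual flush logic:

```java
import java.util.List;

import io.confluent.parallelconsumer.ParallelConsumerOptions;
import io.confluent.parallelconsumer.ParallelConsumerOptions.ProcessingOrder;
import io.confluent.parallelconsumer.ParallelStreamProcessor;
import org.apache.kafka.clients.consumer.Consumer;
import org.apache.kafka.clients.consumer.ConsumerRecord;

class ParallelConsumerBatchExample {

    // kafkaConsumer is a plain KafkaConsumer created elsewhere with enable.auto.commit=false.
    static void start(final Consumer<String, byte[]> kafkaConsumer) {
        final ParallelConsumerOptions<String, byte[]> options = ParallelConsumerOptions.<String, byte[]>builder()
                .consumer(kafkaConsumer)
                .ordering(ProcessingOrder.PARTITION) // Preserve per-partition ordering.
                .batchSize(500)                      // Hand up to 500 records to the callback at once.
                .build();

        final ParallelStreamProcessor<String, byte[]> processor =
                ParallelStreamProcessor.createEosStreamProcessor(options);
        processor.subscribe(List.of("dtrack.vuln-analysis.result"));

        processor.poll(context -> {
            // Batches are dispatched as soon as records are available; there is no
            // option to wait for a batch to fill up (the missing timeout behavior).
            final List<ConsumerRecord<String, byte[]>> records = context.getConsumerRecordsFlattened();
            processBatch(records); // Hypothetical flush logic (database write, HTTP call, ...).
        });
    }

    static void processBatch(final List<ConsumerRecord<String, byte[]>> records) {
        // Placeholder for the actual batch flush.
    }
}
```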

@nscuro nscuro self-assigned this Nov 24, 2023
@nscuro nscuro mentioned this issue Feb 2, 2024
34 tasks
nscuro added a commit to DependencyTrack/hyades-apiserver that referenced this issue Feb 2, 2024
Decoupled from #509

This merely adds an API on top of which processors can be implemented. We can migrate processors one-by-one from Kafka Streams to this API. The majority of this work was already done in #509, but it got out of date due to changed priorities. At the very least, said PR is good to take inspiration from.

Relates to DependencyTrack/hyades#346
Relates to DependencyTrack/hyades#901
Relates to DependencyTrack/hyades#907

Signed-off-by: nscuro <[email protected]>
Further commits by nscuro referencing this issue were added to DependencyTrack/hyades-apiserver between Feb 3 and Mar 25, 2024.
@nscuro
Member Author

nscuro commented Jun 5, 2024

Closing, as consumers in the API server have been migrated to Confluent parallel-consumer.

@nscuro nscuro closed this as completed Jun 5, 2024