
k/record_batcher: Move to kafka/data #23832

Draft
wants to merge 7 commits into base: dev

Commits on Oct 17, 2024

  1. record_batcher: Move from transform/logging to kafka/client

    Useful in audit_log_manager as well as transform logging
    
    Signed-off-by: Oren Leiman <[email protected]>
    oleiman committed Oct 17, 2024 (commit 208717d)
  2. k/record_batcher: Make k/v interfaces optional<iobuf>

    Also adds make_batch_of_one (sketched below).
    
    Signed-off-by: Oren Leiman <[email protected]>
    oleiman committed Oct 17, 2024 (commit f7e30f2)
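
    A minimal sketch of what the revised interface might look like,
    assuming a batcher shaped roughly like the one below. The names
    are illustrative, and std::string stands in for iobuf to keep the
    example self-contained:

      // Hypothetical sketch, not the actual kafka record_batcher API.
      #include <optional>
      #include <string>
      #include <utility>
      #include <vector>

      using iobuf = std::string; // stand-in for the real iobuf type

      struct record {
          std::optional<iobuf> key;   // k/v may now be absent
          std::optional<iobuf> value;
      };

      using record_batch = std::vector<record>; // stand-in batch type

      class record_batcher {
      public:
          // Append a record whose key and/or value may be empty.
          void append(std::optional<iobuf> k, std::optional<iobuf> v) {
              _pending.push_back({std::move(k), std::move(v)});
          }

          // Added by this commit: wrap a single k/v pair in a batch.
          static record_batch
          make_batch_of_one(std::optional<iobuf> k, std::optional<iobuf> v) {
              return {{std::move(k), std::move(v)}};
          }

      private:
          std::vector<record> _pending;
      };
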
  3. k/record_batcher: Optionally inject a ss::logger

    Signed-off-by: Oren Leiman <[email protected]>
    oleiman committed Oct 17, 2024 (commit d6c52a7)
  4. audit: Plumb metadata_cache and node_id into audit_log_manager

    We need these for sorting out which partitions are locally led
    (see the sketch below).
    
    Signed-off-by: Oren Leiman <[email protected]>
    oleiman committed Oct 17, 2024 (commit 957f49e)
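
    A self-contained sketch of the leadership check this enables,
    with plain types standing in for the real metadata_cache and
    node_id (illustrative only, not the actual cluster API):

      // Hypothetical sketch; the real code would consult the
      // metadata_cache plumbed in by this commit.
      #include <cstdint>
      #include <optional>
      #include <unordered_map>
      #include <vector>

      using node_id = int32_t;
      using partition_id = int32_t;

      struct leader_table {
          std::unordered_map<partition_id, node_id> leaders;

          std::optional<node_id> get_leader(partition_id p) const {
              auto it = leaders.find(p);
              if (it == leaders.end()) {
                  return std::nullopt;
              }
              return it->second;
          }
      };

      // Partitions of the audit topic that this node currently leads.
      std::vector<partition_id> locally_led(
        const leader_table& cache, node_id self, int32_t n_partitions) {
          std::vector<partition_id> out;
          for (partition_id p = 0; p < n_partitions; ++p) {
              auto leader = cache.get_leader(p);
              if (leader && *leader == self) {
                  out.push_back(p);
              }
          }
          return out;
      }
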
  5. audit: Perform record batching and partition assignment in manager

    The previous implementation used a very high retry count on the
    internal kafka client, which prevents the client from recovering
    from certain types of errors.
    
    Instead, we batch up drained records on the manager side, allowing
    us to hold a copy of each batch in memory and retry failed produce
    calls from "scratch".
    
    This also allows us to be _much_ more aggressive about batching.
    The internal kafka client will calculate a destination partition
    for each record, round robin style over the number of partitions.
    In the new scheme, we shoot for a maximally sized batch first, then
    select a destination, still round-robin style, but biasing heavily
    toward locally led partitions. In this way, given the default audit
    per-shard queue limit and default max batch size (both 1MiB), the
    most common drain operation should result in exactly one produce
    request. (See the sketch below.)
    
    Signed-off-by: Oren Leiman <[email protected]>
    oleiman committed Oct 17, 2024 (commit c020e2e)
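
    A minimal sketch of the scheme described above, assuming records
    are packed greedily up to the 1MiB limit and destinations are
    chosen round robin with a strong preference for locally led
    partitions. Names and types are illustrative, and the retry logic
    (holding a copy of each batch and re-producing on failure) is
    omitted for brevity:

      // Hypothetical sketch, not the actual audit_log_manager code.
      #include <cstddef>
      #include <string>
      #include <utility>
      #include <vector>

      struct record { std::string payload; };
      struct batch { std::vector<record> records; std::size_t bytes = 0; };

      constexpr std::size_t max_batch_bytes = 1 << 20; // 1MiB default

      // Pack drained records into maximally sized batches first...
      std::vector<batch> make_batches(std::vector<record> drained) {
          std::vector<batch> out;
          batch cur;
          for (auto& r : drained) {
              if (!cur.records.empty()
                  && cur.bytes + r.payload.size() > max_batch_bytes) {
                  out.push_back(std::move(cur));
                  cur = {};
              }
              cur.bytes += r.payload.size();
              cur.records.push_back(std::move(r));
          }
          if (!cur.records.empty()) {
              out.push_back(std::move(cur));
          }
          return out;
      }

      // ...then pick a destination: still round robin, but skip ahead
      // to the next locally led partition whenever one exists.
      int next_partition(
        int& rr, int n_partitions, const std::vector<bool>& locally_led) {
          for (int i = 0; i < n_partitions; ++i) {
              int candidate = (rr + i) % n_partitions;
              if (locally_led[candidate]) {
                  rr = (candidate + 1) % n_partitions;
                  return candidate;
              }
          }
          int candidate = rr; // no local leader: plain round robin
          rr = (rr + 1) % n_partitions;
          return candidate;
      }
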
  6. k/record_batcher: Move to kafka/data

    oleiman committed Oct 17, 2024 (commit 3417b42)

Commits on Oct 18, 2024

  1. module.bazel.lock

    oleiman committed Oct 18, 2024 (commit 60863b3)