raft: make the entries encoding maintainable #131561

pav-kv · 2024-09-28T18:12:02Z

Entries in raft contain the user-defined commands in a []byte slice, which our application layer has freedom to define. Today, we have the ever-growing encodings list + ad-hoc entry entry.Data parsing sprinkled in random places. This gets harder to maintain and reason about.

The pattern is that the first byte of Data contains the "encoding", and we then parse the entry differently, depending on this byte. We often want to sneak peek into the entry's "header" without parsing it entirely, because in most cases it is a protobuf with non-zero unmarshaling cost.

It would be great to replace this ad-hoc encoding with a clean/maintainable solution that allows:

Zero/low cost of unmarshaling.
Especially in the cases when we only need to check certain things about the entry, e.g. "is this a sideloaded entry?".

One approach to this is replacing the ad-hoc encoding with flatbuffers. We would treat the raftpb.Entry.Data field as a flatbuffer, with a well-defined / maintainable schema. It would be composed of “header” + data. The header would contain things like:

“is sideloaded” bit for routing the entries load/store to the right sub-storage
“sideloaded size”, for size accounting in log truncations stack (at the moment it needs to look at the file to know its size, which is inconvenient and e.g. the decoupled log trunc stack omits sideloaded files accounting in some cases)
the AC/RACv2 headers
maybe more

Then all the header checks (like “do we have a RACv2 header?”) would be cheap. As a bonus, we would have no allocations in the entries unmarshaling stack.

As a flip-side, this doesn’t integrate super well with protos, so maybe we would need some wrappers around these flatbuffers to convert to custom types or protos. Though at the scale of just one entries type this doesn’t sound too bad, and is probably better anyway than all the bug-prone parsing we have today. Another flip-side is that this requires a one-time migration that overwrites all raft logs.

However, this logs scan/migration is an opportunity to:

fixup invariants and remove some old code like this
solve problems like raft: eliminate log scan on campaigns #131559
check some invariants (e.g. indices are contiguous, terms are monotonic, etc)

Jira issue: CRDB-42604

The text was updated successfully, but these errors were encountered:

sumeerbhola · 2024-11-18T17:35:40Z

Another flip-side is that this requires a one-time migration that overwrites all raft logs.

That seems worrisome.

pav-kv · 2024-11-18T17:39:17Z

Yes, but:

it pays down some tech debt
it reduces cost
we have some other work that may require a raft log migration, e.g. ones mentioned in the issue description and kvserver: investigate SingleDelete for raft log truncation #8979. So this migration cost could be paid once if timed with other changes.
raft logs are small since they are compacted eagerly.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

raft: make the entries encoding maintainable #131561

raft: make the entries encoding maintainable #131561

pav-kv commented Sep 28, 2024 •

edited

Loading

sumeerbhola commented Nov 18, 2024

pav-kv commented Nov 18, 2024 •

edited

Loading

raft: make the entries encoding maintainable #131561

raft: make the entries encoding maintainable #131561

Comments

pav-kv commented Sep 28, 2024 • edited Loading

sumeerbhola commented Nov 18, 2024

pav-kv commented Nov 18, 2024 • edited Loading

pav-kv commented Sep 28, 2024 •

edited

Loading

pav-kv commented Nov 18, 2024 •

edited

Loading