Fix extract transcript #7434

mhofman · 2023-04-17T15:39:05Z

refs: #7432

Description

Update the transcript extract tools (and swingstore) to support the latest changes to transcripts.

Add a bundle export tools
When exporting from kernelDB, reference bundle IDs for both the lockdown and supervisor bundles, as well as the vat source bundle
re-order the fields of create-vat in the slog extract tool to be more similar to the kernel DB extract tool (minus the bundle ID changes)

The swingstore is extended as needed to support the extract tooling.

Some of these commits are cherry-picks from #6723 that were omitted from #7432

Security Considerations

None, tooling only

Scaling Considerations

None

Documentation Considerations

Internal tooling

Testing Considerations

Manually verified with from the state and slog created by a modified test-vaults-integration.js bootstrap test. Verified in a merge with #7432 that the transcripts could be replayed successfully.

mhofman · 2023-04-17T15:39:43Z

FYI @warner

I plan on figuring something out for handling bundleIDs in the workerOptions.

warner

looks good

warner · 2023-04-18T07:26:27Z

packages/swing-store/src/transcriptStore.js

@@ -131,6 +140,14 @@ export function makeTranscriptStore(
    return transcripts;
  }

+  function* readVatTranscript(vatID) {


We might call this readFullVatTranscript to make it clear that it's fetching multiple spans, which is not something we'd normally do for operational purposes. readVatTranscript isn't wrong.. it fits with our hierarchy of names:

"transcript": everything from vat creation to vat termination (or latest delivery)

"incarnation": everything from one upgrade to the next upgrade (or latest delivery, or termination)

"span": everything from heap snapshot save/load to next upgrade, snapshot write, termination, or latest delivery

"item" or "entry": one delivery

but "transcript" to mean "everything" is so rarely used, that it doesn't hurt to give it a longer name.

warner · 2023-04-18T07:28:23Z

packages/swing-store/src/transcriptStore.js

+  function* readVatTranscript(vatID) {
+    for (const { startPos, endPos } of sqlDumpVatSpansQuery.iterate(vatID)) {
+      for (const row of sqlDumpItemsQuery.iterate(vatID, startPos, endPos)) {
+        yield row;


be thinking about what happens if we've pruned the span and/or the items, we probably want this to throw an error rather than yield broken items.. maybe?

Good call! This method will simply be missing the historical span entries that were pruned. While this is a "full transcript" I think it's acceptable for it to yield a partial history given what we yield is both the transcript entry and its position.

I've updated the consumer (extract-transcript) to not write anything until it sees a startVat delivery. There was already logic to throw if the entries were not continuous (gap in positions). Before the change it would have thrown on pruned historical transcript spans. Now it will only return a full transcript of the last incarnation, but handle partially pruned transcripts of previous incarnations.

warner · 2023-04-18T07:29:30Z

packages/SwingSet/misc-tools/extract-transcript-from-slogfile.js

        transcriptNum += 1;
+        const entry = { transcriptNum, d: delivery, syscalls };


BTW I'd love to find a clean way to keep the transcriptNum in the DB, to avoid potential drift or fencepost errors with this external incrementing counter.

It is in the DB already. Here it's from the slog, which doesn't include the transcript pos, so it has to be regenerated. It would indeed be nice to include this info in the slogfile.

warner · 2023-04-18T07:31:38Z

packages/SwingSet/misc-tools/extract-transcript-from-kerneldb.js

-  return kvStore.get(key);
+  const value = kvStore.get(key);
+  if (value === undefined) {
+    throw Error(`Inexistent kvStore entry for ${key}`);


I'd say "non-existent" or "missing", but "inexistent" sounds like a cool word too :)

You caught my Latin roots :)

warner · 2023-04-18T07:34:02Z

packages/SwingSet/misc-tools/extract-transcript-from-kerneldb.js

  }
+} else if (/(supervisor|lockdown)(B|-b)undle/.test(vatName)) {


is this abusing the CLI argument so that when you say "please replay the vat named supervisor-bundle", it will actually extract the given bundle and write it to disk?

weird, but I'm ok with it

I thought I had removed this, since it's been included in the extract-bundle tool instead...

warner · 2023-04-18T07:39:22Z

packages/SwingSet/misc-tools/extract-transcript-from-kerneldb.js

+  for (const { position, item } of transcript) {
+    const entry = JSON.parse(item);
+    if (entry.d[0] === 'startVat') {
+      fs.ftruncateSync(fd);


This truncate is new, right? Were you having problems with overwriting pre-existing files that needed it? If so, it seems like it'd be better to open the file in the mode that overwrites/truncates the old one, or at least do the truncation immediately after opening, instead of inside this loop.

Or.. oh, are you writing every entry, but then you discover a startVat and retroactively decide that the transcript ought to begin here?

Ah, ok, so what you really want is a transcriptStore.readIncarnationItems(vatID, incarnationID), or a readCurrentIncarnation(vatID), so you don't have to deduce where the current incarnation begins.

As we improve the transcript to track upgrades and snapshot reads/writes, we should consider exposing incarnation numbers to the DB (as I think we discussed already, you and me and @FUDCo, and I think we agreed in principle on doing it). Probably sooner rather than later.

Yup that was the only way I found to only extract the latest incarnation given our DB model, and the motivation for my request to include the incarnation number.

Does not include bundle info

mhofman marked this pull request as ready for review April 18, 2023 00:51

mhofman requested a review from warner April 18, 2023 00:52

mhofman mentioned this pull request Apr 18, 2023

Port pismo replay tool improvements #7432

Merged

warner approved these changes Apr 18, 2023

View reviewed changes

mhofman force-pushed the mhofman/fix-extract-transcript-from-kernel-db branch from c0c0997 to 4cbfc93 Compare April 18, 2023 15:28

mhofman added the automerge:rebase Automatically rebase updates, then merge label Apr 18, 2023

mhofman added 5 commits April 18, 2023 19:29

fix(swingset-tools): correct transcriptNum when extracting from slog

e05e37b

fix(swingset-tools): extract vat transcript

edbac04

feat(swingset-tools): reference bundleIDs when extracting transcript

d2d3047

feat(swingset-tools): add tool to extract bundles

0144ec1

fix(swingset-tools): Sync up slog extract create-vat

d67bec4

Does not include bundle info

mhofman force-pushed the mhofman/fix-extract-transcript-from-kernel-db branch from d3a7e00 to d67bec4 Compare April 18, 2023 19:29

mergify bot merged commit c966474 into master Apr 18, 2023

mergify bot deleted the mhofman/fix-extract-transcript-from-kernel-db branch April 18, 2023 20:09

mhofman mentioned this pull request Apr 23, 2023

add transcript events: init, snapshot save/load, shutdown #7484

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix extract transcript #7434

Fix extract transcript #7434

mhofman commented Apr 17, 2023 •

edited

Loading

mhofman commented Apr 17, 2023

warner left a comment

warner Apr 18, 2023

warner Apr 18, 2023

mhofman Apr 18, 2023

warner Apr 18, 2023

mhofman Apr 18, 2023

warner Apr 18, 2023

mhofman Apr 18, 2023

warner Apr 18, 2023

mhofman Apr 18, 2023

warner Apr 18, 2023

mhofman Apr 18, 2023

		transcriptNum += 1;
		const entry = { transcriptNum, d: delivery, syscalls };

		}
		} else if (/(supervisor\|lockdown)(B\|-b)undle/.test(vatName)) {

Fix extract transcript #7434

Fix extract transcript #7434

Conversation

mhofman commented Apr 17, 2023 • edited Loading

Description

Security Considerations

Scaling Considerations

Documentation Considerations

Testing Considerations

mhofman commented Apr 17, 2023

warner left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mhofman commented Apr 17, 2023 •

edited

Loading