Deduplicate instruction prefixes when disassembling #43

RyanGlScott · 2022-06-29T19:10:30Z

Previously, the opcode lookup table would encode every possible permutation of allowable prefixes for each instruction as a separate path. This is expensive in both space and time, as observed in #40. The new approach taken in this patch, as described in Note [x86_64 disassembly] in Flexdis86.Disassembler, is to encode only the VEX prefix and opcode bytes in the lookup table, leaving out all other forms of prefix bytes entirely. Instead, disassembly will start by eagerly parsing as many prefix bytes as possible, proceeding to parse opcode bytes after the first non-prefix byte is encountered. After identifying the candidate instructions from the opcode, we will then narrow down exactly which instruction it is by validating each candidate against the set of parsed prefixes.

As noted in Note [x86_64 disassembly], we had to add some special cases for nop-like instructions—namely, endbr32, endbr64, pause, and xchg—to avoid some prefix byte–related ambiguity. The new handling for xchg is more accurate than it was before, so this patch fixes #42 as a side effect. This patch also addresses part (1) of #40 in that it should reduce the amount of memory that the lookup table uses, although there is potentially more work to be done (see part (2) of #40).
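To make the strategy described above concrete, here is a minimal sketch of the three steps (eager prefix parsing, opcode-only lookup, then validation against the parsed prefixes). It is not the code in `Flexdis86.Disassembler`: the names `Prefixes`, `parsePrefixes`, `opcodeTable`, and `disassemble` are hypothetical, only three prefix kinds are modelled, and the table contains a single opcode byte (`0x90`) to illustrate why the `nop`-like instructions need care.

```haskell
import Data.List (find)
import Data.Word (Word8)

-- Hypothetical, heavily simplified prefix set (assumption: the real
-- flexdis86 types track many more prefixes than this).
data Prefixes = Prefixes
  { hasRep    :: Bool         -- 0xF3 was seen
  , hasOpSize :: Bool         -- 0x66 was seen
  , rexByte   :: Maybe Word8  -- last REX byte (0x40-0x4F) seen, if any
  } deriving (Eq, Show)

noPrefixes :: Prefixes
noPrefixes = Prefixes False False Nothing

-- Step 1: eagerly consume prefix bytes, stopping at the first
-- non-prefix byte. Simplification: this sketch does not enforce that
-- REX must immediately precede the opcode.
parsePrefixes :: [Word8] -> (Prefixes, [Word8])
parsePrefixes = go noPrefixes
  where
    go ps (0xF3 : bs) = go ps { hasRep = True } bs
    go ps (0x66 : bs) = go ps { hasOpSize = True } bs
    go ps (b : bs)
      | b >= 0x40 && b <= 0x4F = go ps { rexByte = Just b } bs
    go ps bs = (ps, bs)

-- A candidate is a mnemonic plus a predicate over the parsed prefixes.
type Candidate = (String, Prefixes -> Bool)

-- Step 2: the table is keyed on opcode bytes only, so each opcode
-- appears once, however many prefix combinations it allows.
opcodeTable :: [Word8] -> [Candidate]
opcodeTable (0x90 : _) =
  [ ("pause", hasRep)                           -- F3 90
  , ("xchg", \ps -> maybe False odd (rexByte ps)) -- REX.B (bit 0) set
  , ("nop", const True)                         -- plain 90
  ]
opcodeTable _ = []

-- Step 3: pick the first candidate whose prefix requirements hold.
-- The candidate ordering is a choice made for this sketch only.
disassemble :: [Word8] -> Maybe String
disassemble bytes =
  let (ps, rest) = parsePrefixes bytes
  in fst <$> find (\(_, ok) -> ok ps) (opcodeTable rest)
```

Under these assumptions, `disassemble [0x90]`, `disassemble [0xF3, 0x90]`, and `disassemble [0x41, 0x90]` select `nop`, `pause`, and `xchg` respectively, even though all three share the same single table entry.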

RyanGlScott · 2022-06-29T19:11:19Z

Still to come is a comparison of profiling reports for large SAW proofs before and after this patch to compare the memory usage. In the meantime, this patch is in a suitable state for review.

src/Flexdis86/Disassembler.hs

Previously, the opcode lookup table would encode every possible permutation of allowable prefixes for each instruction as a separate path. This is expensive in both space and time, as observed in #40. The new approach taken in this patch, as described in `Note [x86_64 disassembly]` in `Flexdis86.Disassembler`, is to encode only the VEX prefix and opcode bytes in the lookup table, leaving out all other forms of prefix bytes entirely. Instead, disassembly will start by eagerly parsing as many prefix bytes as possible, proceeding to parse opcode bytes after the first non-prefix byte is encountered. After identifying the candidate instructions from the opcode, we will then narrow down exactly which instruction it is by validating each candidate against the set of parsed prefixes.

As noted in `Note [x86_64 disassembly]`, we had to add some special cases for `nop`-like instructions (namely `endbr32`, `endbr64`, `pause`, and `xchg`) to avoid some ambiguity related to prefix bytes. The new handling for `xchg` is more accurate than it was before, so this patch fixes #42 as a side effect. This patch also addresses part (1) of #40 in that it should reduce the amount of memory that the lookup table uses, although there is potentially more work to be done (see part (2) of #40).

RyanGlScott · 2022-07-01T18:18:33Z

Here is a heap profiling report for the same SAW proof as in #40 (comment), but with the patch in #43 applied:

flexdis86-patch.pdf

mkX64Disassembler doesn't even show up in this report's top offenders, which is exciting. I think this is as good a sign as any that this patch does in fact reduce memory usage in practice.

This patch brings in the changes from GaloisInc/flexdis86#43, which redesigns the x86_64 disassembler in `flexdis86` to use a substantially smaller lookup table, thereby saving a fair bit of resident memory when doing x86-related proofs.

RyanGlScott requested a review from travitch June 29, 2022 19:10

travitch reviewed Jun 30, 2022

travitch approved these changes Jun 30, 2022

RyanGlScott force-pushed the T40-part-one branch from 2466b43 to 9ec23fa Compare June 30, 2022 23:02

RyanGlScott merged commit 7cb5fc6 into main Jul 1, 2022

RyanGlScott deleted the T40-part-one branch July 1, 2022 18:18

RyanGlScott mentioned this pull request Jul 1, 2022

The generated parse table consumes too much memory #40

Open

RyanGlScott mentioned this pull request Jul 1, 2022

Reduce memory use of x86 disassembly GaloisInc/saw-script#1697

Merged

RyanGlScott mentioned this pull request Sep 30, 2022

SAW memory performance improvements mega-issue GaloisInc/saw-script#1745

Open
