i#4014 dr$sim phys: Add offline -use_physical support #5516

derekbruening · 2022-06-03T22:52:32Z

Adds support for physical addresses to offline dr$sim traces. To
support simulators wanting both virtual and physical addresses, and to
simplify post-processing where the virtual PC values are needed, the
regular trace entries remain all virtual. A new marker type
TRACE_MARKER_TYPE_PHYSICAL_ADDRESS listing the corresponding physical
address is added. The mappings are assumed to not change, allowing
just one marker for each newly-observed page. This is done
per-thread.
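
As a rough illustration of the one-marker-per-new-page scheme described above, here is a minimal sketch. The names `record_t`, `kind_t`, `page_tracker_t`, and `kPageSize` are hypothetical and are not taken from the DynamoRIO sources; the translation callback stands in for the real lookup, and the 4 KiB page size is an assumption.

```cpp
#include <cassert>
#include <cstdint>
#include <unordered_set>
#include <vector>

// Hypothetical record type for this sketch; not the real trace entry layout.
enum class kind_t { MEMREF, PHYS_MARKER, PHYS_UNAVAILABLE_MARKER };
struct record_t {
    kind_t kind;
    uint64_t value;
};

constexpr uint64_t kPageSize = 4096; // Assumed page size.

// One instance per thread: remembers which virtual pages this thread has
// already announced, on the assumption that mappings never change.
class page_tracker_t {
public:
    // Stand-in for the real virtual-to-physical lookup; returns false on failure.
    using translate_fn = bool (*)(uint64_t vpage, uint64_t *ppage);

    explicit page_tracker_t(translate_fn fn)
        : translate_(fn)
    {
        assert(translate_ != nullptr);
    }

    // Emits a marker only the first time a page is seen, then the
    // regular entry, which stays virtual.
    void
    record_access(uint64_t vaddr, std::vector<record_t> &out)
    {
        uint64_t vpage = vaddr & ~(kPageSize - 1);
        if (seen_.insert(vpage).second) {
            uint64_t ppage = 0;
            if (translate_(vpage, &ppage))
                out.push_back({ kind_t::PHYS_MARKER, ppage });
            else
                out.push_back({ kind_t::PHYS_UNAVAILABLE_MARKER, vpage });
        }
        out.push_back({ kind_t::MEMREF, vaddr });
    }

private:
    translate_fn translate_;
    std::unordered_set<uint64_t> seen_;
};
```

Emitting an explicit "unavailable" record on failure means a consumer never has to guess whether a page was skipped or simply not yet seen.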

An explicit TRACE_MARKER_TYPE_PHYSICAL_ADDRESS_NOT_AVAILABLE marker is
inserted on failure to translate, so that analyzers do not have to
infer a failure from the absence of an already-sparse marker.

Separately emitted pairs of virtual and physical address markers were
considered, with raw2trace inserting the physical marker at the right
place, but that presents complexities with buffer handoff and with the
first buffer. Instead, the physical address markers are inserted via
memmove directly into the buffer. This does not seem to be a
performance concern: the translation lookup is the bottleneck.
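
The in-buffer insertion can be pictured with the following sketch. It is an illustration only: `entry_t` and `insert_marker` are hypothetical names, the entry layout is invented, and dropping the marker when the buffer is full is an assumption of this sketch rather than the tracer's actual policy.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical fixed-size buffer entry; not the real offline entry layout.
struct entry_t {
    uint64_t tag;
    uint64_t value;
};

// Inserts marker at index pos of a buffer holding count valid entries out of
// capacity slots, shifting the tail up by one entry with memmove.
// Returns the new entry count; if the buffer is full the marker is dropped
// and count is returned unchanged.
size_t
insert_marker(entry_t *buf, size_t count, size_t capacity, size_t pos, entry_t marker)
{
    assert(pos <= count && count <= capacity);
    if (count == capacity)
        return count; // No room left: the marker is dropped in this sketch.
    // Source and destination overlap, so memmove (not memcpy) is required.
    std::memmove(buf + pos + 1, buf + pos, (count - pos) * sizeof(entry_t));
    buf[pos] = marker;
    return count + 1;
}
```

The shift costs time proportional to the number of entries after the insertion point, which is why it matters how often an insertion happens compared with how often a translation is looked up.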

Adds support for the new markers to the view tool.

Adds a Linux x86_64 test that runs a tiny asm app and ensures a
physical address marker is inserted. The test needs to run under sudo,
along with its pre- and post- commands. Currently it is enabled
everywhere, so a user running the tests interactively will see it pause
while it waits for input. This might cause issues with manually
running the test suite.

A number of items remain for further work:

+ Performance is poor: the hashtable and caching need improvement.
+ There is a hardcoded limit on how many markers can be added
  per buffer. Once this is exceeded, further markers are dropped.
  We should split the buffer to handle this.
+ We may want to add a mode that checks for mapping changes.
+ Missing privileges results in every physical address being 0 instead
  of showing the failure. We need to check the capabilities to distinguish.
+ Better testing that we're actually getting physical addresses for online
  tests.
+ Better offline testing with larger apps.
+ Basic blocks that cross a page have only the first page translated.
+ A file descriptor per thread is used, which will not scale well with
  DR's descriptor protection and might hit rlimits.
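
For the privileges item above, the sketch below shows how an all-zero result could be told apart from a real address. It assumes the translation reads 64-bit entries in the Linux `/proc/<pid>/pagemap` format, which this description does not state explicitly: bits 0-54 hold the page frame number, bit 63 is the present bit, and kernels since 4.2 zero the frame number for readers without CAP_SYS_ADMIN. The function name is hypothetical, and treating a zero frame number as a failure is a simpler heuristic than the capability check proposed in the list.

```cpp
#include <cstdint>

constexpr uint64_t kPfnMask = (1ULL << 55) - 1; // Bits 0-54: page frame number.
constexpr uint64_t kPresentBit = 1ULL << 63;    // Bit 63: page present in RAM.

// Decodes one pagemap entry for the page containing vaddr.
// Returns false when no physical address can be derived: either the page is
// not present, or the frame number is zero, which is what an unprivileged
// reader sees.
bool
physical_from_pagemap(uint64_t entry, uint64_t vaddr, uint64_t page_size,
                      uint64_t *paddr)
{
    if ((entry & kPresentBit) == 0)
        return false;
    uint64_t pfn = entry & kPfnMask;
    if (pfn == 0)
        return false; // Heuristic: a zeroed PFN most likely means no privilege.
    *paddr = pfn * page_size + (vaddr % page_size);
    return true;
}
```

A caller that gets `false` here could emit the not-available marker rather than a physical address of 0.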

Issue: #4014


derekbruening · 2022-06-04T02:27:51Z

The failure is the known flake #5514

clients/drcachesim/common/trace_entry.h

clients/drcachesim/tools/view.cpp

clients/drcachesim/tracer/tracer.cpp

clients/drcachesim/tracer/instru_offline.cpp

clients/drcachesim/tracer/physaddr.cpp

clients/drcachesim/tracer/physaddr.h

suite/tests/CMakeLists.txt


derekbruening · 2022-06-07T05:56:03Z

scattergather again failed: #5329

Adds support for physical addresses to offline dr$sim traces. To support simulators wanting both virtual and physical addresses, and to simplify post-processing where the virtual PC values are needed, the regular trace entries remain all virtual. A new marker type TRACE_MARKER_TYPE_PHYSICAL_ADDRESS listing the corresponding physical address is added. The mappings are assumed to not change, allowing just one marker for each newly-observed page. This is done per-thread.

An explicit TRACE_MARKER_TYPE_PHYSICAL_ADDRESS_NOT_AVAILABLE marker is inserted on failure to translate, so that analyzers do not have to infer a failure from the absence of an already-sparse marker.

Separately emitted pairs of virtual and physical address markers were considered, with raw2trace inserting the physical marker at the right place, but that presents complexities with buffer handoff and with the first buffer. Instead, the physical address markers are inserted via memmove directly into the buffer. This does not seem to be a performance concern: the translation lookup is the bottleneck. Since the memmoves occur only on the first instance of each page, they are much rarer than all the virtual-to-physical translations.

Adds support for the new markers to the view tool.

Adds a Linux x86_64 test that runs a tiny asm app and ensures a physical address marker is inserted. The test needs to run under sudo, along with its pre- and post- commands. To avoid a confusing blocking password query in local runs, a new set of tests controlled by a new CMake option RUN_SUDO_TESTS is added. It is set only for automated_ci, where we assume a passwordless sudo.

A number of items remain for further work:

+ Performance is poor: the hashtable and caching need improvement.
+ There is a hardcoded limit on how many markers can be added per buffer. Once this is exceeded, further markers are dropped. We should split the buffer to handle this.
+ We may want to add a mode that checks for mapping changes.
+ Missing privileges results in every physical address being 0 instead of showing the failure. We need to check the capabilities to distinguish.
+ Better testing that we're actually getting physical addresses for online tests.
+ Better offline testing with larger apps.
+ Basic blocks that cross a page have only the first page translated.
+ A file descriptor per thread is used, which will not scale well with DR's descriptor protection and might hit rlimits.
+ Online traces still replace all virtual addresses with physical. We should break compatibility and transition them to use these markers, with dr$sim computing the physical addresses from the markers.

Issue: #4014

derekbruening added 2 commits June 3, 2022 18:52

Fix Windows warning

182b5ba

derekbruening requested a review from abhinav92003 June 4, 2022 02:27

abhinav92003 approved these changes Jun 6, 2022

derekbruening added 3 commits June 7, 2022 00:16

Add RUN_SUDO_TESTS option and set it for automated_ci, to avoid asking for pw locally by default

c41b3bf

Review requests: instr_has-type(); invariant checks; many comment updates

b0ddc64

Fix filter-asm using renamed common.loglevel binary

f79298f

Merge branch 'master' into i4014-phys-offline

6efb61d

derekbruening merged commit d96d930 into master Jun 7, 2022

derekbruening deleted the i4014-phys-offline branch June 7, 2022 15:35
