i#6635 core filter, part 6: Add core-sharded record filter output #6704

derekbruening · 2024-03-12T01:56:32Z

Multiple changes to allow the record filter to operate in core-sharded fashion:

Makes the pc2encoding table per-input, as one input can migrate across multiple core shards and thus one core can see a later instruction without ever having seen its encoding. To handle synchronization, there is no C++11 std:: rwlock, so we use mutexes -- but we limit their use to per-context-switch for the added global lock, and we assume there is no contention for the per-input lock as only one shard operates on one input at any one time.

Sets the memref counter reader to core_sharded_ to avoid asserts.

Appends footer records to ending-in-idle-record cores.

Adds an error check ensuring a single workload, as multiple will require expanding the keys used in some tables.

Renames the output files to include "core.<shard_index>" and not the tid. This is surprisingly complex, as an input filename is needed to determine the output filename compression type: yet not all shards are guaranteed to have an input at the start. A condition variable and mutex are used to coordinate this among shards.

Adds support for started-idle cores by synthesizing headers in record_filter; #6703 covers having the scheduler do this for all analyzers. Adds the version as another field available up front from the scheduler, and adds an idle-tid sentinel needed to be distinct from INVALID_THREAD_ID.

Adds two end-to-end tests, one with a single-threaded app scheduled onto 4 cores to test start-idle cores and one to test multiple threads. Adds a macro to share code with the existing end-to-end test.

Updates the unit test mock classes.

Issue: #6635, #6703

Multiple changes to allow the record filter to operate in core-sharded fashion: Makes the pc2encoding table per-input, as one input can migrate across multiple core shards and thus one core can see a later instruction without ever having seen its encoding. To handle synchronization, there is no C++11 std:: rwlock, so we use mutexes -- but we limit their use to per-context-switch for the added global lock, and we assume there is no contention for the per-input lock as only one shard operates on one input at any one time. Sets the memref counter reader to core_sharded_ to avoid asserts. Appends footer records to ending-in-idle-record cores. Adds an error check ensuring a single workload, as multiple will require expanding the keys used in some tables. Renames the output files to include "core.<shard_index>" and not the tid. This is surprisingly complex, as an input filename is needed to determine the output filename compression type: yet not all shards are guaranteed to have an input at the start. A condition variable and mutex are used to coordinate this among shards. Adds support for started-idle cores by synthesizing headers in record_filter; #6703 covers having the scheduler do this for all analyzers. Adds the version as another field available up front from the scheduler, and adds an idle-tid sentinel needed to be distinct from INVALID_THREAD_ID. Adds two end-to-end tests, one with a single-threaded app scheduled onto 4 cores to test start-idle cores and one to test multiple threads. Adds a macro to share code with the existing end-to-end test. Updates the unit test mock classes. Issue: #6635, #6703

abhinav92003

Still trying to understand some things. So will do another round of review.

clients/drcachesim/reader/reader.cpp

clients/drcachesim/scheduler/scheduler.cpp

clients/drcachesim/tools/filter/record_filter.cpp

clients/drcachesim/tools/filter/record_filter.h

clients/drcachesim/common/utils.h

clients/drcachesim/tools/filter/record_filter.cpp

…parate concerns

…re-sharded-record-filter

derekbruening · 2024-03-13T00:02:02Z

Failure is #4167 "Invalid trace entry type thread_exit before a bundle"

…cidentally left in place in PR #6704

Improves record_filter_t subclass support for initially-idle cores by refactoring get_output_basename() out of initialize_shard_output(), allowing a subclass to share the complex initial setup while still using its own output scheme. Also moves the setup variable to protected for access to output_ext_ in subclasses. Removes code that was refactored into initialize_shard_output() but accidentally left in place in PR #6704. Tested internally. Issue: #6635

derekbruening added 4 commits March 11, 2024 21:56

Add template for new test

b91f9db

Fix Windows 32-bit warning

fa2c3ed

Check more than just version for setting header values on skip

f6f14b2

derekbruening requested a review from abhinav92003 March 12, 2024 05:16

abhinav92003 requested changes Mar 12, 2024

View reviewed changes

derekbruening added 8 commits March 12, 2024 17:48

Fix typo: 'not not'

6c2e798

Review request: add found_filetype_ var

dcd2d2a

Review requests: clarifying comments

a951f9b

Review request: Split initialize_shard_output out of get_writer to se…

8233d99

…parate concerns

Review request: Use inline brace initializers to synthesize headers

9df79c1

Review request: use stream name for diagnostic

c9dff16

Review requests: add clarifying comments

75c3d67

Merge branch 'master' of github.com:DynamoRIO/dynamorio into i6635-co…

d789708

…re-sharded-record-filter

derekbruening requested a review from abhinav92003 March 13, 2024 00:02

abhinav92003 approved these changes Mar 13, 2024

View reviewed changes

derekbruening merged commit 6d7b1a4 into master Mar 13, 2024
15 of 16 checks passed

derekbruening deleted the i6635-core-sharded-record-filter branch March 13, 2024 00:25

derekbruening added a commit that referenced this pull request Mar 14, 2024

Remove code that was refactored into initialize_shard_output() but ac…

5e80113

…cidentally left in place in PR #6704

derekbruening mentioned this pull request Mar 14, 2024

i#6635 core filter, part 7: Improve init-idle subclass support #6707

Merged

derekbruening mentioned this pull request Mar 27, 2024

i#6734: Set filetype in record_filter start-idle shards #6735

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

i#6635 core filter, part 6: Add core-sharded record filter output #6704

i#6635 core filter, part 6: Add core-sharded record filter output #6704

derekbruening commented Mar 12, 2024

abhinav92003 left a comment

derekbruening commented Mar 13, 2024

i#6635 core filter, part 6: Add core-sharded record filter output #6704

i#6635 core filter, part 6: Add core-sharded record filter output #6704

Conversation

derekbruening commented Mar 12, 2024

abhinav92003 left a comment

Choose a reason for hiding this comment

derekbruening commented Mar 13, 2024