Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
revise retired SSE/AVX flops events def for AMD Zen4 (#216)
Summary: Pull Request resolved: #216 sse/avx flops event config in linux perf tool is different from the one defined in hbt perf tool fp_ret_sse_avx_ops.all uses umask 0x1f, while hbt uses umask 0x0f according to AMD manual: {F1325336882} bit 4 is used to determine if bfloat mac should be counted as 2 ops. this should be true to provide consistent behavior so this diff make Zen3 and Zen4 machines use different event to monitor SSE/AVX FLOPs Reviewed By: bigzachattack Differential Revision: D52861397 fbshipit-source-id: 4c2acbee9742a15db36b8bf0a26f1af946b745d5
- Loading branch information