[CUDA] Update benchmark_mha.py to capture debug info to identify sdpa kernel #21804

tianleiwu · 2024-08-20T20:38:59Z

Description

Use debug info to identify sdpa kernel actually used, and show it in the output of benchmark_mha.py. This updated benchmark script was used to get the benchmark results in #21629.
(1) Change the output format of debug info to output like SdpaKernel=*
(2) Add a step to capture stdout from onnxruntime session, and use regular expression to parse SdpaKernel=* from the captured text.

Other minor changes:
(1) Set different default repeats during benchmark: 100 for CPU; and 10000 for CUDA.
(2) Fix PrintTensorByDims used in console dumper: if it is not enabled, do not dump tensor.
(3) Update some comments

Motivation and Context

Sometime, we will use fallback for a sdpa_kernel. It could confuse user unless we can tell exact kernel is used in benchmark.

onnxruntime/test/python/transformers/benchmark_mha.py

use debug info to identify sdpa_kernel in benchmark_mha

c94f18d

tianleiwu requested review from yufenglee, wangyems and kunal-vaishnavi August 20, 2024 20:39

github-advanced-security bot found potential problems Aug 20, 2024

View reviewed changes

onnxruntime/test/python/transformers/benchmark_mha.py Fixed Show fixed Hide fixed

fix warnings that captured may be used before it is initialized

4083416

github-advanced-security bot found potential problems Aug 20, 2024

View reviewed changes

onnxruntime/test/python/transformers/benchmark_mha.py Fixed Show fixed Hide fixed

initialize captured_text

35410c7

wangyems approved these changes Aug 22, 2024

View reviewed changes

tianleiwu merged commit 25d7a4f into main Aug 22, 2024
95 of 97 checks passed

tianleiwu deleted the tlwu/benchmark_mha_kernel_from_debug_info branch August 22, 2024 00:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CUDA] Update benchmark_mha.py to capture debug info to identify sdpa kernel #21804

[CUDA] Update benchmark_mha.py to capture debug info to identify sdpa kernel #21804

tianleiwu commented Aug 20, 2024 •

edited

Loading

[CUDA] Update benchmark_mha.py to capture debug info to identify sdpa kernel #21804

[CUDA] Update benchmark_mha.py to capture debug info to identify sdpa kernel #21804

Conversation

tianleiwu commented Aug 20, 2024 • edited Loading

Description

Motivation and Context

tianleiwu commented Aug 20, 2024 •

edited

Loading