[LLVMGPU] Set prefetching on translation info #17744
@kuhar After this patch, you will need to change your tuning script to put prefetching in the tuning config explicitly.
cc: @RattataKing ☝️
FYI this improved the SDXL benchmarks we run in this repo (on MI250 using ROCm):
(we set
This would mean that not using prefetching on the tuned configurations is actually better. Adding a parameter for prefetching to the tuning script might be a good idea. @kuhar @RattataKing
Benchmark metrics improved (when using `--iree-llvmgpu-enable-prefetch=true`), so locking in the improvements.
Context: #17744 (comment)
Presubmit results: https://github.com/iree-org/iree/actions/runs/9765047731/attempts/1#summary-26955236756
This patch makes `prefetch_shared_memory` part of the `translation_info` config dictionary, allowing us to control prefetching at the dispatch level instead of turning it on or off globally. Prefetching is still off by default; the flag makes KernelConfig add the `prefetch_shared_memory` unit attribute to the config dictionary.
Signed-off-by: Lubo Litchev <[email protected]>
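To illustrate the behavior described above, here is a minimal Python sketch of the idea, not IREE code. The names `TranslationInfo`, `build_translation_info`, `should_prefetch`, and the pipeline string are hypothetical stand-ins. The only thing taken from the patch description is the `prefetch_shared_memory` key, and that its presence in the per-dispatch config dictionary (rather than a global switch) is what turns prefetching on.

```python
from dataclasses import dataclass, field


@dataclass
class TranslationInfo:
    """Hypothetical stand-in for a per-dispatch translation_info."""
    pipeline: str
    config: dict = field(default_factory=dict)


def build_translation_info(pipeline: str, enable_prefetch: bool = False) -> TranslationInfo:
    """Model of the flag-driven default: off unless the flag is set."""
    config = {}
    if enable_prefetch:
        # A unit attribute carries no value; its presence alone means "on".
        config["prefetch_shared_memory"] = None
    return TranslationInfo(pipeline, config)


def should_prefetch(info: TranslationInfo) -> bool:
    """The decision is made per dispatch by inspecting its own config."""
    return "prefetch_shared_memory" in info.config


default_info = build_translation_info("example_pipeline")
flagged_info = build_translation_info("example_pipeline", enable_prefetch=True)

# A tuned config that bypasses the flag has to add the key itself.
tuned_info = TranslationInfo("example_pipeline", {"prefetch_shared_memory": None})
```

In this model, a tuning script that writes its own config dictionary gets no prefetching unless it adds the key explicitly, which matches the note to the tuning script owners earlier in the thread.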