
vulkan: Handle GPUs with less shared memory #10468

Merged (1 commit) on Nov 27, 2024

Conversation

jeffbolznv (Collaborator) commented Nov 23, 2024

There have been reports of shader compilation failures on systems with <= 32KB of shared memory (e.g. #10037). This change makes the large tile size fall back to a smaller size when necessary, and makes mul_mat_id fall back to the CPU if only 16KB of shared memory is available.

I don't have a real system with these smaller shared memory sizes, but I did force a smaller size to be reported and verified that no validation layer errors occurred.

Fixes #10037.
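The fallback described above can be sketched roughly as follows. This is an illustrative sketch only, not the actual ggml-vulkan code: the type, function, and constant names (`TileSize`, `pick_tile_size`, `mul_mat_id_supported`, the per-tile byte counts) are hypothetical, and the real shader footprints depend on the pipeline configuration.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical tile configurations for the matmul shaders.
enum class TileSize { Large, Medium, Small };

// Illustrative shared-memory footprints (bytes) for each configuration;
// the real values depend on the shader and quantization format.
constexpr uint32_t LARGE_TILE_SMEM  = 49152; // 48KB
constexpr uint32_t MEDIUM_TILE_SMEM = 32768; // 32KB
constexpr uint32_t SMALL_TILE_SMEM  = 16384; // 16KB

// Pick the largest tile configuration that fits within the device's
// reported shared-memory limit (maxComputeSharedMemorySize in Vulkan).
TileSize pick_tile_size(uint32_t max_shared_mem) {
    if (max_shared_mem >= LARGE_TILE_SMEM)  return TileSize::Large;
    if (max_shared_mem >= MEDIUM_TILE_SMEM) return TileSize::Medium;
    return TileSize::Small;
}

// In this sketch, mul_mat_id needs more than 16KB of shared memory,
// so devices at or below that limit fall back to the CPU path.
bool mul_mat_id_supported(uint32_t max_shared_mem) {
    return max_shared_mem > 16384;
}
```

With this structure, a device reporting 32KB gets the medium tile size, and a device reporting only 16KB gets the small tiles for regular matmul while mul_mat_id is routed to the CPU.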

@0cc4m 0cc4m merged commit 5b3466b into ggml-org:master Nov 27, 2024
54 checks passed
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Dec 20, 2024
Successfully merging this pull request may close these issues.

Bug: Vulkan backend freezes during its execution