You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There is an inconsistency in the reported GPU free memory between the Intel Compute Runtime and tools such as xpu-smi. When using the Intel Compute Runtime on Intel Arc(TM) A770 Graphics, the reported free memory value is incorrect, consistently showing the same value as the total memory, even when memory is being consumed. This issue was observed in both Python (dpctl) and a standalone C++ executable.
Steps to Reproduce
Set up an environment with the Intel Compute Runtime and xpu-smi installed.
Execute the compiled binary with the environment variable ZES_ENABLE_SYSMAN set to 1:
export ZES_ENABLE_SYSMAN=1
./mem.x
Compare the output with the results from xpu-smi:
xpu-smi stats -d 0
Observed Behavior
The C++ code consistently reports the same value for global_mem_size and free_memory, implying 0 bytes of used memory, even when memory is being consumed by the GPU. In contrast, xpu-smi correctly reports non-zero GPU memory usage.
Expected Behavior
The free_memory value reported by the Intel Compute Runtime should reflect the actual free memory, showing a decrease when GPU memory is used, consistent with the output from xpu-smi.
Environment Details
OS: HiveOS (Based on Ubuntu 20.04 and 22.04)
GPU: Intel(R) Arc(TM) A770 Graphics
GPU driver versions tested:
1.3.27642
1.3.29735
Intel Compute Runtime: Relevant versions for the above drivers
Compiler: Intel DPC++/C++ Compiler (icpx)
Additional Information
This issue is tracked in the dpctl repository here. The problem appears to stem from the GPU driver or the Intel Compute Runtime itself, as confirmed by running a standalone C++ executable.
Please let me know if further information or testing is required. Thank you for investigating this issue.
The text was updated successfully, but these errors were encountered:
avimanyu786
changed the title
Incorrect Free Memory Reporting by Intel Compute Runtime for Intel Arc(TM) A770 Graphics
Incorrect Free Memory Reporting for Intel Arc(TM) A770 Graphics
Aug 1, 2024
For more added context, xpu-smi fetches the value of XPUM_STATS_MEMORY_USED to report the used GPU memory. I found this when I searched for "GPU Memory Used" in the https://github.com/intel/xpumanager repository.
Description
There is an inconsistency in the reported GPU free memory between the Intel Compute Runtime and tools such as
xpu-smi
. When using the Intel Compute Runtime on Intel Arc(TM) A770 Graphics, the reported free memory value is incorrect, consistently showing the same value as the total memory, even when memory is being consumed. This issue was observed in both Python (dpctl
) and a standalone C++ executable.Steps to Reproduce
xpu-smi
installed.mem.cpp
:ZES_ENABLE_SYSMAN
set to1
:export ZES_ENABLE_SYSMAN=1 ./mem.x
xpu-smi
:Observed Behavior
The C++ code consistently reports the same value for
global_mem_size
andfree_memory
, implying 0 bytes of used memory, even when memory is being consumed by the GPU. In contrast,xpu-smi
correctly reports non-zero GPU memory usage.Expected Behavior
The
free_memory
value reported by the Intel Compute Runtime should reflect the actual free memory, showing a decrease when GPU memory is used, consistent with the output fromxpu-smi
.Environment Details
icpx
)Additional Information
This issue is tracked in the
dpctl
repository here. The problem appears to stem from the GPU driver or the Intel Compute Runtime itself, as confirmed by running a standalone C++ executable.Please let me know if further information or testing is required. Thank you for investigating this issue.
The text was updated successfully, but these errors were encountered: