You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I encountered this on Orange Pi 5 Plus (RK3588) with Ubuntu 22.04 LTS. The compilation environments (in a Conda env) are:
Python 3.9.21
CMake 3.31.2
Clang 18.1.8
The compilation went successfully, but both run_inference.py (llama-cli) and e2e_benchmark.py (llama-bench) exited with some memory errors. Specifically in the benchmark case, there are still benchmark outputs, but I'm not sure will this affect the measured performance.
Note that there are different errors when benchmarking prefilling (p>0, n=0) and decoding (p=0, n>0).
I encountered this on Orange Pi 5 Plus (RK3588) with Ubuntu 22.04 LTS. The compilation environments (in a Conda env) are:
The compilation went successfully, but both
run_inference.py
(llama-cli) ande2e_benchmark.py
(llama-bench) exited with some memory errors. Specifically in the benchmark case, there are still benchmark outputs, but I'm not sure will this affect the measured performance.Note that there are different errors when benchmarking prefilling (p>0, n=0) and decoding (p=0, n>0).
Here are the corresponding commands and outputs:
This should be a similar issue to #143, but I'm not quite sure.
The text was updated successfully, but these errors were encountered: