[CPU][ARM] KleidiAI integration and KleidiAI MM executor #28830
@NishantPrabhuFujitsu Alex fixed the major perf/memory issues. Will you be able to try it again and measure the perf?
Here are the benchmark results for TinyLlama-1.1B-Chat-v1.0 after incorporating the fixes.
The performance is now on par with ACL, and the memory leak is resolved. Thanks!
@NishantPrabhuFujitsu Thanks a lot! We will finalize the PR and merge it once we get approval.
Details:
- `kleidiai` is added as a git submodule
- `kleidiai` is built statically and linked into the CPU plugin library

Tickets: