Create branch according to cpu core uarch #10521

chenfucn · 2022-02-10T22:50:49Z

Description:

This is a preparation change for a bigger goal.

On ARM64 CPUs with Big.Little, different cores are always the same architecture but different micro-architecture. Specifically, it is often that the little core has narrow memory buses that makes 128b load very slow. While if we always use 64b load in our kernels, the code will run slower on big cores. As a result, we need to run different code on different cores to achieve better performance.

This change constructs a manifold that pivot based on the core micro-architecture of the current core, so that we can develop and call different kernels accordingly.

onnxruntime/core/mlas/lib/mlasi.h

yufenglee

Prev merged pull request has a bug: #10521 It was aimed to detect current CPU core micro-architecture and select a best suited kernel. Unfortunately it assumes that a thread can never migrate from one core to another. This change tries to fix that problem. It introduces about 2-5% performance degradation on symmetric quantized matmul Co-authored-by: Chen Fu <[email protected]>

yufenglee reviewed Feb 10, 2022

View reviewed changes

onnxruntime/core/mlas/lib/mlasi.h Show resolved Hide resolved

yufenglee reviewed Feb 10, 2022

View reviewed changes

onnxruntime/core/mlas/lib/mlasi.h Outdated Show resolved Hide resolved

yufenglee reviewed Feb 10, 2022

View reviewed changes

onnxruntime/core/mlas/lib/mlasi.h Show resolved Hide resolved

Create branch according to cpu core uarch

8c83585

chenfucn force-pushed the cfu_uarch branch from 97a8f10 to 8c83585 Compare February 14, 2022 16:37

yufenglee approved these changes Feb 14, 2022

View reviewed changes

chenfucn merged commit 58f80c1 into microsoft:master Feb 14, 2022

chenfucn deleted the cfu_uarch branch February 14, 2022 23:16

chenfucn mentioned this pull request Feb 23, 2022

fix bug: getting current cpu core type #10630

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create branch according to cpu core uarch #10521

Create branch according to cpu core uarch #10521

chenfucn commented Feb 10, 2022

yufenglee left a comment

Create branch according to cpu core uarch #10521

Create branch according to cpu core uarch #10521

Conversation

chenfucn commented Feb 10, 2022

yufenglee left a comment

Choose a reason for hiding this comment