-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CPU EP] optimize qlinearsoftmax #22686
base: main
Are you sure you want to change the base?
Conversation
32a8c8c
to
6df1676
Compare
4233379
to
f7d97e1
Compare
Could you also include cycles for Softmax using the existing microbenchmark or by adding a new one?" |
What do you mean by " include cycles for Softmax" ? Mlas BENCHMARK_CAPTURE should have timer function to estimate the time cost. |
Given that the unoptimized QLinearSoftmax is slower than Softmax, I wanted to include the Softmax number as well. |
Description
X86-64
AVX2/AVX512
ARM
Not implemented
Improves
Motivation and Context