Support AVX-512 for SIMD unary operators, softmax #131

robertknight · 2024-04-28T07:13:48Z

Support AVX-512 for Exp, Sigmoid, Tanh, Erf, Softmax and other operators which use SIMD-accelerated implementations in rten-vecmath.

The performance benefit is small because these operations are limited by memory bandwidth, but doing this revealed some issues with supporting different instruction sets on the same architecture. Usage of `#[inline]` and `#[target_feature]` had to be corrected to ensure that SIMD intrinsics are actually inlined reliably. Otherwise, the function call overhead negates the benefits of SIMD entirely.

The `SimdInt` and `SimdFloat` docs have been updated to explain the rules around inlining and target features.
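
The inlining rule described above can be illustrated with a minimal sketch. It is not code from rten-vecmath: the names `scale_shift_generic`, `scale_shift_avx2` and `scale_shift_baseline` are hypothetical, and a plain loop stands in for the real SIMD kernels. The idea is that the shared kernel carries no `#[target_feature]` of its own and is marked `#[inline(always)]`, so it is compiled as part of whichever wrapper calls it, with that wrapper's features.

```rust
/// Hypothetical generic kernel shared by all instruction sets.
/// It has no `#[target_feature]` attribute. `#[inline(always)]` asks the
/// compiler to inline it into each caller, so the loop is compiled with the
/// target features of that caller.
#[inline(always)]
fn scale_shift_generic(xs: &mut [f32], scale: f32, shift: f32) {
    for x in xs.iter_mut() {
        *x = *x * scale + shift;
    }
}

/// AVX2 entry point. Only this thin wrapper enables the feature.
///
/// # Safety
/// The caller must ensure the CPU supports AVX2.
#[cfg(target_arch = "x86_64")]
#[target_feature(enable = "avx2")]
pub unsafe fn scale_shift_avx2(xs: &mut [f32], scale: f32, shift: f32) {
    scale_shift_generic(xs, scale, shift)
}

/// Baseline entry point, compiled with the crate's default target features.
pub fn scale_shift_baseline(xs: &mut [f32], scale: f32, shift: f32) {
    scale_shift_generic(xs, scale, shift)
}
```

Each wrapper produces the same results; only the instructions the compiler is allowed to emit inside it differ.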


Remove the `#[target_feature]` attributes from the generic `erf` and `tanh` functions. Previously they were compiled with AVX2 instructions even when called from the fallback (non-AVX2) dispatch path. This change follows the pattern established in #131, which fixed the issue for the Exp, Sigmoid and Softmax operators.
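
A rough sketch of the dispatch shape this fix relies on, assuming a runtime check with `is_x86_feature_detected!`. The function names (`clamp_generic`, `clamp_avx2`, `clamp_fallback`, `clamp`) are hypothetical and the kernel is a simple stand-in for `erf` or `tanh`. The point is that the generic function has no `#[target_feature]`, so the fallback path is never compiled with AVX2 instructions.

```rust
/// Hypothetical generic kernel. It deliberately has no `#[target_feature]`:
/// if it enabled AVX2 itself, the fallback path below could execute AVX2
/// instructions on a CPU that does not support them.
#[inline(always)]
fn clamp_generic(xs: &mut [f32], lo: f32, hi: f32) {
    for x in xs.iter_mut() {
        *x = x.max(lo).min(hi);
    }
}

/// AVX2 wrapper: the only place where the feature is enabled.
#[cfg(target_arch = "x86_64")]
#[target_feature(enable = "avx2")]
unsafe fn clamp_avx2(xs: &mut [f32], lo: f32, hi: f32) {
    clamp_generic(xs, lo, hi)
}

/// Fallback wrapper, compiled with the default target features only.
fn clamp_fallback(xs: &mut [f32], lo: f32, hi: f32) {
    clamp_generic(xs, lo, hi)
}

/// Public entry point that picks an implementation at runtime.
pub fn clamp(xs: &mut [f32], lo: f32, hi: f32) {
    #[cfg(target_arch = "x86_64")]
    {
        if is_x86_feature_detected!("avx2") {
            // SAFETY: AVX2 support was checked on the line above.
            unsafe { clamp_avx2(xs, lo, hi) };
            return;
        }
    }
    clamp_fallback(xs, lo, hi);
}
```

Callers only use `clamp`; on a pre-AVX2 x64 CPU or a non-x86 target the call goes through `clamp_fallback`.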

robertknight merged commit 20c77d7 into main Apr 28, 2024
2 checks passed

robertknight deleted the avx512-vecmath branch April 28, 2024 07:17

This was referenced Apr 29, 2024

Fix crash in Erf, Tanh operators under pre-AVX2 x64 CPUs #134

Merged

[CLI] Illegal instruction (core dumped) under Proxmox VM robertknight/ocrs#52

Closed
