Disable Avx512 #408

mbkumar · 2024-04-23T14:25:40Z

Intel is deprecating AVX512 from their chips. And AMD implementation of AVX512 is a non-native implementation with two 256 bit registers concatenated. The current AVX512 optimization code is not working when compiled with Intel compilers because xsimd is generating 4 doubles for simd_t when Intel compiler is used. However the disabled code is expecting 8 doubles. We can forego this optimization for more portability.

codecov · 2024-04-23T16:24:27Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 91.55%. Comparing base (d830719) to head (f595c3d).

Additional details and impacted files

@@           Coverage Diff           @@
##           master     #408   +/-   ##
=======================================
  Coverage   91.55%   91.55%           
=======================================
  Files          74       74           
  Lines       12911    12911           
=======================================
  Hits        11821    11821           
  Misses       1090     1090

Flag	Coverage Δ
unittests	`91.55% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

andrewgiuliani · 2024-04-24T14:35:40Z

This is a neat optimization that would be a shame to remove. I can't reproduce the issue locally because I have an 11th generation intel CPU.

The current AVX512 optimization code is not working when compiled with Intel compilers because xsimd is generating 4 doubles for simd_t when Intel compiler is used. However the disabled code is expecting 8 doubles.

I don't understand here, could you give more details?

andrewgiuliani · 2024-04-24T14:35:13Z

src/simsoptpp/simdhelpers.h

@@ -129,6 +129,7 @@ using AlignedPaddedVec = std::vector<double, AlignedPaddedAllocator<double>>;
 #endif

 #if defined(USE_XSIMD)
+/*
 #if __AVX512F__ 


If AVX-512 is disabled, shouldn't __AVX512F__ not be defined, and this section of code would be avoided?

mbkumar · 2024-04-24T15:01:10Z

AMD implements AVX-512, but not in its native format. It uses two 256 bit registers to mimic 512 bit wide register expected by AVX-512 code. When I am using intel compilers on AMD Epyc processors, `simd_t` has only 4 doubles despite Epyc having AVX-512F support. But the disabled code expects 8 doubles for simd_t. So the code is not compiling on new AMD chips when I am using Intel compilers. No issues with gcc though. So, this could be an Intel compiler specific bug. However Intel compilers still outperform gcc on AMD chips. So this loss of optimization shouldn't be a big deal. We can try isolating this specific condition of the Intel compiler + AMD zen v4 chips. But Intel is also deprecating AVX-512 in their line of chips. It is going towards high efficiency E-cores in place of high width registers which consume a lot of power. So I felt it may not be worth the effort to have multiple branches targeting various scenarios. With most of the new HPC machines using AMD zen4 chips, this could lead to a lot of users complaining and more maintenance requests. We can keep the code commented for the time being or keep the PR open and do nothing. I am OK either way. Bharat Medasani

…

On Wed, Apr 24, 2024 at 10:36 AM Andrew Giuliani ***@***.***> wrote: This is a neat optimization that would be a shame to remove. I can't reproduce the issue locally because I have an 11th generation intel CPU. The current AVX512 optimization code is not working when compiled with Intel compilers because xsimd is generating 4 doubles for simd_t when Intel compiler is used. However the disabled code is expecting 8 doubles. I don't understand here, could you give more details? — Reply to this email directly, view it on GitHub <#408 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AA62VEHSVWOYPBNP45VWGNTY667NFAVCNFSM6AAAAABGVBTSSOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDANZVGEYDGMRVGI> . You are receiving this because you authored the thread.Message ID: ***@***.***>

andrewgiuliani · 2024-04-24T17:22:06Z

thanks for the explanation, I'm ok with merging this PR

mbkumar added 3 commits April 3, 2024 16:45

Tmp commit to license

ee590e3

Merge branch 'master' of github.com:hiddenSymmetries/simsopt

095e4c3

Disable avx512 code in simsopt

f595c3d

landreman self-requested a review April 24, 2024 10:06

landreman approved these changes Apr 24, 2024

View reviewed changes

landreman requested a review from andrewgiuliani April 24, 2024 10:08

andrewgiuliani reviewed Apr 24, 2024

View reviewed changes

andrewgiuliani self-requested a review April 24, 2024 17:22

andrewgiuliani approved these changes Apr 24, 2024

View reviewed changes

mbkumar merged commit fe1087e into master Apr 29, 2024
46 of 47 checks passed

mbkumar deleted the avx512 branch April 29, 2024 21:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Disable Avx512 #408

Disable Avx512 #408

mbkumar commented Apr 23, 2024

codecov bot commented Apr 23, 2024

andrewgiuliani commented Apr 24, 2024

andrewgiuliani Apr 24, 2024

mbkumar commented Apr 24, 2024 via email

andrewgiuliani commented Apr 24, 2024

Disable Avx512 #408

Disable Avx512 #408

Conversation

mbkumar commented Apr 23, 2024

codecov bot commented Apr 23, 2024

Codecov Report

andrewgiuliani commented Apr 24, 2024

andrewgiuliani Apr 24, 2024

Choose a reason for hiding this comment

mbkumar commented Apr 24, 2024 via email

andrewgiuliani commented Apr 24, 2024