feat: Enable SIMD optimizations #107

matthewfeickert · 2024-09-07T21:29:57Z

Add x86_64-microarch-level as a 'build' requirement.
- microarch_level 4 not supported yet so only add level 1 to 3.
- c.f. Easier cross-compiling for level 4? microarch-level-feedstock#5
Set the build number based on the microarch_level.
Add conda_build_config.yaml with microarch_level.
c.f. https://prefix.dev/blog/building_cpu_optimized_packages
Bump build number.

Checklist

Used a personal fork of the feedstock to propose changes
Bumped the build number (if the version is unchanged)
[N/A] Reset the build number to 0 (if the version changed)
Re-rendered with the latest conda-smithy (Use the phrase @conda-forge-admin, please rerender in a comment in this PR for automated rerendering)
Ensured the license file is being packaged.

matthewfeickert · 2024-09-07T21:30:04Z

@conda-forge-admin, please rerender

conda-forge-webservices · 2024-09-07T21:30:06Z

Hi! This is the friendly automated conda-forge-linting service.

I wanted to let you know that I linted all conda-recipes in your PR (recipe/meta.yaml) and found some lint.

Here's what I've got...

For recipe/meta.yaml:

Selectors are suggested to take a <two spaces>#<one space>[<expression>] form. See lines [29]

* Add x86_64-microarch-level as a 'build' requirement. - microarch_level 4 not supported yet so only add level 1 to 3. * Set the build number based on the microarch_level. * Add conda_build_config.yaml with microarch_level. * c.f. https://prefix.dev/blog/building_cpu_optimized_packages * Bump build number.

conda-forge-webservices · 2024-09-07T21:31:02Z

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe/meta.yaml) and found it was in an excellent condition.

…nda-forge-pinning 2024.09.07.19.33.02

matthewfeickert · 2024-09-07T21:49:06Z

@conda-forge/iminuit this is ready for review. Let me know if you have any questions. 👍

henryiii · 2024-09-07T21:53:59Z

Do you know if PR's are actually making anything faster? Some code bases can effectively use vector instructions and get a nice 2x+ speed up and some don't see much benefit.

matthewfeickert · 2024-09-07T21:57:59Z

Do you know if PR's are actually making anything faster? Some code bases can effectively use vector instructions and get a nice 2x+ speed up and some don't see much benefit.

@henryiii Good question. I don't have any explict checks to show at the moment. I'll defer to you if you think this is worth doing here (and for the other ones that I've opened up), and if so I'll do some custom builds and test comparisons when I'm back from work travel in a few weeks.

matthewfeickert · 2024-09-09T07:14:15Z

Placed into draft given that needs to be rebased following PR #108 and needs conda-forge/microarch-level-feedstock#10 to be resolved before can be merged.

HDembinski

I cannot review this, but I appreciate the effort and I think it sound all good.

HDembinski · 2024-09-09T14:32:24Z

That being said, iminuit will not profit from such optimizations, because the minimizing algorithm is typically not the bottleneck, it is the calculation of the cost function, which ideally should be JIT-compiled for the specific architecture.

matthewfeickert · 2024-12-13T05:24:04Z

That being said, iminuit will not profit from such optimizations, because the minimizing algorithm is typically not the bottleneck

Sounds good. No need to add more build complexity if it won't help, so closing this. 👍 Thanks, Hans, for the info!

matthewfeickert force-pushed the feat/add-simd-optimizations branch from f8c62a0 to e5501cd Compare September 7, 2024 21:30

MNT: Re-rendered with conda-build 24.7.1, conda-smithy 3.39.1, and co…

a58dea9

…nda-forge-pinning 2024.09.07.19.33.02

matthewfeickert marked this pull request as ready for review September 7, 2024 21:48

matthewfeickert requested review from HDembinski, bsipocz, chrisburr, henryiii and mwcraig as code owners September 7, 2024 21:48

matthewfeickert mentioned this pull request Sep 7, 2024

feat: Enable SIMD optimizations conda-forge/pythia8-feedstock#49

Merged

4 tasks

matthewfeickert marked this pull request as draft September 9, 2024 07:13

HDembinski approved these changes Sep 9, 2024

View reviewed changes

matthewfeickert closed this Dec 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Enable SIMD optimizations #107

feat: Enable SIMD optimizations #107

matthewfeickert commented Sep 7, 2024 •

edited

Loading

matthewfeickert commented Sep 7, 2024

conda-forge-webservices bot commented Sep 7, 2024

conda-forge-webservices bot commented Sep 7, 2024

matthewfeickert commented Sep 7, 2024

henryiii commented Sep 7, 2024

matthewfeickert commented Sep 7, 2024

matthewfeickert commented Sep 9, 2024

HDembinski left a comment

HDembinski commented Sep 9, 2024

matthewfeickert commented Dec 13, 2024

feat: Enable SIMD optimizations #107

feat: Enable SIMD optimizations #107

Conversation

matthewfeickert commented Sep 7, 2024 • edited Loading

matthewfeickert commented Sep 7, 2024

conda-forge-webservices bot commented Sep 7, 2024

conda-forge-webservices bot commented Sep 7, 2024

matthewfeickert commented Sep 7, 2024

henryiii commented Sep 7, 2024

matthewfeickert commented Sep 7, 2024

matthewfeickert commented Sep 9, 2024

HDembinski left a comment

Choose a reason for hiding this comment

HDembinski commented Sep 9, 2024

matthewfeickert commented Dec 13, 2024

matthewfeickert commented Sep 7, 2024 •

edited

Loading