Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

default to clang for faster CPU performance #666

Merged
merged 1 commit into from
Nov 26, 2024
Merged

Conversation

yshekel
Copy link
Collaborator

@yshekel yshekel commented Nov 26, 2024

Clang generates better code for CPU backend.

From a few measurements:
EC-addition is now ~3X faster
ECNTT ~3X faster
MSM ~2X faster
NTT ~1.2X faster

Edit: some loop unrolling improves EC-add ~3X more with clang for Montgomery.

Copy link
Contributor

@mickeyasa mickeyasa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

great job

@omershlo
Copy link
Member

amazing!

Copy link
Collaborator

@jeremyfelder jeremyfelder left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🔥 🚀

@yshekel yshekel merged commit bcba346 into main Nov 26, 2024
29 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants