Slight perf optimization for infer TBE #843

Closed
wants to merge 1 commit into from

@jianyuh (Member) commented Jan 2, 2022

Summary: ~5% performance improvement for INT4 / INT8 inference TBE (table batched embedding) on A100 GPUs.

Differential Revision: D33388153

@facebook-github-bot (Contributor) commented

This pull request was exported from Phabricator. Differential Revision: D33388153

jianyuh added a commit to jianyuh/FBGEMM that referenced this pull request Jan 2, 2022
Summary:
Pull Request resolved: pytorch#843

~5% perf improvement for INT4 / INT8 inference TBE on A100 GPUs.

Differential Revision: D33388153

fbshipit-source-id: 7d3734a0e283568e7ba9edeadd4f543aa2f330ee

Summary:
Pull Request resolved: pytorch#843

~5% perf improvement for INT4 / INT8 inference TBE on A100 GPUs.

Differential Revision: D33388153

fbshipit-source-id: 7571d6fe7f83e1c1cdce738f66795cc82473b99c