Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Passing TMA descriptors through grid constant #3066

Closed
wants to merge 1 commit into from

Conversation

htyu
Copy link
Contributor

@htyu htyu commented Sep 3, 2024

Summary:
Improving the TMA kernel by passing the TMA descriptors through grid constant. Grid constant (D61692148) significantly reduces kernel invocation overhead.

Also enables bias for the TMA kernel.

Differential Revision: D61799463

Copy link

netlify bot commented Sep 3, 2024

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit 76d37d7
🔍 Latest deploy log https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/66d79723a20da00008f525e1
😎 Deploy Preview https://deploy-preview-3066--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D61799463

htyu added a commit to htyu/FBGEMM that referenced this pull request Sep 3, 2024
Summary:
X-link: facebookresearch/FBGEMM#163

Pull Request resolved: pytorch#3066

Improving the TMA kernel by passing the TMA descriptors through grid constant. Grid constant (D61692148) significantly reduces kernel invocation overhead.

Also enables bias for the TMA kernel.

Reviewed By: sfzhu93

Differential Revision: D61799463
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D61799463

Summary:
X-link: facebookresearch/FBGEMM#163

Pull Request resolved: pytorch#3066

Improving the TMA kernel by passing the TMA descriptors through grid constant. Grid constant (D61692148) significantly reduces kernel invocation overhead.

Also enables bias for the TMA kernel.

Reviewed By: sfzhu93

Differential Revision: D61799463
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D61799463

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in 9a0e7b0.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants