FBGEMM CK FP8 Optimization for BS > 1 #2940

jwfromm · 2024-08-06T20:27:52Z

Summary: This diff adds better kernel optimization for larger batch sizes in llama shapes.

Reviewed By: jianyuh, mxz297

Differential Revision: D60680651

facebook-github-bot · 2024-08-06T20:28:14Z

This pull request was exported from Phabricator. Differential Revision: D60680651

netlify · 2024-08-06T20:30:20Z

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Name	Link
🔨 Latest commit	`3dc9893`
🔍 Latest deploy log	https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/66b398f2543ae30008e03fb1
😎 Deploy Preview	https://deploy-preview-2940--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

facebook-github-bot · 2024-08-06T20:32:18Z

This pull request was exported from Phabricator. Differential Revision: D60680651

facebook-github-bot · 2024-08-06T21:46:29Z

This pull request was exported from Phabricator. Differential Revision: D60680651

Summary: Pull Request resolved: pytorch#2940 X-link: facebookresearch/FBGEMM#42 This diff adds better kernel optimization for larger batch sizes in llama shapes. Reviewed By: jianyuh, mxz297 Differential Revision: D60680651

facebook-github-bot · 2024-08-06T21:51:15Z

This pull request was exported from Phabricator. Differential Revision: D60680651

Summary: Pull Request resolved: pytorch#2940 X-link: facebookresearch/FBGEMM#42 This diff adds better kernel optimization for larger batch sizes in llama shapes. Reviewed By: jianyuh, mxz297 Differential Revision: D60680651

facebook-github-bot · 2024-08-07T15:55:24Z

This pull request was exported from Phabricator. Differential Revision: D60680651

facebook-github-bot · 2024-08-07T19:13:20Z

This pull request has been merged in 0ebb3ae.

facebook-github-bot added the cla signed label Aug 6, 2024

facebook-github-bot added the fb-exported label Aug 6, 2024

jwfromm force-pushed the export-D60680651 branch from 399045c to 782c05e Compare August 6, 2024 20:32

jwfromm force-pushed the export-D60680651 branch from 782c05e to c176d2c Compare August 6, 2024 21:46

jwfromm force-pushed the export-D60680651 branch from c176d2c to 42b3fcd Compare August 6, 2024 21:51

FBGEMM CK FP8 Optimization for BS > 1 (pytorch#2940)

3dc9893

Summary: Pull Request resolved: pytorch#2940 X-link: facebookresearch/FBGEMM#42 This diff adds better kernel optimization for larger batch sizes in llama shapes. Reviewed By: jianyuh, mxz297 Differential Revision: D60680651

jwfromm force-pushed the export-D60680651 branch from 42b3fcd to 3dc9893 Compare August 7, 2024 15:55

facebook-github-bot closed this in 0ebb3ae Aug 7, 2024

facebook-github-bot added the Merged label Aug 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FBGEMM CK FP8 Optimization for BS > 1 #2940

FBGEMM CK FP8 Optimization for BS > 1 #2940

jwfromm commented Aug 6, 2024

facebook-github-bot commented Aug 6, 2024

netlify bot commented Aug 6, 2024 •

edited

Loading

facebook-github-bot commented Aug 6, 2024

facebook-github-bot commented Aug 6, 2024

facebook-github-bot commented Aug 6, 2024

facebook-github-bot commented Aug 7, 2024

facebook-github-bot commented Aug 7, 2024

FBGEMM CK FP8 Optimization for BS > 1 #2940

FBGEMM CK FP8 Optimization for BS > 1 #2940

Conversation

jwfromm commented Aug 6, 2024

facebook-github-bot commented Aug 6, 2024

netlify bot commented Aug 6, 2024 • edited Loading

✅ Deploy Preview for pytorch-fbgemm-docs ready!

facebook-github-bot commented Aug 6, 2024

facebook-github-bot commented Aug 6, 2024

facebook-github-bot commented Aug 6, 2024

facebook-github-bot commented Aug 7, 2024

facebook-github-bot commented Aug 7, 2024

netlify bot commented Aug 6, 2024 •

edited

Loading