Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Skip fuse_mm_elementwise fusion with model output in the middle #1000

Closed
wants to merge 1 commit into from

Conversation

aakhundov
Copy link
Contributor

Summary: fuse_mm_elementwise transformation fuses linear patterns into single ops. As a result, all intermediate outputs in the pattern are eliminated. In a special case when one or more of those intermediate outputs are model outputs, this leads to those model outputs vanishing after the fusion. Here we add skipping the fusion when one of the intermediate outputs in the detected pattern is a model output.

Differential Revision: D56340320

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 19, 2024
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D56340320

aakhundov added a commit to aakhundov/AITemplate-1 that referenced this pull request Apr 19, 2024
…bookincubator#1000)

Summary:

`fuse_mm_elementwise` transformation fuses linear patterns into single ops. As a result, all intermediate outputs in the pattern are eliminated. In a special case when one or more of those intermediate outputs are model outputs, this leads to those model outputs vanishing after the fusion. Here we add skipping the fusion when one of the intermediate outputs in the detected pattern is a model output.

Differential Revision: D56340320
aakhundov added a commit to aakhundov/AITemplate-1 that referenced this pull request Apr 19, 2024
…bookincubator#1000)

Summary:

`fuse_mm_elementwise` transformation fuses linear patterns into single ops. As a result, all intermediate outputs in the pattern are eliminated. In a special case when one or more of those intermediate outputs are model outputs, this leads to those model outputs vanishing after the fusion. Here we add skipping the fusion when one of the intermediate outputs in the detected pattern is a model output.

Differential Revision: D56340320
aakhundov added a commit to aakhundov/AITemplate-1 that referenced this pull request Apr 19, 2024
…bookincubator#1000)

Summary:

`fuse_mm_elementwise` transformation fuses linear patterns into single ops. As a result, all intermediate outputs in the pattern are eliminated. In a special case when one or more of those intermediate outputs are model outputs, this leads to those model outputs vanishing after the fusion. Here we add skipping the fusion when one of the intermediate outputs in the detected pattern is a model output.

Reviewed By: mengyingdu

Differential Revision: D56340320
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D56340320

aakhundov added a commit to aakhundov/AITemplate-1 that referenced this pull request Apr 19, 2024
…bookincubator#1000)

Summary:

`fuse_mm_elementwise` transformation fuses linear patterns into single ops. As a result, all intermediate outputs in the pattern are eliminated. In a special case when one or more of those intermediate outputs are model outputs, this leads to those model outputs vanishing after the fusion. Here we add skipping the fusion when one of the intermediate outputs in the detected pattern is a model output.

Reviewed By: mengyingdu

Differential Revision: D56340320
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D56340320

…bookincubator#1000)

Summary:

`fuse_mm_elementwise` transformation fuses linear patterns into single ops. As a result, all intermediate outputs in the pattern are eliminated. In a special case when one or more of those intermediate outputs are model outputs, this leads to those model outputs vanishing after the fusion. Here we add skipping the fusion when one of the intermediate outputs in the detected pattern is a model output.

Reviewed By: mengyingdu

Differential Revision: D56340320
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D56340320

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in eb89bf6.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants