New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Recompute linear_cache_indices for pipeline prefetching #2147

Closed

sryap wants to merge 1 commit into pytorch:main from sryap:export-D50983176

Contributor

sryap commented Nov 21, 2023

Summary:
When pipeline prefetching is enabled (prefetch_pipeline=True) for
EmbeddingLocation.MANAGED_CACHING, TBE has to update
lxu_cache_locations to ensure cache consistency before the backward
pass. The lxu_cache_locations update requires
linear_cache_indices as an input. Prior to this diff, TBE keeps
linear_cache_indices alive after prefetching until the tensor is
used for the lxu_cache_locations update. This puts a lot of
pressure to the memory space requirement limiting the enablement of
pipeline prefetching for some models. This diff addresses the memory
limitation issue by recomputing linear_cache_indices when it is
needed.

Differential Revision: D50983176

facebook-github-bot added the cla signed label

Contributor

facebook-github-bot commented Nov 21, 2023

This pull request was exported from Phabricator. Differential Revision: D50983176

facebook-github-bot added the fb-exported label

netlify bot commented Nov 21, 2023 •

edited

Loading

✅ Deploy Preview for pytorch-fbgemm-docs canceled.

Name	Link
🔨 Latest commit	`6c8e0a6`
🔍 Latest deploy log	https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/655c2d1264ff02000898e20a

sryap pushed a commit to sryap/FBGEMM that referenced this pull request


          Recompute linear_cache_indices for pipeline prefetching (pytorch#2147)

26da350

Summary:

When pipeline prefetching is enabled (`prefetch_pipeline=True`) for
`EmbeddingLocation.MANAGED_CACHING`, TBE has to update
`lxu_cache_locations` to ensure cache consistency before the backward
pass.  The `lxu_cache_locations` update requires
`linear_cache_indices` as an input.  Prior to this diff, TBE keeps
`linear_cache_indices` alive after prefetching until the tensor is
used for the `lxu_cache_locations` update.  This puts a lot of
pressure to the memory space requirement limiting the enablement of
pipeline prefetching for some models.  This diff addresses the memory
limitation issue by recomputing `linear_cache_indices` when it is
needed.

Differential Revision: D50983176

sryap force-pushed the export-D50983176 branch from e93e108 to 26da350 Compare

November 21, 2023 01:58

Contributor

facebook-github-bot commented Nov 21, 2023

This pull request was exported from Phabricator. Differential Revision: D50983176

1 similar comment

Contributor

facebook-github-bot commented Nov 21, 2023

This pull request was exported from Phabricator. Differential Revision: D50983176


          Recompute linear_cache_indices for pipeline prefetching (pytorch#2147)

6c8e0a6

Summary:

When pipeline prefetching is enabled (`prefetch_pipeline=True`) for
`EmbeddingLocation.MANAGED_CACHING`, TBE has to update
`lxu_cache_locations` to ensure cache consistency before the backward
pass.  The `lxu_cache_locations` update requires
`linear_cache_indices` as an input.  Prior to this diff, TBE keeps
`linear_cache_indices` alive after prefetching until the tensor is
used for the `lxu_cache_locations` update.  This puts a lot of
pressure to the memory space requirement limiting the enablement of
pipeline prefetching for some models.  This diff addresses the memory
limitation issue by recomputing `linear_cache_indices` when it is
needed.

Reviewed By: jspark1105

Differential Revision: D50983176

sryap added a commit to sryap/FBGEMM that referenced this pull request


          Recompute linear_cache_indices for pipeline prefetching (pytorch#2147)

6fbcef2

Summary:

When pipeline prefetching is enabled (`prefetch_pipeline=True`) for
`EmbeddingLocation.MANAGED_CACHING`, TBE has to update
`lxu_cache_locations` to ensure cache consistency before the backward
pass.  The `lxu_cache_locations` update requires
`linear_cache_indices` as an input.  Prior to this diff, TBE keeps
`linear_cache_indices` alive after prefetching until the tensor is
used for the `lxu_cache_locations` update.  This puts a lot of
pressure to the memory space requirement limiting the enablement of
pipeline prefetching for some models.  This diff addresses the memory
limitation issue by recomputing `linear_cache_indices` when it is
needed.

Reviewed By: jspark1105

Differential Revision: D50983176

sryap force-pushed the export-D50983176 branch from 26da350 to 6fbcef2 Compare

November 21, 2023 04:06

Contributor

facebook-github-bot commented Nov 21, 2023

This pull request was exported from Phabricator. Differential Revision: D50983176

sryap added a commit to sryap/FBGEMM that referenced this pull request


          Recompute linear_cache_indices for pipeline prefetching (pytorch#2147)

9ce5068

Summary:

When pipeline prefetching is enabled (`prefetch_pipeline=True`) for
`EmbeddingLocation.MANAGED_CACHING`, TBE has to update
`lxu_cache_locations` to ensure cache consistency before the backward
pass.  The `lxu_cache_locations` update requires
`linear_cache_indices` as an input.  Prior to this diff, TBE keeps
`linear_cache_indices` alive after prefetching until the tensor is
used for the `lxu_cache_locations` update.  This puts a lot of
pressure to the memory space requirement limiting the enablement of
pipeline prefetching for some models.  This diff addresses the memory
limitation issue by recomputing `linear_cache_indices` when it is
needed.

Reviewed By: jspark1105

Differential Revision: D50983176

sryap force-pushed the export-D50983176 branch from 6fbcef2 to 9ce5068 Compare

November 21, 2023 04:06

Contributor

facebook-github-bot commented Nov 21, 2023

This pull request was exported from Phabricator. Differential Revision: D50983176

sryap added a commit to sryap/FBGEMM that referenced this pull request


          Recompute linear_cache_indices for pipeline prefetching (pytorch#2147)

693bb6f

Summary:

When pipeline prefetching is enabled (`prefetch_pipeline=True`) for
`EmbeddingLocation.MANAGED_CACHING`, TBE has to update
`lxu_cache_locations` to ensure cache consistency before the backward
pass.  The `lxu_cache_locations` update requires
`linear_cache_indices` as an input.  Prior to this diff, TBE keeps
`linear_cache_indices` alive after prefetching until the tensor is
used for the `lxu_cache_locations` update.  This puts a lot of
pressure to the memory space requirement limiting the enablement of
pipeline prefetching for some models.  This diff addresses the memory
limitation issue by recomputing `linear_cache_indices` when it is
needed.

Reviewed By: jspark1105

Differential Revision: D50983176

sryap force-pushed the export-D50983176 branch from 9ce5068 to 693bb6f Compare

November 21, 2023 04:07

Contributor

facebook-github-bot commented Nov 21, 2023

This pull request was exported from Phabricator. Differential Revision: D50983176

sryap force-pushed the export-D50983176 branch from 693bb6f to 6c8e0a6 Compare

November 21, 2023 04:07

Contributor

facebook-github-bot commented Nov 21, 2023

This pull request was exported from Phabricator. Differential Revision: D50983176

facebook-github-bot closed this in

37111f5

facebook-github-bot added the Merged label

Contributor

facebook-github-bot commented Nov 21, 2023

This pull request has been merged in 37111f5.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cla signed fb-exported Merged