forked from pytorch/FBGEMM
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Enable sequence TBE CPU via AVX (pytorch#2195)
Summary: Instead of using the ref implementation for sequence embedding on CPU, this diff directs TBE to invoke the AVX implementation by forcing pooling factors of 1 (i.e., passing `at::arange(index_size)` as offfsets). The performance gained from using the AVX implementation offsets the overhead incurs in creating the new offsets. Differential Revision: D51918878
- Loading branch information
1 parent
a75b43f
commit c2f64ec
Showing
2 changed files
with
38 additions
and
83 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters