Fix bf16 support issues #2238
Conversation
This pull request was exported from Phabricator. Differential Revision: D52438898
Re-export?
Summary:
- Switch to the HIP-related TARGETS (with the `_hip` suffix) when the AMD GPU build is used.
- Add `supports_python_dlopen = True,` to support dlopen on the related deps.
- Add missing deps such as `"//deeplearning/fbgemm/fbgemm_gpu:split_table_batched_embeddings_hip",`.

Reviewed By: q10, zoranzhao

Differential Revision: D52435932
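As a rough illustration of the TARGETS change described above: the rule name and target name below are hypothetical, and only `supports_python_dlopen` and the `_hip` dep come from the summary.

```python
# Hypothetical BUCK sketch (not the actual FBGEMM build file).
# Only `supports_python_dlopen` and the `_hip` dep are from the summary;
# the rule name and target name are placeholders.
cpp_library(
    name = "embedding_ops",  # hypothetical target
    supports_python_dlopen = True,  # allow dlopen on the related deps
    deps = [
        # Under an AMD GPU build, depend on the HIP variants (_hip suffix):
        "//deeplearning/fbgemm/fbgemm_gpu:split_table_batched_embeddings_hip",
    ],
)
```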
Summary: For BF16-related CUDA code, we use the following macro to distinguish between V100 and A100 (pre-A100 NVIDIA GPUs don't support BF16):

```cpp
#if !( \
    ((defined(CUDA_VERSION) && CUDA_VERSION < 11000) || \
     (defined(__CUDA_ARCH__) && (__CUDA_ARCH__ < 800))))
```

For AMD GPUs (ROCm), this condition always evaluates to false, disabling BF16. However, the MI250 / MI300 GPUs we have in house do support BF16, so we re-enable BF16 for the ROCm-related usages.

Reviewed By: houseroad, jiawenliu64

Differential Revision: D52438898
Force-pushed cb7153d to c689c7b (Compare)
Force-pushed c689c7b to 90ecc97 (Compare)
Force-pushed 90ecc97 to 0f82766 (Compare)
This pull request has been merged in 9cd944a.