
Support BFloat16 in convolution_backward #7807

Merged: swolchok merged 1 commit into main from gh/swolchok/158/head on Jan 23, 2025.

Conversation

swolchok (Contributor):

Partial fix for #7748.

[ghstack-poisoned]
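The diff itself is not reproduced in this extract. In general, adding a dtype to an ExecuTorch portable kernel amounts to widening the kernel's dtype dispatch; below is a minimal sketch, assuming an ET_SWITCH-style dispatch macro covering float, Half, and BFloat16. The macro and operator names here are illustrative, not taken from the actual diff.

```cpp
// Illustrative sketch only; the real convolution_backward change may differ.
// The idea: dispatch over a type set that now includes BFloat16, so the
// same templated kernel body is instantiated for the new dtype.
ET_SWITCH_FLOATHBF16_TYPES(
    input.scalar_type(), ctx, "convolution_backward.out", CTYPE, [&] {
      // Compute grad_input, grad_weight, and grad_bias with CTYPE as the
      // element type; BFloat16 takes the same path as float and Half.
    });
```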
swolchok (Contributor, Author) commented on Jan 21, 2025:

Stack from ghstack (oldest at bottom):

pytorch-bot commented on Jan 21, 2025:

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7807

Note: Links to docs will display an error until the docs builds have completed.

❌ 1 New Failure

As of commit 11f4d2d with merge base 466d98f, one new job failure was reported.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot added the CLA Signed label on Jan 21, 2025.

swolchok added a commit that referenced this pull request on Jan 21, 2025:
Partial fix for #7748.

ghstack-source-id: f51e6f2acd84901aaf7a997658a7d02c93b958e7
ghstack-comment-id: 2605752234
Pull Request resolved: #7807
swolchok added the release notes: ops & kernels label on Jan 21, 2025.
Review thread on the updated tolerance handling in the test:

```cpp
auto expected_grad_weight = tf.make({4, 3, 4, 2}, expected_grad_weight_data);
auto expected_grad_bias = tf.make({4}, expected_grad_bias_data);
if (DTYPE == ScalarType::Half || DTYPE == ScalarType::BFloat16) {
  EXPECT_TENSOR_CLOSE_WITH_TOL(grad_input, expected_grad_input, 1e-2, 1e-8);
```
Contributor:

Why not use defaults here? EXPECT_TENSOR_CLOSE_WITH_TOL should apply the right tolerance given the type.

swolchok (Contributor, Author):

Because the default rtol is 1e-5; rtol and atol are different.
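The distinction matters because the usual element-wise closeness formula combines both tolerances: a value passes when |actual - expected| <= atol + rtol * |expected|. Here is a minimal sketch of that conventional check; the exact formula EXPECT_TENSOR_CLOSE_WITH_TOL applies lives in ExecuTorch's testing utilities and may differ in detail.

```cpp
#include <cmath>

// Conventional closeness test: atol bounds the absolute error (dominant
// near zero), while rtol scales with the magnitude of the expected value.
// Because rtol grows with |expected|, raising atol alone cannot compensate
// for large-magnitude values, which is why the two are not interchangeable.
bool is_close(double actual, double expected, double rtol, double atol) {
  return std::fabs(actual - expected) <= atol + rtol * std::fabs(expected);
}
```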

Contributor:

Right, but in the same way that we have kDefaultHalfAtol and kDefaultBFloat16Atol, I think we should have kDefaultHalfRtol and kDefaultBFloat16Rtol and set them to a proper value. You seem to be using 1e-2 for most of these tests. Why not introduce kDefaultHalfRtol and kDefaultBFloat16Rtol with value 1e-2?
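For illustration, the constants proposed above might look like the sketch below. The names mirror the existing kDefaultHalfAtol and kDefaultBFloat16Atol mentioned in the comment, and the 1e-2 value is the reviewer's suggestion, not something confirmed to exist in the codebase.

```cpp
// Hypothetical companions to kDefaultHalfAtol / kDefaultBFloat16Atol.
// BFloat16 keeps only 8 significand bits, so its relative rounding error
// is on the order of 2^-8 (about 4e-3); an rtol of 1e-2 sits comfortably
// above that, and above Half's roughly 2^-11 (about 5e-4) as well.
constexpr double kDefaultHalfRtol = 1e-2;
constexpr double kDefaultBFloat16Rtol = 1e-2;
```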

swolchok (Contributor, Author):

> Why not introduce kDefaultHalfRtol and kDefaultBFloat16Rtol with value 1e-2?

Because not all operators require the higher rtol.

swolchok (Contributor, Author):

It is not particularly uncommon to need to set rtol in PyTorch core: https://github.com/search?q=repo%3Apytorch%2Fpytorch+%2Frtol%3D%5B1-9%5D%2F&type=code
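To round out the thread, here is a usage sketch of the pattern the discussion settles on, assuming ExecuTorch's EXPECT_TENSOR_CLOSE applies type-appropriate default tolerances as described above; the macro behavior is inferred from this discussion, not verified against the sources.

```cpp
if (DTYPE == ScalarType::Half || DTYPE == ScalarType::BFloat16) {
  // Reduced precision: loosen rtol per test rather than baking 1e-2 into
  // a global default, since not all operators need the higher rtol.
  EXPECT_TENSOR_CLOSE_WITH_TOL(grad_input, expected_grad_input, 1e-2, 1e-8);
} else {
  // Full precision: the defaults (rtol on the order of 1e-5) suffice.
  EXPECT_TENSOR_CLOSE(grad_input, expected_grad_input);
}
```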

swolchok merged commit dabd72f into main on Jan 23, 2025, with 44 of 47 checks passed.

swolchok deleted the gh/swolchok/158/head branch on Jan 23, 2025 at 17:40.
YIWENX14 pushed a commit that referenced this pull request on Jan 28, 2025.
zonglinpeng pushed a commit to zonglinpeng/executorch that referenced this pull request on Jan 30, 2025.
Labels: CLA Signed, release notes: ops & kernels