Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[PHI] Fix performance issue in bilinear interpolation's backward kernel. #68541

Merged
merged 4 commits into from
Oct 10, 2024
Merged

[PHI] Fix performance issue in bilinear interpolation's backward kernel. #68541

merged 4 commits into from
Oct 10, 2024

Conversation

A-nnonymous
Copy link
Contributor

PR Category

Performance Optimization

PR Types

Performance

Description

Due to the excessively dense atomic operations used in the original backpropagation operator for bilinear interpolation, the performance deteriorates drastically under high parallelism caused by atomic operation contention. Through extensive quantitative analysis, this work designs a high-performance backpropagation operator upon the base of original operator that circumvents atomic operations and activates it under a certain threshold.

In typical cases, the optimized operator performs more than 3x better than the original operator, and its speedup ratio increases with the interpolation factor.

pcard-76996

Copy link

paddle-bot bot commented Sep 30, 2024

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@paddle-bot paddle-bot bot added the contributor External developers label Sep 30, 2024
Copy link

paddle-ci-bot bot commented Oct 8, 2024

Sorry to inform you that c4a214d's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.

@phlrain phlrain merged commit cbc9380 into PaddlePaddle:develop Oct 10, 2024
26 of 27 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
contributor External developers
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants