support fused_mutli_transformer_xpu_int8 #58902

NALLEIN · 2023-11-10T06:04:49Z

PR types

New features

PR changes

OPs

Description

Support fused_multi_transformer_ptq op for inference, optimize the performance of beam_search decode by optimizing the gather operator.

paddle/phi/kernels/fusion/xpu/fused_multi_transformer_int8_xpu_kernel.cc

zhiqiu

LGTM for const_cast

zhupengyang

LGTM

XiaoguangHu01

LGTM

NALLEIN changed the title ~~Ernie ptq~~ support fused_mutli_transformer_xpu_int8 Nov 10, 2023

NALLEIN force-pushed the ernie_ptq branch from fb29cea to e90c794 Compare November 10, 2023 09:11

paddle-bot bot added the contributor External developers label Nov 10, 2023

support fused_mt_int8_xpu

f8f23b4

NALLEIN force-pushed the ernie_ptq branch from e90c794 to f8f23b4 Compare November 14, 2023 08:20

NALLEIN added 2 commits November 22, 2023 11:41

add pass tester

be62dfd

fix codestyle

bae5236

NALLEIN force-pushed the ernie_ptq branch from b824b5c to bae5236 Compare November 23, 2023 03:35

zhupengyang reviewed Nov 23, 2023

View reviewed changes

paddle/phi/kernels/fusion/xpu/fused_multi_transformer_int8_xpu_kernel.cc Outdated Show resolved Hide resolved

paddle/phi/kernels/fusion/xpu/fused_multi_transformer_int8_xpu_kernel.cc Outdated Show resolved Hide resolved

delete debug message

28422d9

zhiqiu approved these changes Nov 24, 2023

View reviewed changes

zhupengyang approved these changes Nov 24, 2023

View reviewed changes

XiaoguangHu01 approved these changes Nov 24, 2023

View reviewed changes

From00 approved these changes Nov 24, 2023

View reviewed changes

zhupengyang merged commit 959f0c2 into PaddlePaddle:develop Nov 24, 2023

SecretXV pushed a commit to SecretXV/Paddle that referenced this pull request Nov 28, 2023

support fused_mutli_transformer_xpu_int8 (PaddlePaddle#58902)

98cb469

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support fused_mutli_transformer_xpu_int8 #58902

support fused_mutli_transformer_xpu_int8 #58902

NALLEIN commented Nov 10, 2023

zhiqiu left a comment

zhupengyang left a comment

XiaoguangHu01 left a comment

support fused_mutli_transformer_xpu_int8 #58902

support fused_mutli_transformer_xpu_int8 #58902

Conversation

NALLEIN commented Nov 10, 2023

PR types

PR changes

Description

zhiqiu left a comment

Choose a reason for hiding this comment

zhupengyang left a comment

Choose a reason for hiding this comment

XiaoguangHu01 left a comment

Choose a reason for hiding this comment