PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs #1107
joseph777111 started this conversation in Ideas
https://arxiv.org/abs/2410.05265
https://github.com/ChenMnZ/PrefixQuant
PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs (Mengzhao Chen, Yi Liu, Jiahao Wang, Yi Bin, Wenqi Shao, Ping Luo)
ABSTRACT: