Named Parameters changing for same quantized model if loaded from HuggingFace or AutoAWQ #380
Unanswered
imohitmayank
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
While experimenting with LLaMa-2 model, I found that if I load base or AWQ model using HF or AutoAWQ code, the model load size and number of named parameters are different. Just wanted to understand why that is so. Below are the code and output.
Case 1: Base model for reference.
Case 2: Loading AWQ model with HF
Case 3: Loading AWQ model with AutoAWQ
My questions are,
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions