Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

量化效果问题咨询 #12

Open
wht1712 opened this issue May 26, 2023 · 3 comments
Open

量化效果问题咨询 #12

wht1712 opened this issue May 26, 2023 · 3 comments

Comments

@wht1712
Copy link

wht1712 commented May 26, 2023

您好,我可能对量化这方面的工作还不是很了解,我对您的这个工作有一个简单的疑问。请问为什么您展示的量化后的模型会超过原始全精度模型的指标呢?期待您的回复。

@YanjingLi0202
Copy link
Owner

您好,我可能对量化这方面的工作还不是很了解,我对您的这个工作有一个简单的疑问。请问为什么您展示的量化后的模型会超过原始全精度模型的指标呢?期待您的回复。

量化模型的训练load了pretrained全精度模型,之后再次训练300回合。可以粗略地认为量化模型训练了600回合(2x),所以有可能会超过全精度的效果。

@flymmmfly
Copy link

这个解释未免太过牵强了

@charliezjw
Copy link

这种现象很常见。除了作者所讲述的原因,quantization也起到了regularization的作用。原网络可能overfitting了,所以quantization之后的accuracy会上升一些。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants