Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

enable weight only quantization for language modeling #1053

Merged
merged 9 commits into from
Jul 4, 2023

Conversation

violetch24
Copy link
Contributor

Type of Change

enable weight only quantization for language modeling
model: gpt-j-6B
API not changed

Description

Expected Behavior & Potential Risk

How has this PR been tested?

local test

Dependency Change?

@violetch24 violetch24 requested a review from xin3he June 30, 2023 08:08
@xin3he
Copy link
Contributor

xin3he commented Jul 3, 2023

Please trigger the extension test.

@chensuyue chensuyue added this to the v2.3 milestone Jul 3, 2023
@chensuyue chensuyue merged commit 4b24be1 into master Jul 4, 2023
@chensuyue chensuyue deleted the zixuan/weight_only_lm branch July 4, 2023 09:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants