enable weight only quantization for language modeling #1053

violetch24 · 2023-06-30T08:05:43Z

Type of Change

Enable weight-only quantization for language modeling.
Model: gpt-j-6B
API: not changed
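For readers unfamiliar with the term, here is a minimal sketch of what weight-only quantization does to a single row of weights. It is an illustration only and does not use the Neural Compressor API. The PR does not state which algorithm or bit width is used, so the symmetric round-to-nearest scheme, the 4-bit default, the group size, and the function names (`quantize_group`, `quantize_row`, `dequantize_row`) are all assumptions made for this example.

```python
from typing import List, Tuple

# Hypothetical helper names; not part of Neural Compressor.
QuantGroup = Tuple[List[int], float]  # (integer codes, scale)


def quantize_group(weights: List[float], bits: int = 4) -> QuantGroup:
    """Symmetric round-to-nearest quantization of one group of weights."""
    qmax = 2 ** (bits - 1) - 1          # e.g. 7 for 4-bit signed integers
    max_abs = max(abs(w) for w in weights)
    # One scale per group; fall back to 1.0 when the group is all zeros.
    scale = max_abs / qmax if max_abs > 0 else 1.0
    codes = [max(-qmax - 1, min(qmax, round(w / scale))) for w in weights]
    return codes, scale


def quantize_row(row: List[float], group_size: int = 4, bits: int = 4) -> List[QuantGroup]:
    """Split a weight row into fixed-size groups and quantize each group."""
    return [
        quantize_group(row[start:start + group_size], bits)
        for start in range(0, len(row), group_size)
    ]


def dequantize_row(groups: List[QuantGroup]) -> List[float]:
    """Rebuild floating-point weights; activations stay in floating point."""
    restored: List[float] = []
    for codes, scale in groups:
        restored.extend(code * scale for code in codes)
    return restored
```

Only the weights are stored as low-bit integers plus one scale per group. They are dequantized before the matrix multiply, so activations are never quantized. Under this scheme the reconstruction error of each weight is at most half of its group's scale.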

Description

Expected Behavior & Potential Risk

How has this PR been tested?

Tested locally.

Dependency Change?

Signed-off-by: Cheng, Zixuan <[email protected]>

…al-compressor into zixuan/weight_only_lm

Signed-off-by: Cheng, Zixuan <[email protected]>

.../pytorch/nlp/huggingface_models/language-modeling/quantization/ptq_weight_only/fx/run_clm.py

...s/pytorch/nlp/huggingface_models/language-modeling/quantization/ptq_weight_only/fx/README.md

xin3he · 2023-07-03T01:30:15Z

Please trigger the extension test.

violetch24 and others added 2 commits June 30, 2023 16:04

enable weight only quantization for language modeling

30ccaf1

Signed-off-by: Cheng, Zixuan <[email protected]>

Update README.md

6e90724

violetch24 requested a review from xin3he June 30, 2023 08:08

violetch24 added 7 commits June 30, 2023 16:20

add to config

d738627

Signed-off-by: Cheng, Zixuan <[email protected]>

Merge branch 'zixuan/weight_only_lm' of https://github.com/intel/neur…

e81d46c

…al-compressor into zixuan/weight_only_lm

minor fix

9410566

Signed-off-by: Cheng, Zixuan <[email protected]>

edit dir

98ad709

Signed-off-by: Cheng, Zixuan <[email protected]>

minor fix

7c53980

Signed-off-by: Cheng, Zixuan <[email protected]>

Merge branch 'master' into zixuan/weight_only_lm

1280173

minor fix

3ef47cc

Signed-off-by: Cheng, Zixuan <[email protected]>

xin3he approved these changes Jul 3, 2023

.../pytorch/nlp/huggingface_models/language-modeling/quantization/ptq_weight_only/fx/run_clm.py (outdated)

...s/pytorch/nlp/huggingface_models/language-modeling/quantization/ptq_weight_only/fx/README.md (outdated)

chensuyue added examples extension test labels Jul 3, 2023

chensuyue added this to the v2.3 milestone Jul 3, 2023

chensuyue merged commit 4b24be1 into master Jul 4, 2023

chensuyue deleted the zixuan/weight_only_lm branch July 4, 2023 09:28
