change use_optimum_format=True and add bias #1431
Conversation
Signed-off-by: Xin He <[email protected]>
Shall we specify the format, e.g., use_gptq_format? "HF format" sounds too general - what about the AWQ and GGUF formats? People also upload those to HF.
It's actually general: we can generate RTN and AWQ models in this format. GGUF is a different format, which we don't support yet.
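For context, a minimal sketch of how one format flag can cover multiple quantization algorithms in an export path. The function name `export_quantized_model`, its signature, and the `pack` method are illustrative assumptions, not the project's actual API:

```python
import torch.nn as nn

def export_quantized_model(model: nn.Module, use_optimum_format: bool = True) -> nn.Module:
    """Hypothetical sketch: repack quantized linear layers into the
    Optimum-compatible layout. RTN- and AWQ-quantized models can both be
    emitted in this single layout, which is why the flag is named after
    the format rather than after any one algorithm."""
    for module in model.modules():
        # `pack` is an assumed method exposed by quantized linear modules.
        if hasattr(module, "pack"):
            module.pack(use_optimum_format=use_optimum_format)
    return model
```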
/azp run Code-Scan
Azure Pipelines successfully started running 1 pipeline(s).
Type of Change
bug fix
Description
Optimum sets bias=True for QuantLinear when packing a model. For compatibility with this Hugging Face/Optimum format, we follow the same design and set use_optimum_format=True by default. The argument is also renamed from use_hf_format to use_optimum_format.
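To make the compatibility point concrete, here is a minimal sketch of the bias handling described above. `QuantLinearSketch` and its buffer layout are illustrative assumptions, not the actual QuantLinear implementation:

```python
import torch
import torch.nn as nn

class QuantLinearSketch(nn.Module):
    """Illustrative stand-in for an Optimum-style QuantLinear.

    Optimum packs models with bias=True, so the bias buffer is always
    allocated; a layer that originally had no bias simply carries zeros.
    The qweight shape and dtype below are simplified for illustration.
    """

    def __init__(self, in_features: int, out_features: int):
        super().__init__()
        self.register_buffer(
            "qweight", torch.zeros(in_features, out_features, dtype=torch.int32)
        )
        # Always allocate the bias buffer for format compatibility,
        # even when the source layer has no bias.
        self.register_buffer("bias", torch.zeros(out_features, dtype=torch.float32))

    def pack(self, linear: nn.Linear) -> None:
        # Copy the float bias if the source layer has one; otherwise the
        # zero-initialized buffer stands in for the missing bias.
        if linear.bias is not None:
            self.bias.copy_(linear.bias.detach())
```

A loader that expects the Optimum layout can then always read a bias tensor, regardless of whether the original module was configured with one.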
Expected Behavior & Potential Risk
Unit tests pass.