
add pile calib, rename quant_block_list to to_quant_block_names #322

Merged: 9 commits into main from rename_quant_block_list, Nov 15, 2024

Conversation

WeiweiZhang1 (Collaborator)

No description provided.

@wenhuach21 self-requested a review on November 14, 2024 at 13:07
@wenhuach21 (Contributor) left a comment:

I haven't carefully read the code yet; if possible, please test more scenarios.

@wenhuach21 self-requested a review on November 14, 2024 at 13:08
@wenhuach21 (Contributor):

Could the fp_layers option also support fuzzy matching?
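
For illustration only, a minimal sketch of what substring-based ("fuzzy") matching could look like. The surrounding names (args, model, layer_config, logger) mirror the snippet quoted in the next comment and are assumptions, not the project's actual implementation:

import torch
import transformers

fp_layers = [p for p in args.fp_layers.split(",") if p]  # drop empty entries
for n, m in model.named_modules():
    if isinstance(m, (torch.nn.Linear, transformers.modeling_utils.Conv1D)):
        # Fuzzy match: keep the layer in 16 bits if any pattern occurs as a
        # substring of its full dotted name, e.g. "mlp" matches
        # "model.layers.0.mlp.down_proj".
        if any(pattern in n for pattern in fp_layers):
            layer_config[n] = {"bits": 16}
            logger.info(f"{n} will not be quantized.")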

@wenhuach21 (Contributor):

fp_layers = args.fp_layers.split(",")
if bool(fp_layers):
    for n, m in model.named_modules():
        if isinstance(m, torch.nn.Linear) or isinstance(m, transformers.modeling_utils.Conv1D):
            name = n.split('.')[-1]
            if n in fp_layers or name in fp_layers:
                layer_config[n] = {"bits": 16}
                logger.info(
                    f"{n} will not be quantized.")

Why is it coded like this, with name = n.split('.')[-1]? How can an exact layer be excluded?
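
To make the concern concrete: because the check also accepts the last name component, listing "q_proj" in fp_layers keeps every q_proj in the model unquantized. A hedged sketch of exact-path matching only (same assumed surrounding names as the quoted snippet; not the PR's actual code):

fp_layers = {p for p in args.fp_layers.split(",") if p}  # note: "".split(",") is [""], so filter empties
for n, m in model.named_modules():
    if isinstance(m, (torch.nn.Linear, transformers.modeling_utils.Conv1D)):
        if n in fp_layers:  # the full dotted module path must match exactly
            layer_config[n] = {"bits": 16}
            logger.info(f"{n} will not be quantized.")

With this, a hypothetical --fp_layers "model.layers.0.self_attn.q_proj" would exclude only that single layer.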

@wenhuach21 merged commit d493572 into main on Nov 15, 2024
12 of 14 checks passed
@wenhuach21 deleted the rename_quant_block_list branch on November 15, 2024 at 01:25