Fix: Resolve #3060, `preload_module_classes` is lost for nested modules #3248

wejoncy · 2024-11-22T02:54:55Z

What does this PR do?

We ran into this issue when try to load VPTQ models by multiple devices, the VQLinear layer has embending as its sub-module. This function should move all constant buffers/parameters to the corresponding devices from Meta device.
So add_hook_to_module should pass preload_module_classes to the next recursive calling for the nested modules.

https://github.com/microsoft/VPTQ/blob/ac7258f461e214ca705f5895513f314576750528/vptq/layers/vqlinear.py#L160C18-L160C18

This PR is to fix #3060

Big modeling: @SunMarc

SunMarc

LGTM ! Thanks for fixing ! Could you share which specific issue you where having ? Also if you can implement a test, it would be nice but not needed to merged this PR. You can fix the CI with make style

HuggingFaceDocBuilderDev · 2024-11-22T15:16:02Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

muellerzr

agreed with Marc, some more details to this issue and adding a test is needed.

wejoncy · 2024-11-26T05:32:03Z

Hi @muellerzr and @SunMarc Thanks for your review. I have briefly described the issue in overview of this PR.
And I will try to construct such a test case.

tests/test_accelerator.py

SunMarc · 2024-11-26T13:58:05Z

tests/test_accelerator.py

@@ -762,3 +762,64 @@ def test_save_model_with_stateful_dataloader(self, use_safetensors, tied_weights
            assert torch.allclose(original_linear1, new_linear1)
            assert torch.allclose(original_batchnorm, new_batchnorm)
            assert torch.allclose(original_linear2, new_linear2)
+
+    @require_cuda


we need a require_transformers decorator if you are going to import transformers here. Otherwise, we can try to not use it and just save the model with

checkpoint = os.path.join(tmp_dir, "pt_model.bin") torch.save(model.state_dict(), checkpoint)

resolve 3060

9a9b0fa

wejoncy mentioned this pull request Nov 22, 2024

preload_module_classes is lost in attach_execution_device_hook. #3060

Open

4 tasks

wejoncy changed the title ~~resolve 3060~~ Fix: Resolve #3060 Nov 22, 2024

SunMarc approved these changes Nov 22, 2024

View reviewed changes

SunMarc requested a review from muellerzr November 22, 2024 15:16

muellerzr reviewed Nov 22, 2024

View reviewed changes

wejoncy and others added 2 commits November 26, 2024 13:21

Merge branch 'huggingface:main' into main

532f328

format

1078cd2

wejoncy changed the title ~~Fix: Resolve #3060~~ Fix: Resolve #3060, pre is lost for nested modules Nov 26, 2024

wejoncy changed the title ~~Fix: Resolve #3060, pre is lost for nested modules~~ Fix: Resolve #3060, preload_module_classes is lost for nested modules Nov 26, 2024

wejoncy changed the title ~~Fix: Resolve #3060, preload_module_classes is lost for nested modules~~ Fix: Resolve #3060, preload_module_classes is lost for nested modules Nov 26, 2024

wejoncy added 2 commits November 26, 2024 14:58

add tests

3213231

fix

a0a847c

SunMarc reviewed Nov 26, 2024

View reviewed changes

tests/test_accelerator.py Outdated Show resolved Hide resolved

SunMarc reviewed Nov 26, 2024

View reviewed changes

fix

2ebc1ab

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: Resolve #3060, `preload_module_classes` is lost for nested modules #3248

Fix: Resolve #3060, `preload_module_classes` is lost for nested modules #3248

wejoncy commented Nov 22, 2024 •

edited

Loading

SunMarc left a comment •

edited

Loading

HuggingFaceDocBuilderDev commented Nov 22, 2024

muellerzr left a comment

wejoncy commented Nov 26, 2024 •

edited

Loading

SunMarc Nov 26, 2024 •

edited

Loading

Fix: Resolve #3060, preload_module_classes is lost for nested modules #3248

Are you sure you want to change the base?

Fix: Resolve #3060, preload_module_classes is lost for nested modules #3248

Conversation

wejoncy commented Nov 22, 2024 • edited Loading

What does this PR do?

SunMarc left a comment • edited Loading

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Nov 22, 2024

muellerzr left a comment

Choose a reason for hiding this comment

wejoncy commented Nov 26, 2024 • edited Loading

SunMarc Nov 26, 2024 • edited Loading

Choose a reason for hiding this comment

Fix: Resolve #3060, `preload_module_classes` is lost for nested modules #3248

Fix: Resolve #3060, `preload_module_classes` is lost for nested modules #3248

wejoncy commented Nov 22, 2024 •

edited

Loading

SunMarc left a comment •

edited

Loading

wejoncy commented Nov 26, 2024 •

edited

Loading

SunMarc Nov 26, 2024 •

edited

Loading