
Check tied parameters #1529

Merged — 10 commits merged into huggingface:main on Jun 5, 2023

Conversation

@SunMarc (Member) commented on Jun 5, 2023

What does this PR do?

This PR fixes two issues users can run into when using big model inference:

  • They pass their own device map but forget that parameters tied together must be placed on the same device. We now raise an error showing which parameters should be on the same device (see the sketch right after this list for the idea behind the check).
  • They forget to tie the parameters before calling infer_auto_device_map(), which can produce a bad device_map. We now raise an error asking them to tie the weights before using this function.
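A minimal sketch of the first check (not the PR's exact implementation): find_tied_parameters is the real accelerate helper; the sketch assumes it returns groups of tied parameter names as a list of lists, which is the format in recent accelerate versions. Each group is resolved against the device map, and an error is raised if a group spans more than one device.

from accelerate.utils import find_tied_parameters

def check_tied_params_on_same_device(model, device_map):
    # find_tied_parameters returns groups of parameter names that share storage,
    # e.g. [["lm_head.weight", "model.decoder.embed_tokens.weight"]] for OPT.
    for group in find_tied_parameters(model):
        devices = set()
        for name in group:
            # A parameter is covered by the longest device_map key that prefixes it.
            matches = [k for k in device_map if name == k or name.startswith(k + ".")]
            if matches:
                devices.add(device_map[max(matches, key=len)])
        if len(devices) > 1:
            raise ValueError(f"Tied parameters {group} should be on the same device, got {devices}.")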

How to test it

Issue 1

from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "facebook/opt-350m"

# Works: every module, including the tied pair (lm_head and embed_tokens), is on the same device.
device_map_works = {
    "model.decoder.embed_tokens": "cpu",
    "model.decoder.embed_positions": "cpu",
    "model.decoder.project_out": "cpu",
    "model.decoder.project_in": "cpu",
    "model.decoder.layers": "cpu",
    "lm_head": "cpu",
}

# Does not work: lm_head is offloaded to disk while the embedding it is tied to stays on cpu.
device_map_does_not_work = {
    "model.decoder.embed_tokens": "cpu",
    "model.decoder.embed_positions": "cpu",
    "model.decoder.project_out": "cpu",
    "model.decoder.project_in": "cpu",
    "model.decoder.layers": "cpu",
    "lm_head": "disk",
}

# With this PR, the following call raises an error pointing at the tied
# parameters that ended up on different devices.
model = AutoModelForCausalLM.from_pretrained(
    checkpoint,
    device_map=device_map_does_not_work,
    offload_folder="offload",
    offload_state_dict=True,
)
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
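To see which parameters are tied for this checkpoint, you can load the model with the working map and inspect it (the exact return format of find_tied_parameters depends on the accelerate version):

from accelerate.utils import find_tied_parameters

model = AutoModelForCausalLM.from_pretrained(checkpoint, device_map=device_map_works)
print(find_tied_parameters(model))
# Expected along the lines of: [['lm_head.weight', 'model.decoder.embed_tokens.weight']],
# which is why putting lm_head on "disk" while embed_tokens stays on "cpu" is rejected.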

Issue 2

from transformers import AutoConfig, AutoModelForCausalLM
from accelerate import init_empty_weights, infer_auto_device_map

checkpoint = "facebook/opt-350m"

config = AutoConfig.from_pretrained(checkpoint)
with init_empty_weights():
    model = AutoModelForCausalLM.from_config(config)

# The weights are not tied at this point, so the device map could wrongly split
# tied parameters; with this PR the next call raises an error asking to tie them first.
device_map = infer_auto_device_map(model, no_split_module_classes=["OPTDecoderLayer"])
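The fix on the user side is to tie the weights before inferring the device map; tie_weights() is the standard transformers method for this:

with init_empty_weights():
    model = AutoModelForCausalLM.from_config(config)
model.tie_weights()  # make sure tied parameters are tied before computing the device map
device_map = infer_auto_device_map(model, no_split_module_classes=["OPTDecoderLayer"])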

@SunMarc requested a review from @sgugger on Jun 5, 2023 at 15:16
@HuggingFaceDocBuilderDev commented on Jun 5, 2023

The documentation is not available anymore as the PR was closed or merged.

@sgugger (Collaborator) left a comment

Thanks for those QOL improvements!

8 resolved review comments on src/accelerate/utils/modeling.py (outdated)
SunMarc and others added commits on Jun 5, 2023:

  • Fix log (Co-authored-by: Sylvain Gugger <[email protected]>)
  • Fix comments and tests
  • Fix description

Review thread on src/accelerate/utils/modeling.py:
has_tied_encoder_decoder = False
has_tied_module = False

if transformers.modeling_utils.PreTrainedModel in inspect.getmro(model.__class__):
@sgugger (Collaborator) commented:

I was thinking of testing the class __name__ to avoid the extra dependency on Transformers.
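A hedged sketch of that suggestion (illustrative, not the merged code): compare class names along the MRO instead of importing transformers.

import inspect

# Avoids a hard dependency on transformers: check by class name instead of class identity.
if any(cls.__name__ == "PreTrainedModel" for cls in inspect.getmro(model.__class__)):
    ...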

@SunMarc merged commit b9628f1 into huggingface:main on Jun 5, 2023
@SunMarc deleted the check_tied_parameters branch on Jun 5, 2023 at 19:19