Fix Trainer when model is loaded on a different GPU #23792

sgugger · 2023-05-26T13:33:58Z

What does this PR do?

When a small model is loaded with device_map="auto" it might end up all on GPU 1, so currently is_model_parallel is set to False (cause one device) and later on the Trainer moves the model to GPU 0 which fails the execution of all the Accelerate hooks.

This PR fixes this by making sure is_model_parallel is set to True when there is one device but it's not GPU 0.

HuggingFaceDocBuilderDev · 2023-05-26T13:53:58Z

The documentation is not available anymore as the PR was closed or merged.

younesbelkada

Thanks so much for this! LGTM

Fix Trainer when model is loaded on a different GPU

fb271b3

sgugger requested a review from younesbelkada May 26, 2023 13:33

younesbelkada approved these changes May 31, 2023

View reviewed changes

sgugger merged commit 68d53bc into main May 31, 2023

sgugger deleted the trainer_mp_devices branch May 31, 2023 11:54

sheonhan pushed a commit to sheonhan/transformers that referenced this pull request Jun 1, 2023

Fix Trainer when model is loaded on a different GPU (huggingface#23792)

acb6a14

gojiteji pushed a commit to gojiteji/transformers that referenced this pull request Jun 5, 2023

Fix Trainer when model is loaded on a different GPU (huggingface#23792)

6dffdf5

novice03 pushed a commit to novice03/transformers that referenced this pull request Jun 23, 2023

Fix Trainer when model is loaded on a different GPU (huggingface#23792)

311bfe3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Trainer when model is loaded on a different GPU #23792

Fix Trainer when model is loaded on a different GPU #23792

sgugger commented May 26, 2023

HuggingFaceDocBuilderDev commented May 26, 2023 •

edited

Loading

younesbelkada left a comment

Fix Trainer when model is loaded on a different GPU #23792

Fix Trainer when model is loaded on a different GPU #23792

Conversation

sgugger commented May 26, 2023

What does this PR do?

HuggingFaceDocBuilderDev commented May 26, 2023 • edited Loading

younesbelkada left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented May 26, 2023 •

edited

Loading