[fix] Change the condition of ValueError in "convert_checkpoint_from_transformers_to_megatron" #24769

SeongBeomLEE · 2023-07-12T08:04:49Z

The "target_tensor_model_parallel_size" is related to "num_attention_heads", and the "target_pipeline_model_parallel_size" is related to "num_hidden_layers".

However, the old code had "target_tensor_model_parallel_size" related to "num_hidden_layers".

So we modified the code and added the part about "target_tensor_model_parallel_size".

Thanks!

norm_factor is still torch.float32 after using model.half So I changed it to register_buffer so I can change it to torch.float16 after using model.half

convert_checkpoint_from_transformers_to_megatron

layers -> attention heads

amyeroberts · 2023-07-12T08:53:06Z

cc @pacman100

pacman100

Nice catch! thank you for the fix

amyeroberts

Thanks for fixing!

HuggingFaceDocBuilderDev · 2023-07-13T10:05:05Z

The documentation is not available anymore as the PR was closed or merged.

…transformers_to_megatron" (huggingface#24769) * fix: half inference error norm_factor is still torch.float32 after using model.half So I changed it to register_buffer so I can change it to torch.float16 after using model.half * fix: Added a variable "persistent=False" * run make style * [fix] Change the condition of ValueError convert_checkpoint_from_transformers_to_megatron * [fix] error wording layers -> attention heads

SeongBeomLEE added 6 commits April 20, 2023 18:49

fix: half inference error

a3817ea

norm_factor is still torch.float32 after using model.half So I changed it to register_buffer so I can change it to torch.float16 after using model.half

fix: Added a variable "persistent=False"

c636e24

run make style

d7e6826

Merge branch 'huggingface:main' into main

255d80a

[fix] Change the condition of ValueError

b238a7d

convert_checkpoint_from_transformers_to_megatron

[fix] error wording

6bef000

layers -> attention heads

pacman100 approved these changes Jul 13, 2023

View reviewed changes

amyeroberts approved these changes Jul 13, 2023

View reviewed changes

amyeroberts merged commit 21946a8 into huggingface:main Jul 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[fix] Change the condition of ValueError in "convert_checkpoint_from_transformers_to_megatron" #24769

[fix] Change the condition of ValueError in "convert_checkpoint_from_transformers_to_megatron" #24769

SeongBeomLEE commented Jul 12, 2023

amyeroberts commented Jul 12, 2023

pacman100 left a comment

amyeroberts left a comment

HuggingFaceDocBuilderDev commented Jul 13, 2023 •

edited

Loading

[fix] Change the condition of ValueError in "convert_checkpoint_from_transformers_to_megatron" #24769

[fix] Change the condition of ValueError in "convert_checkpoint_from_transformers_to_megatron" #24769

Conversation

SeongBeomLEE commented Jul 12, 2023

amyeroberts commented Jul 12, 2023

pacman100 left a comment

Choose a reason for hiding this comment

amyeroberts left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Jul 13, 2023 • edited Loading

HuggingFaceDocBuilderDev commented Jul 13, 2023 •

edited

Loading