
MPT models on the Hub not working with transformers main #703

Closed
younesbelkada opened this issue Oct 30, 2023 · 5 comments · Fixed by #704
Labels
bug Something isn't working

Comments


younesbelkada commented Oct 30, 2023

Hi there!

Currently, with transformers main, loading MPT models from the Hub fails because the remote modeling code tries to import a private method (such as `_expand_mask`) that was recently removed: huggingface/transformers#27086

The simple loading script below should work:

from accelerate import init_empty_weights
from transformers import AutoModelForCausalLM, AutoConfig

model_id = "mosaicml/mpt-7b"
config = AutoConfig.from_pretrained(
    model_id, trust_remote_code=True
)
# Instantiate on the meta device so no weights are allocated;
# the failure happens while importing the remote modeling code.
with init_empty_weights():
    model = AutoModelForCausalLM.from_config(
        config, trust_remote_code=True
    )
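For context, one common way for remote modeling code to stay compatible when a private transformers helper like `_expand_mask` is removed is to guard the import and fall back to a local copy. The sketch below is only illustrative (the actual foundry fix is in #704, not necessarily this approach), and the import path shown is just an example; the fallback mirrors the behavior of the old transformers helper.

import torch

try:
    # Older transformers versions still expose the private helper.
    from transformers.models.llama.modeling_llama import _expand_mask
except ImportError:
    # Fallback: local re-implementation of the removed helper.
    def _expand_mask(mask: torch.Tensor, dtype: torch.dtype, tgt_len: int = None):
        # Expand a [bsz, src_len] padding mask to [bsz, 1, tgt_len, src_len],
        # with 0 where attention is allowed and dtype.min where it is masked.
        bsz, src_len = mask.size()
        tgt_len = tgt_len if tgt_len is not None else src_len
        expanded = mask[:, None, None, :].expand(bsz, 1, tgt_len, src_len).to(dtype)
        inverted = 1.0 - expanded
        return inverted.masked_fill(inverted.to(torch.bool), torch.finfo(dtype).min)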
@dakinggg
Collaborator

Thanks for letting us know, Younes. Will look into this ASAP.

@younesbelkada
Author

Thanks @dakinggg !

@dakinggg
Collaborator

@younesbelkada this should be resolved in the foundry code now, and I'm uploading the updated code to the hf hub as we speak.

@dakinggg
Collaborator

Ok, this should be resolved completely now. Let me know if you see otherwise! Thanks again for the report :)

@younesbelkada
Author

Works like a charm now! Thanks for the quick fix @dakinggg
