Llama2 models not loading (Using main branch) #25388

Closed
dbuos opened this issue Aug 8, 2023 · 6 comments · Fixed by #25407

Comments

@dbuos
Contributor

dbuos commented Aug 8, 2023

System Info

Error when loading the model "meta-llama/Llama-2-7b-chat-hf" using the following code:

from transformers import AutoModelForCausalLM
chk = 'meta-llama/Llama-2-7b-chat-hf'

if __name__ == '__main__':
    model = AutoModelForCausalLM.from_pretrained(chk, load_in_4bit=True, device_map='auto')
    print("Loaded Ok")

The error message was:

`do_sample` is set to `False`. However, temperature is set to 0.9 -- this flag is only used in sample-based generation modes. Set `do_sample=True` or unset temperature to continue.

This happens because the method GenerationConfig.validate() raises a ValueError, and that error is not handled in the modeling_utils.py file.
One possible solution is to add ValueError to the except clause in that file:

(screenshot: proposed change adding ValueError to the except clause in modeling_utils.py)
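For context, here is a minimal sketch of that idea (an illustration of the proposed change, not the actual modeling_utils.py source; the helper name is hypothetical): catch the ValueError alongside the errors already handled, and fall back to a default generation config so loading can continue.

```python
from transformers import GenerationConfig

def load_generation_config_safely(checkpoint: str) -> GenerationConfig:
    """Hypothetical helper illustrating the proposed fix."""
    try:
        # On the affected dev build, an inconsistent generation_config.json
        # makes this call raise ValueError via GenerationConfig.validate().
        return GenerationConfig.from_pretrained(checkpoint)
    except (OSError, TypeError, ValueError):
        # Instead of aborting the model load, fall back to defaults.
        return GenerationConfig()

if __name__ == "__main__":
    cfg = load_generation_config_safely("meta-llama/Llama-2-7b-chat-hf")
    print(cfg)
```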

Who can help?

@gante

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

Using the main branch (installed from source)

from transformers import AutoModelForCausalLM
chk = 'meta-llama/Llama-2-7b-chat-hf'

if __name__ == '__main__':
    model = AutoModelForCausalLM.from_pretrained(chk, load_in_4bit=True, device_map='auto')
    print("Loaded Ok")

Expected behavior

To be able to load the model

@inconnu26

I have the same issue with another model. I think this is urgent to fix.

@Hanzofm

Hanzofm commented Aug 8, 2023

Same here. Meanwhile, downgrading to transformers 4.31 solves the problem.
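For reference, that workaround amounts to pinning the previous release, e.g. `pip install transformers==4.31.0`.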

@dbuos
Contributor Author

dbuos commented Aug 8, 2023

I found this issue on the dev branch (transformers-4.32.0.dev0). By the way, I made a PR that could solve it: #25389

@graeme204

Same here!

@gante
Member

gante commented Aug 9, 2023

Hey everyone 👋 If you're hitting this exception, it means that there is something wrong with your model's config file 💔

Meanwhile, we are deciding internally how to massage this question into a more user-friendly solution.
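As a quick way to see the validation complaint in isolation, one can rebuild the inconsistent flags locally (a sketch using the values from the error message above; the exact behaviour depends on the installed transformers version):

```python
from transformers import GenerationConfig

# The flags reported in the error: sampling disabled, but a sampling-only
# temperature value present.
cfg = GenerationConfig(do_sample=False, temperature=0.9)

# On the affected 4.32.0.dev0 build this raises a ValueError; on other
# versions it only emits a warning.
cfg.validate()
```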

@gante
Member

gante commented Aug 9, 2023

After the PR above gets merged, you will be able to do everything as before.

The only difference is that you will see new warnings related to poor `generate` parameterization (which may come from the generation config file, as in the case of Llama 2) :)
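Once loading works again, one way to address the warning itself is to make the generation config self-consistent, along the lines the error message suggests (a sketch, assuming access to the gated Llama 2 checkpoint and bitsandbytes installed for 4-bit loading):

```python
from transformers import AutoModelForCausalLM

chk = "meta-llama/Llama-2-7b-chat-hf"
model = AutoModelForCausalLM.from_pretrained(chk, load_in_4bit=True, device_map="auto")

# Either enable sampling so that `temperature` is actually used...
model.generation_config.do_sample = True
# ...or reset `temperature` to its neutral default instead:
# model.generation_config.temperature = 1.0

print("Loaded Ok")
```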
