
ITREX needs modification for the new llama3 prompt format #1507

Open

redhairerINTEL opened this issue Apr 23, 2024 · 3 comments
@redhairerINTEL

New prompt format for llama3
https://llama.meta.com/docs/model-cards-and-prompt-formats/meta-llama-3/
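
For reference, the format documented at that link wraps each turn in header tokens and terminates it with <|eot_id|> (placeholders shown in braces):

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{{ system_prompt }}<|eot_id|><|start_header_id|>user<|end_header_id|>

{{ user_message }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>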

@kta-intel
Contributor

@kevinintel

@a32543254
Contributor

a32543254 commented Apr 25, 2024

Here is sample code if you want to use the llama3 template:
all you need is to apply the chat template when building input_ids.

from transformers import AutoTokenizer, TextStreamer
from intel_extension_for_transformers.transformers import AutoModelForCausalLM

model_name = "meta-llama/Meta-Llama-3-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
streamer = TextStreamer(tokenizer)
# load_in_4bit=True applies ITREX weight-only 4-bit quantization at load time
model = AutoModelForCausalLM.from_pretrained(model_name, load_in_4bit=True)

messages = [
    {"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak!"},
    {"role": "user", "content": "Who are you?"},
]

# apply_chat_template renders the llama3 special-token format
# (<|start_header_id|> ... <|eot_id|>) defined in the model's tokenizer config
input_ids = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(input_ids, streamer=streamer)

We will also add this to the docs soon.
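
Editor's note, not part of the original comment: llama3 ends assistant turns with <|eot_id|> rather than the tokenizer's default eos token, so if generation runs past the end of the reply, passing both terminators may help. This is a sketch following Meta's reference example and assumes ITREX's generate accepts the same eos_token_id keyword as transformers' generate:

# <|eot_id|> is llama3's end-of-turn token; stop on either it or eos
terminators = [
    tokenizer.eos_token_id,
    tokenizer.convert_tokens_to_ids("<|eot_id|>"),
]
outputs = model.generate(input_ids, streamer=streamer, eos_token_id=terminators)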

@N3RDIUM

N3RDIUM commented Apr 30, 2024

(quoting @a32543254's sample code above in full)

This gives me AssertionError: Fail to convert pytorch model
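
Editor's note: the assertion message suggests the failure happens while converting the model for the 4-bit runtime, not in the template step. One way to confirm the template side works on its own (a tokenizer-only sketch, not a fix for the conversion error) is to render the prompt as text:

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")
messages = [
    {"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak!"},
    {"role": "user", "content": "Who are you?"},
]
# tokenize=False returns the rendered llama3 prompt string so you can
# inspect the <|start_header_id|>/<|eot_id|> structure directly
print(tokenizer.apply_chat_template(messages, add_generation_prompt=True, tokenize=False))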
