
Question/Answer with xgen #16

Open · tonyphoang opened this issue Jul 18, 2023 · 0 comments
Hi all,

I've been able to get XGen-7B to work with sentence completion on the GPU, but I cannot get it to work with question answering.

The code I'm using is below:

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

torch.cuda.empty_cache()

# Use the GPU when available, otherwise fall back to CPU.
device = "cuda" if torch.cuda.is_available() else "cpu"
model_name = 'Salesforce/xgen-7b-8k-base'

tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_name,
                                             torch_dtype=torch.bfloat16,
                                             device_map="auto")

# System-style header prepended to the question.
header = (
    "You are an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the human's questions.\n\n"
)
prompt = "Who is the president of the USA?"

inputs = tokenizer(header + prompt, return_tensors="pt").to(device)
sample = model.generate(**inputs, max_length=128, do_sample=True, top_k=100, eos_token_id=50256)
print(tokenizer.decode(sample[0]))
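In case it helps, here is a minimal sketch of what I understand the question/answer setup to look like, assuming the instruction-tuned checkpoint Salesforce/xgen-7b-8k-inst and the "### Human: ... ### Assistant:" turn markers suggested by the XGen README (the base checkpoint is a plain language model, so it continues text rather than answering; the checkpoint choice and prompt format here are assumptions, not a confirmed fix):

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

# Assumption: the instruction-tuned checkpoint rather than the base model.
model_name = 'Salesforce/xgen-7b-8k-inst'

tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_name,
                                             torch_dtype=torch.bfloat16,
                                             device_map="auto")

header = (
    "A chat between a curious human and an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the human's questions.\n\n"
)
# Assumed chat-style turn markers; generation should stop at the next "###".
prompt = "### Human: Who is the president of the USA?\n### Assistant:"

inputs = tokenizer(header + prompt, return_tensors="pt").to(model.device)
sample = model.generate(**inputs, max_new_tokens=128, do_sample=True, top_k=100, eos_token_id=50256)
print(tokenizer.decode(sample[0], skip_special_tokens=True))

With device_map="auto", moving the inputs to model.device keeps them on the same device as the model's first layer instead of hardcoding 'cuda'.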

