Support for LLaMA #104

ustcwhy · 2023-03-31T05:34:23Z

Thanks for your wonderful work!
Meta recently released its newest LLM, LLaMA. The checkpoints are available on Hugging Face [1], and zphang has contributed code for running LLaMA to the transformers repo [2]. For FlexGen, could I directly replace the OPT model with LLaMA to run inference on a local GPU? Do you have any plans to support LLaMA in the future?

[1] https://huggingface.co/decapoda-research
[2] huggingface/transformers#21955

BarfingLemurs · 2023-04-06T10:39:08Z

Duplicate of #60.

ustcwhy closed this as completed Apr 10, 2023
