What changes should I make to apply the SPIN method on Llama2? #20

Labmem009 · 2024-02-29T11:41:57Z

I want to apply SPIN method on llama2 with alpaca-like finetuning datasets. What changes should I make to apply the SPIN method?
Thanks a lot!

TanZhendong · 2024-10-22T12:35:15Z

Hello. I've tried to apply SPIN on llama-2 with the ultrachat200k datasets. It seems that the tokenizer for llama2 and zephyr-7b are different (as shown in this document). As a result, the code in spin/run_spin.py likely needs to be modified (which is primarily focused on the apply_chat_template function). By the way, for alpaca-like datasets, the structure of the dataset is different from ultrachat200k (the key is "instruction" and "output" rather than "prompt" and "messages"). I believe just changing the code in spin/reformat.py to ensure the structure of the dataset is the same should be ok.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What changes should I make to apply the SPIN method on Llama2? #20

What changes should I make to apply the SPIN method on Llama2? #20

Labmem009 commented Feb 29, 2024

TanZhendong commented Oct 22, 2024

What changes should I make to apply the SPIN method on Llama2? #20

What changes should I make to apply the SPIN method on Llama2? #20

Comments

Labmem009 commented Feb 29, 2024

TanZhendong commented Oct 22, 2024