feat: add yayi2-30b example #208

mzbac · 2023-12-31T14:55:38Z

Yayi2-30b has a score of over 80 MMLU. I performed some fine-tuning on it using qlora, and from my quick test, it appears to be very promising. so, I created the mlx example for this model in case anyone wants to run it via mlx. However, this model has an unusual k,v layer that causes the quantization to fail. Currently, I haven't found any quantization tools that support this model (except for bnb nf4). It would be great if mlx could provide support for its quantization.

FYI:
https://huggingface.co/wenge-research/yayi2-30b
https://huggingface.co/mzbac/yayi2-30b-guanaco
ml-explore/mlx#328

mzbac · 2024-01-01T03:59:43Z

I have added the workaround for the quant as suggested in ml-explore/mlx#328. Now, the example works with 4-bit quantization. Once the PR gets merged, I will upload the 4-bit quantized model.

demo.mov

awni · 2024-01-10T04:19:52Z

HI @mzbac sorry for the delayed review here. Do you still want to merge this? I think given the non-standard size it wouldn't fit easily in our hf_llm example, but wdyt?

mzbac · 2024-01-10T04:23:37Z

I think this should be supported by hf_llm once we fix the quant non-32 dimension. I'm happy to close this one. Meanwhile, if people want to try this model in f16 precision, they should be able to run it via hf_llm.

awni · 2024-01-10T04:24:50Z

Sounds good, thank you!

mzbac added 3 commits January 1, 2024 01:49

feat: add yayi2-30b example

f4e38db

remove unused num_key_value_heads

b763ad3

chore: add workaround for quant

d81dcad

awni closed this Jan 10, 2024

mzbac deleted the yayi branch January 13, 2024 05:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add yayi2-30b example #208

feat: add yayi2-30b example #208

mzbac commented Dec 31, 2023 •

edited

Loading

mzbac commented Jan 1, 2024 •

edited

Loading

awni commented Jan 10, 2024

mzbac commented Jan 10, 2024

awni commented Jan 10, 2024

feat: add yayi2-30b example #208

feat: add yayi2-30b example #208

Conversation

mzbac commented Dec 31, 2023 • edited Loading

mzbac commented Jan 1, 2024 • edited Loading

awni commented Jan 10, 2024

mzbac commented Jan 10, 2024

awni commented Jan 10, 2024

mzbac commented Dec 31, 2023 •

edited

Loading

mzbac commented Jan 1, 2024 •

edited

Loading