correct template for Nemo model? #106

alist · 2024-12-01T01:09:04Z

Hi @guinmoon, thanks a ton for building this App. I am looking to run a 12B q4 GGUF Mistral-Nemo model on an 8Gb iPad (I have a 16gb on its way..), inference occurs but it seems like the decoding is failing since the tokens are not as expected....

Any ideas for settings or things I could be doing wrong? I really recommend this model, would love to get it working. Thank you.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

correct template for Nemo model? #106

correct template for Nemo model? #106

alist commented Dec 1, 2024

correct template for Nemo model? #106

correct template for Nemo model? #106

Comments

alist commented Dec 1, 2024