
Model wishlist #217
Closed · 5 tasks done
nsarrazin opened this issue Apr 24, 2023 · 17 comments

Comments

@nsarrazin (Member) commented Apr 24, 2023

Hey everyone!

Just opening an issue to track which models people would like to see supported with Serge.

  • Alpaca 7B, 13B & 30B
  • Alpaca 7B-Native
  • gpt4all
  • Vicuna 7B & 13B
  • Open Assistant 13B & 30B

Are there any others you would like to see?

@kolabearafk

How about WizardLM (https://github.com/nlpxucan/WizardLM)? Could this new model be supported in Serge, please? I hear it's very fast and generates good results.

@CrazyBonze

I was going to say WizardLM; it's been getting some good reviews.

@OrcVole commented May 5, 2023

https://huggingface.co/reeducator/vicuna-13b-free/discussions

It tries to reduce censorship.

@fishscene

Google PaLM looks interesting.
https://9to5google.com/2023/05/10/google-palm-2/

However, as far as I can tell, only the framework is open source and might be available; I can't find any info on the training data. I'm also very new to AI, so I may be looking in the wrong places for the model data Serge would need to use it.

@rendel commented May 11, 2023

MPT-7B: https://github.com/mosaicml/llm-foundry

@noproto (Contributor) commented May 25, 2023

I'd really like to see support for Guanaco models.
Edit: PR #334

@Betanu701

From my testing, the Q4_0 and Q8_0 quantizations work best/fastest; the K-quant varieties take too long with CPU only.
Honestly, I think it's a toss-up between Q4 and Q8; both seem to run about the same within a margin of error.
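For anyone who wants to reproduce this comparison on their own machine, here is a minimal timing sketch. It assumes the llama-cpp-python bindings (not necessarily how Serge invokes llama.cpp), and the model paths, thread count, and prompt are placeholders:

```python
# Rough timing sketch, assuming llama-cpp-python is installed and two
# quantizations of the same model exist at these (placeholder) paths.
import time
from llama_cpp import Llama

MODELS = {
    "Q4_0": "weights/model-q4_0.bin",
    "Q8_0": "weights/model-q8_0.bin",
}
PROMPT = "Explain model quantization in one sentence."

for name, path in MODELS.items():
    llm = Llama(model_path=path, n_ctx=512, n_threads=8, verbose=False)
    start = time.perf_counter()
    out = llm(PROMPT, max_tokens=64)
    elapsed = time.perf_counter() - start
    tokens = out["usage"]["completion_tokens"]
    print(f"{name}: {tokens} tokens in {elapsed:.1f}s ({tokens / elapsed:.2f} tok/s)")
```

Numbers will vary a lot with CPU, thread count, and context size, so treat any single run as a rough indication only.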

@specked commented Jul 8, 2023

CodeGen2.5

@kagrith commented Jul 12, 2023

> From my testing, the Q4_0 and Q8_0 quantizations work best/fastest; the K-quant varieties take too long with CPU only. Honestly, I think it's a toss-up between Q4 and Q8; both seem to run about the same within a margin of error.

I get this odd error when trying to run Wizard.

llama.cpp: loading model from /usr/src/app/weights/Wizard-30B-Q4_1.bin
error loading model: unknown (magic, version) combination: 67676a74, 00000003; is this really a GGML file?
llama_init_from_file: failed to load model

Did you get this? How did you resolve it?
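For what it's worth, that magic value 0x67676a74 is the ASCII string "ggjt" (the mmap-able llama.cpp container), so this error typically indicates a format-version mismatch between the weights file and the llama.cpp build loading it. Here is a generic sketch for inspecting the header yourself; the path is simply the one from the error message above:

```python
# Generic header check for a llama.cpp weights file; prints the 4-byte magic
# and, for versioned containers, the container version.
import struct

path = "/usr/src/app/weights/Wizard-30B-Q4_1.bin"

with open(path, "rb") as f:
    magic, version = struct.unpack("<II", f.read(8))

# 0x67676a74 renders as "ggjt" when printed big-endian; other llama.cpp-era
# magics include "ggml" (unversioned) and "ggmf".
print(f"magic   = 0x{magic:08x} ({magic.to_bytes(4, 'big').decode(errors='replace')})")
print(f"version = {version}")
```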

@tonyhardcode

WizardCoder would be nice to have!

https://huggingface.co/WizardLM/WizardCoder-15B-V1.0

@laurentgoncalves

Very interested in the just-released Code Llama: https://ai.meta.com/blog/code-llama-large-language-model-coding/

@gaby (Member) commented Nov 17, 2023

Fixed via #866

@gaby closed this as completed Nov 17, 2023
@serge-chat locked as resolved and limited conversation to collaborators Nov 17, 2023