Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Codegen25-7B & Codegen2-4B Fail to download #511

Closed
ghthor opened this issue Oct 5, 2023 · 6 comments
Closed

Codegen25-7B & Codegen2-4B Fail to download #511

ghthor opened this issue Oct 5, 2023 · 6 comments
Labels
bug Something isn't working

Comments

@ghthor
Copy link
Contributor

ghthor commented Oct 5, 2023

Describe the bug

The Codegen25-7B[1] & Codegen2-4B[2] model fail to download using tabby download.

[1] https://huggingface.co/TabbyML/Codegen25-7B
[2] https://huggingface.co/TabbyML/Codegen2-4B

Command being run with error output

✦ ❯ docker run -it -v /opt/tabby/data/:/data --rm tabbyml/tabby download --model TabbyML/Codegen25-7B
2023-10-05T13:03:21.265769Z ERROR tabby::download: crates/tabby/src/download.rs:21: Failed to fetch model 'TabbyML/Codegen25-7B' due to 'Failed to fetch url https://huggingface.co/TabbyML/Codegen25-7B/resolve/main/tabby.json'

Thu Oct  5 09:03:22 2023 exit 1 ❌ ERROR took 13s
✦ ❯ docker run -it -v /opt/tabby/data/:/data --rm tabbyml/tabby download --model TabbyML/Codegen2-4B
2023-10-05T13:04:03.735169Z ERROR tabby::download: crates/tabby/src/download.rs:21: Failed to fetch model 'TabbyML/Codegen2-4B' due to 'Failed to fetch url https://huggingface.co/TabbyML/Codegen2-4B/resolve/main/ggml/q8_0.gguf'

Thu Oct  5 09:04:04 2023 exit 1 ❌ ERROR took 10s

Information about your GPU
Please provide output of nvidia-smi

Additional context

These models are my focus as they have support for more languages then python. I previously downloaded Codegen2-4B successfully and that is the model I'm currently running, but was hoping to try Codegen25-7B.

@ghthor ghthor added the bug Something isn't working label Oct 5, 2023
@ghthor
Copy link
Contributor Author

ghthor commented Oct 5, 2023

Codegen2-4B has by far the best completions for my workload of any of the models I've tried.

@wsxiaoys
Copy link
Member

wsxiaoys commented Oct 5, 2023

May I recommend considering StarCoder-3B and StarCoder-7B from our Models Directory? They are well-maintained, provide infilling support, and have been trained on a considerably larger range of languages (approximately 80).

The failure to download Codegen2-4B seems to be a bug; however, we will address it in an upcoming patch release.

@ghthor
Copy link
Contributor Author

ghthor commented Oct 6, 2023 via email

@ghthor
Copy link
Contributor Author

ghthor commented Oct 10, 2023

I will give StarCoder 7B a longer try, but I wasn’t initially impressed.

Unfortunately I don't have enough GPU Memory to run StarCoder 7B, and I'm getting better completion suggestions with Codegen2-4B vs StarCoder-3B

@wsxiaoys
Copy link
Member

I'm getting better completion suggestions with Codegen2-4B vs StarCoder-3B

Understood. We should enhance the documentation at https://github.com/TabbyML/tabby/blob/main/MODEL_SPEC.md to facilitate the community in creating their own model implementations.

@ghthor
Copy link
Contributor Author

ghthor commented Oct 11, 2023

Understood. We should enhance the documentation at https://github.com/TabbyML/tabby/blob/main/MODEL_SPEC.md to facilitate the community in creating their own model implementations.

That would be really awesome. I've already looked into it some but wasn't able to make any notable progress.

@wsxiaoys wsxiaoys closed this as completed Nov 6, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants