
Does not operate on Llama-3 or Phi-3. #7

Closed
ymuichiro opened this issue Apr 29, 2024 · 3 comments

Comments

@ymuichiro

Thank you for the wonderful project. I was able to run it by building locally and adding the coi-serviceworker mentioned in README.md and #5.

However, when using a different model, the following happens:

  1. The model download starts.
  2. The download completes.
  3. The app freezes with no log output.

Is there a limitation, for example, that models above a certain size are not supported?

Example model

const models = [
  // works
  "https://huggingface.co/stabilityai/stablelm-2-zephyr-1_6b/resolve/main/stablelm-2-zephyr-1_6b-Q4_1.gguf",
  // fails
  "https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-gguf/resolve/main/Phi-3-mini-4k-instruct-q4.gguf", // 2.32 GB
  // fails
  "https://huggingface.co/QuantFactory/Meta-Llama-3-8B-GGUF/resolve/main/Meta-Llama-3-8B.Q4_K_M.gguf", // 4.92 GB
];
@flatsiedatsie

The maximum file size is 2 GB. Have a look at Wllama for a way around this.

@ymuichiro
Author

@flatsiedatsie
Thank you! I was able to split the model. However, a similar problem has come up elsewhere, so I will follow up in a new issue.

split-tool
https://github.com/ggerganov/llama.cpp/tree/master/examples/gguf-split
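For reference, a hedged sketch of splitting a model with llama.cpp's `gguf-split` example so each shard stays under the 2 GB cap (the exact flag names and output naming may differ between llama.cpp versions — check `gguf-split --help`):

```shell
# Split a large GGUF into shards below the 2 GB browser limit.
./gguf-split --split --split-max-size 2G \
    Meta-Llama-3-8B.Q4_K_M.gguf \
    Meta-Llama-3-8B.Q4_K_M

# This produces numbered shards such as
#   Meta-Llama-3-8B.Q4_K_M-00001-of-00003.gguf
# which can be merged back with the same tool:
./gguf-split --merge \
    Meta-Llama-3-8B.Q4_K_M-00001-of-00003.gguf \
    Meta-Llama-3-8B.Q4_K_M-merged.gguf
```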

new issue
ngxson/wllama#12

@flatsiedatsie

Let me guess: memory issues :-)
