
bug: Model failed to load with status code: 500, could not load engine llamacpp #1422

Closed
dan-menlo opened this issue Oct 4, 2024 · 6 comments

dan-menlo (Contributor) commented Oct 4, 2024

Goal

  • A lot of useful error messages are currently not bubbled up to the CLI
  • We should surface as many of these error messages as possible, so users can file more informative bug reports or understand what is going on

Current State

Running llama3.1 failed with status code 500:

(screenshot: CLI output showing the 500 error)

The real error message is in the logs:

(screenshot: log output containing the actual error)

@dan-menlo dan-menlo added this to Menlo Oct 4, 2024
@dan-menlo dan-menlo converted this from a draft issue Oct 4, 2024
gabrielle-ong (Contributor) commented Oct 4, 2024

v154 – also getting this error (screenshot omitted)

Workaround

cortex engines install llama-cpp

This reinstalls the llama-cpp engine, which should have been pre-installed (screenshot omitted).
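Putting the thread's commands together, a minimal sketch of the workaround, assuming the `cortex` CLI is installed and on PATH (the model name is just the one used in Hien's repro later in this thread):

```
# Reinstall the engine that should have shipped with the CLI
cortex engines install llama-cpp

# Retry loading a model to confirm the engine now resolves
cortex chat tinyllama:1b-gguf
```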

cc @hiento09 - discord discussion

@gabrielle-ong gabrielle-ong changed the title epic: cortex should bubble up better error messages bug: Model failed to load with status code: 500, could not load engine llamacpp Oct 5, 2024
gabrielle-ong (Contributor) commented Oct 5, 2024

Should this be solved by #1369?

gabrielle-ong (Contributor) commented:

Copying Hien's post from #1396:

I just tried a scenario as follows and encountered a similar issue to what you mentioned:

  • My machine is an Apple Silicon Mac.
  • I downloaded version 152, but the mac amd64 build. It still installed because my machine has Rosetta, but the llamacpp engine it installed was the amd64 variant, which cannot run on Apple Silicon.
  • I ran cortex update to download the version 154 universal build and tried running cortex chat tinyllama:1b-gguf.

@gabrielle-ong gabrielle-ong moved this from Planning to Investigating in Menlo Oct 16, 2024
gabrielle-ong (Contributor) commented:

Proposal: Investigating -> Feature (high pain) -> Eng Planning

gabrielle-ong (Contributor) commented Oct 17, 2024

Decision from Dan / Nicole:

  • new Epic: bubble up error messages from Engines, highlight important error messages
  • Status: planning, Eng Spec: Planning, Sprints: 24/25

gabrielle-ong (Contributor) commented:

Closing this bug as resolved; created a separate discussion (#1514) for bubbling up error logs.

@github-project-automation github-project-automation bot moved this from Investigating to Review + QA in Menlo Oct 17, 2024
@gabrielle-ong gabrielle-ong moved this from Review + QA to Completed in Menlo Oct 17, 2024
@gabrielle-ong gabrielle-ong added this to the v1.0.1 milestone Oct 17, 2024