
epic: llama.cpp is installed by default #1217

Closed · 2 tasks done
dan-menlo opened this issue Sep 15, 2024 · 5 comments

@dan-menlo (Contributor) commented Sep 15, 2024:

Goal

Cortex.cpp should have a super easy UX that is on par with market alternatives.

  • Users should have a 1-click installer that prioritizes simple UX over installer size
    • The installer packages (or downloads at install time) the llama.cpp binaries (e.g. up to 1 GB)
    • The installer optimizes for "universal" installs (i.e. it downloads all variants, then deletes the unnecessary files afterwards)
    • e.g. the Mac universal installer includes llama.cpp builds for both Intel and Apple Silicon
    • e.g. the Windows + Nvidia universal installer includes llama.cpp builds for both CUDA versions
  • For this epic, I am open to either:

Idea

I wonder whether the solution here is an optional local lookup as part of cortex engines install (see the sketch after this list):

  • The installer can look in its own folder to see whether dependencies are available, and only pull from the remote if needed
  • We do not need to make any changes to the installer (it still just runs cortex engines install)
  • This approach is elegant, and gives us flexibility in packaging
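
A minimal sketch of that lookup, under stated assumptions: the deps/ folder layout, the archive file name, and the main() wiring below are all hypothetical illustrations, not the actual cortex.cpp implementation.

```cpp
// Illustrative only: check the installer's own folder for a bundled engine
// archive before falling back to the existing remote download path.
#include <filesystem>
#include <iostream>
#include <string>

namespace fs = std::filesystem;

// Returns the path of a locally bundled engine archive, or an empty path.
fs::path FindLocalEngine(const fs::path& installer_dir,
                         const std::string& archive_name) {
  const fs::path candidate = installer_dir / "deps" / archive_name;
  return fs::exists(candidate) ? candidate : fs::path{};
}

int main() {
  // Hypothetical archive name; the real name would come from the engine
  // manifest that `cortex engines install` already resolves.
  const std::string archive = "cortex.llamacpp-windows-amd64-cuda-12-0.tar.gz";

  if (const fs::path local = FindLocalEngine(fs::current_path(), archive);
      !local.empty()) {
    std::cout << "Installing engine from bundled archive: " << local << "\n";
    // ... extract into the engines folder ...
  } else {
    std::cout << "No bundled archive; downloading " << archive << "\n";
    // ... existing remote download path ...
  }
  return 0;
}
```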

Out-of-scope (future)

  • We should offer a "cortex-alpine" installer with a minimal file size
    • Targeted at embedded use cases, and at people who want to use ONNX or TensorRT-LLM without llama.cpp
    • Users will have to download engines as a post-install step
  • We should offer "universal" installers that pre-package all potential dependencies
    • e.g. a large installer, but one that packages all dependencies for offline install

Outcomes

  • The Cortex.cpp installer should install llama.cpp by default
  • The Cortex.cpp installer should install the correct version of llama.cpp for the hardware (see the sketch below)
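
As a rough illustration of the second outcome, hardware-based variant selection might look like the sketch below. The probes and variant names are assumptions (they mirror the naming style of llama.cpp build variants), not the installer's actual logic:

```cpp
// Illustrative hardware probes for picking a llama.cpp build variant.
#include <cstdlib>
#include <iostream>
#include <string>

// Assumption: a usable NVIDIA driver can be approximated by `nvidia-smi`
// exiting successfully.
bool HasNvidiaGpu() {
  return std::system("nvidia-smi >/dev/null 2>&1") == 0;
}

// Placeholder: this reflects the flags this binary was compiled with, not
// the host CPU; a real probe would use cpuid (x86) or sysctl (macOS),
// and would default to the Apple Silicon build on arm64 Macs.
bool CpuSupportsAvx2() {
#if defined(__AVX2__)
  return true;
#else
  return false;
#endif
}

int main() {
  const std::string variant = HasNvidiaGpu()      ? "cuda"
                              : CpuSupportsAvx2() ? "avx2"
                                                  : "noavx";
  std::cout << "Selected llama.cpp variant: " << variant << "\n";
  return 0;
}
```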

Key Questions

  • Should we align with llama.cpp's versions? (e.g. Vulkan, SYCL)

Appendix

Why?

Our current cortex.cpp v0.1 onboarding UX is not user-friendly:

  • llama.cpp only seems to be downloaded on the first run of Cortex (at least on Windows)
    • The download UX is poor (no progress indicator)
    • The download is often slow, or drops entirely


  • Very often, the llama.cpp engine download fails outright, resulting in a bad UX
    • "Engine not loaded yet"


@dan-menlo dan-menlo added this to Menlo Sep 15, 2024
@dan-menlo dan-menlo converted this from a draft issue Sep 15, 2024
@dan-menlo dan-menlo changed the title epic: Cortex.cpp is installed with llama.cpp by default epic: llama.cpp is installed by default Sep 15, 2024
@dan-menlo dan-menlo assigned namchuai and unassigned vansangpfiev Sep 15, 2024
@dan-menlo (Contributor, Author) commented:

@hiento09 has worked on a PR that moves the llama.cpp engine install into the installer, but I am concerned that it is still not a great UX:

https://github.com/janhq/cortex.cpp/pull/1219/files

@namchuai (Collaborator) commented:

Just saw this today. "Engine not loaded yet" does not mean the engine has not been downloaded; there may be a problem with the engine-loading logic.

@freelerobot (Contributor) commented:

QA Updates (v75)

  • ✅ Works on Windows
  • ✅ Works on Mac

@vansangpfiev (Contributor) commented:

We download the CUDA dependencies that the NVIDIA driver supports.
Once #1085 has been resolved, we should update the CUDA-dependency logic for the installer (see the sketch after this list):

  • cortex searches a local folder for the CUDA 11.7/12.0 dependencies for the llamacpp engine
  • if found, it unzips the CUDA dependencies into the installation folder
  • if not, it downloads CUDA 11.7/12.0 from the jan host

cc: @hiento09 @dan-homebrew
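
A sketch of that flow, combining the driver check with the local-first lookup. The driver-to-CUDA cutoffs follow NVIDIA's published compatibility table for Linux drivers; the paths, archive names, and download step are assumptions, not the installer's actual code:

```cpp
// Illustrative CUDA-dependency step for the installer.
#include <filesystem>
#include <iostream>
#include <string>

namespace fs = std::filesystem;

// CUDA 12.0 needs driver >= 525.60.13; CUDA 11.7 needs >= 515.43.04 (Linux
// numbers from NVIDIA's release notes). Returns "" if neither is supported.
std::string CudaForDriver(int driver_major) {
  if (driver_major >= 525) return "12.0";
  if (driver_major >= 515) return "11.7";
  return "";
}

int main() {
  // Assumption: the driver version was parsed elsewhere, e.g. from
  // `nvidia-smi --query-gpu=driver_version --format=csv,noheader`.
  const int driver_major = 535;
  const std::string cuda = CudaForDriver(driver_major);
  if (cuda.empty()) {
    std::cout << "Driver too old for CUDA 11.7/12.0; skipping CUDA deps\n";
    return 0;
  }

  // Hypothetical archive name and folder layout.
  const std::string archive = "cuda-" + cuda + "-linux-amd64.tar.gz";
  const fs::path local = fs::current_path() / archive;
  if (fs::exists(local)) {
    std::cout << "Unzipping bundled " << archive
              << " into the installation folder\n";
    // ... extract next to the llamacpp engine ...
  } else {
    std::cout << "Downloading " << archive << " from the jan host\n";
    // ... remote fetch ...
  }
  return 0;
}
```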

@dan-menlo dan-menlo moved this from In Review to Review + QA in Menlo Sep 29, 2024
@gabrielle-ong (Contributor) commented:

QA v123
✅ Mac
✅ Windows
✅ Linux

