-
Notifications
You must be signed in to change notification settings - Fork 145
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
epic: Cortex Model Repo supports Default Model download #1418
Comments
cortex run
vs cortex pull
UX
I agree with I think we can go with option 2 before releasing because it's more future-proof. |
Ill update the 35 cortexso repos with @nguyenhoangthuan99 Can I get help on the recommended branches these 35 models? (listed from https://huggingface.co/cortexso) Local modelscortexso/llama3.2 (3b-gguf-q4-km) Future updates to default (when CI run to update branches)cortexso/tinyllama (1b-gguf) (future 1b-gguf-q4-ks) Remote models (no need for metadata.yml as no gguf file)cortexso/NVIDIA-NIM (remote model, we don't have gguf file for this model) |
QN: is it |
@gabrielle-ong, I agree with If possible, please start with |
Thanks @namchuai and @nguyenhoangthuan99! added to llama3.2 and working down the list |
Created all the metadata.yml files in the list |
Thanks @james and @nguyenhoangthuan99! |
Goal
cortex pull <model>
andcortex run <model>
High-level Structure
cortex run <model>:<branch>
pulls a specific versioncortex run <model>
pulls a default versionDecisions
Decision 1: Default Model Download
We need to figure out which model version
cortex pull <model>
andcortex run
model` will pull.Option 1:
main
branchmain
branch to hold our recommended version3b-gguf
intomain
branchHowever, I do not think this is correct long term:
main
branch is non-descriptive as a branch namemain
could hold3b-gguf
, but user is unaware3b-gguf
main
branch requires more work to manage longer-termmain
branch will take some time - i.e. merge, git issues etcIt is also incorrect to compare our approach to Ollama. Ollama uses a tag-based system similar to Docker, where
latest
is a pointer to3b
. It is difficult to replicate this in a "straightforward UX" in Git (i.e. tags are not very visible from main page)Option 2:
main
branch hasmetadata.yaml
metadata.yaml
approach to defining Default Model downloadsmetadata.yaml
is also used to generate the CLI UX forcortex pull
orcortex run
How it works
main
branch will hold a few files (see below)metadata.yaml
will be very simple# File system metadata.yaml README.md
# metadata.yaml version: 1 name: mistral default: 3b-gguf
In the future,
metadata.yaml
can be more complicated, and allow for fine-grained control of CLI UX, e.g. sections for 3b, 7b, or by engine.Furthermore, we can use
metadata.yaml
as a data structure to hold information about the different Model versions.The text was updated successfully, but these errors were encountered: