fix: pull ollama embedding model if necessary #1209

ashwinb · 2025-02-21T18:28:25Z

Embedding models are tiny and can be pulled on-demand. Let's do that so the user doesn't have to do "yet another thing" to get themselves set up.

Thanks @hardikjshah for the suggestion.

Also fixed a build dependency miss (TODO: distro_codegen needs to actually check that the build template contains all providers mentioned for the run.yaml file)

Test Plan

First run ollama rm all-minilm:latest.

Run llama stack build --template ollama && llama stack run ollama --env INFERENCE_MODEL=llama3.2:3b-instruct-fp16. See that it outputs a "Pulling embedding model all-minilm:latest" output and the stack starts up correctly. Verify that ollama list shows the model is correctly downloaded.

Pull ollama embedding model if necessary

ae1bcb9

ashwinb requested review from yanxi0830, hardikjshah, dltn, raghotham, dineshyv, vladimirivic, sixianyi0721, ehhuang and terrytangyuan as code owners February 21, 2025 18:28

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Feb 21, 2025

ashwinb changed the title ~~Pull ollama embedding model if necessary~~ fix: pull ollama embedding model if necessary Feb 21, 2025

update the template properly

ec8abe1

raghotham approved these changes Feb 21, 2025

View reviewed changes

ashwinb merged commit 11697f8 into main Feb 21, 2025
4 checks passed

ashwinb deleted the ollama branch February 21, 2025 18:35

hardikjshah approved these changes Feb 21, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: pull ollama embedding model if necessary #1209

fix: pull ollama embedding model if necessary #1209

ashwinb commented Feb 21, 2025

fix: pull ollama embedding model if necessary #1209

fix: pull ollama embedding model if necessary #1209

Conversation

ashwinb commented Feb 21, 2025

Test Plan