Fix issue when generating distros #755
Conversation
(force-pushed from 7186407 to 3d45a03)
@dineshyv could you approve this PR? I am not sure if the changes to the
@@ -95,62 +95,20 @@ metadata_store:
     db_path: ${env.SQLITE_STORE_DIR:~/.llama/distributions/fireworks}/registry.db
   models:
   - metadata: {}
-    model_id: meta-llama/Llama-3.1-8B-Instruct
+    model_id: ${env.INFERENCE_MODEL}
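For context, the `${env.VAR:default}` placeholders in these run.yaml files resolve to the environment variable when it is set and fall back to the default otherwise. A minimal sketch of how such substitution could work (illustrative only; this is not the actual llama-stack resolver, and `resolve_env` is a hypothetical helper name):

```python
import os
import re

# Matches "${env.VAR}" or "${env.VAR:default}" placeholders, mimicking the
# syntax seen in llama-stack run.yaml files. Sketch only, not the real
# llama-stack implementation.
_ENV_PATTERN = re.compile(r"\$\{env\.([A-Za-z_][A-Za-z0-9_]*)(?::([^}]*))?\}")


def resolve_env(value: str) -> str:
    """Replace env placeholders in a string with environment values or defaults."""

    def _sub(match: re.Match) -> str:
        name, default = match.group(1), match.group(2)
        if name in os.environ:
            return os.environ[name]
        if default is not None:
            return default
        raise KeyError(f"environment variable {name} is not set and has no default")

    return _ENV_PATTERN.sub(_sub, value)
```

With `SQLITE_STORE_DIR` unset, `resolve_env("${env.SQLITE_STORE_DIR:~/.llama/distributions/fireworks}/registry.db")` would yield the default path, while `${env.INFERENCE_MODEL}` (no default) requires the variable to be set, which is why the change above makes the template depend on the caller's environment.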
Hmm, this run-with-safety template shouldn't be changed.
cc @vladimirivic Could you help take a look? I think this is introduced incorrectly in
llama_stack/templates/fireworks/fireworks.py, lines 123 to 140 at 91907b7:
"run-with-safety.yaml": RunConfigSettings(
    provider_overrides={
        "inference": [
            inference_provider,
            embedding_provider,
        ],
        "memory": [memory_provider],
        "safety": [
            Provider(
                provider_id="llama-guard",
                provider_type="inline::llama-guard",
                config={},
            ),
            Provider(
                provider_id="code-scanner",
                provider_type="inline::code-scanner",
                config={},
            ),
It should follow something like together's run-with-safety template: https://github.com/meta-llama/llama-stack/blob/91907b714e825a1bfbca5271e0f403aab5f10752/llama_stack/templates/together/together.py#L120C32-L141
I rebased and re-generated the files again. Not sure if it incorporated the recent fixes correctly.
I think it pulled changes in #766.
The fireworks templates have been fixed in #766. But the vllm run.yaml without tool_runtime still suggests you might have an older version. Could you help double-check?
Looks like there were indeed some issues with my local Python env. I think I fixed them now and just pushed an update.
Signed-off-by: Yuan Tang <[email protected]>
(force-pushed from 3d45a03 to 7c72682)
Signed-off-by: Yuan Tang <[email protected]>
LG!
Addressed comment #723 (comment).
cc @yanxi0830
I am not 100% sure the diff is correct, but this is the result of running `python llama_stack/scripts/distro_codegen.py`.